Xapian


Infobox Software
name = Xapian
developer = xapian.org
latest_release_version = 1.0.8
latest_release_date = 2008-09-04
operating_system = Cross-platform
genre = Search and index API
license = GNU General Public License
website = [http://xapian.org/ xapian.org]

Xapian is an open source probabilistic information retrieval library, released under the GNU General Public License (GPL). In practical terms, it is a full-text search engine library for programmers.

It is written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C#, and Ruby. Xapian is highly portable and runs on Linux, Mac OS X, FreeBSD, NetBSD, OpenBSD, Solaris, HP-UX, Tru64, IRIX, and Microsoft Windows.

Xapian is designed as a highly adaptable toolkit that lets developers add advanced indexing and search facilities to their own applications.

Organisations and projects known to use Xapian include Orange, Gmane, Die Zeit, Delicious, and One Laptop per Child. [http://xapian.org/users.php Users of Xapian]

Features

* Transactions: if a database update fails in the middle of a transaction, the database is guaranteed to remain in a consistent state.
* Simultaneous search and update, with new documents being immediately visible.
* Support for large databases: Xapian has been shown to scale to hundreds of millions of documents.
* Accurate probabilistic ranking: more relevant documents are listed first.
* Phrase and proximity searching.
* Relevance feedback, which can improve ranking, expand a query, find related documents, and categorise documents.
* Structured Boolean queries, e.g. "race AND condition NOT horse"
* Wildcard search, e.g. "wiki*"
* Spelling correction
* Omega, a packaged solution for adding a search engine to a web site or intranet. Omega can easily be extended and adapted to fit changing requirements.
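
As a rough illustration of the Boolean and wildcard features listed above, the following sketch evaluates the two example queries over a toy inverted index. It is plain Python and does not use Xapian's API; the corpus, the `build_index` and `wildcard` helpers, and the whitespace tokenisation are all assumptions made for illustration, and a real engine would also apply stemming, ranking, and on-disk storage.

```python
from collections import defaultdict

# Toy corpus; the document texts are made up for illustration.
DOCS = {
    1: "a race condition in the scheduler",
    2: "the horse won the race in record condition",
    3: "wiki pages and wikis need full text search",
    4: "condition of the race track",
}


def build_index(docs):
    """Map each lowercase term to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def wildcard(index, pattern):
    """Expand a trailing-* pattern such as 'wiki*' to all matching documents."""
    prefix = pattern.rstrip("*")
    matches = set()
    for term, ids in index.items():
        if term.startswith(prefix):
            matches |= ids
    return matches


index = build_index(DOCS)

# "race AND condition NOT horse": intersect the first two posting sets,
# then subtract the documents that mention "horse".
boolean_hits = (index["race"] & index["condition"]) - index["horse"]

# "wiki*": union of the posting sets of every term starting with "wiki".
wildcard_hits = wildcard(index, "wiki*")

print(sorted(boolean_hits))   # [1, 4]
print(sorted(wildcard_hits))  # [3]
```

Document 2 contains both "race" and "condition" but is excluded by the NOT clause, while the wildcard matches both "wiki" and "wikis" in document 3.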

External links

* http://www.xapian.org is the Xapian project website.
* [http://www.oligarchy.co.uk/xapian.html Oligarchy Ltd.] and [http://www.lemurconsulting.com/Products/Xapian/Overview.shtml Lemur Consulting Ltd.] offer commercial support, consultancy and bespoke development for Xapian.
* [http://code.google.com/p/xappy/ Xappy] is a set of feature-rich Python bindings for Xapian.
* [http://code.google.com/p/flaxcode/ Flax] is a project to build a complete open-source enterprise search engine based on Xapian.
* [http://www.recoll.org Recoll] is a desktop search tool based on Xapian.


Wikimedia Foundation. 2010.
