Jump to content

Xapian

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by JorgePeixoto (talk | contribs) at 16:08, 31 March 2018 (Added Cite web template for references; updated latest release, language bindings, supported platforms and users; added sources; disambiguated Erlang; fixed wikilink to search engine; added Unicode 9.0 support to the Features section; other changes). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Xapian
Developer(s)xapian.org
Stable release
1.4.5[1] / October 16, 2017; 7 years ago (2017-10-16)
Repository
Written inC++
Operating systemCross-platform
TypeSearch and index API.
LicenseGNU General Public License
Websitexapian.org

Xapian is an Free and open source probabilistic information retrieval library, released under the GNU General Public License (GPL).[2] It is a full text search engine library for programmers.

It is written in C++, with bindings to allow use from Perl, Python (2 and 3), PHP (5 and 7), Java, Tcl, C#, Ruby, Lua, Erlang, Node.js and R[3][4]. Xapian is highly portable and runs on Linux, OS X, FreeBSD, NetBSD, OpenBSD, Solaris, HP-UX, AIX, Windows, OS/2[5][2] and Hurd, [6][7] as well as Tru64.[citation needed]

Xapian allows developers to add advanced indexing and search facilities to their own applications. Organisations and projects using Xapian include the Library of the University of Cologne, Debian, Die Zeit, MoinMoin, and One Laptop per Child.[8]

Features[2]

  • Supports Unicode 9.0 (including codepoints beyond the BMP) and stores indexed text in UTF-8.
  • Transactions: if database update fails in the middle of a transaction, the database is guaranteed to remain in a consistent state.
  • Simultaneous search and update, with new documents being immediately visible.
  • Support for large databases: Xapian has been proven to scale to hundreds of millions of documents.
  • Accurate probabilistic ranking: more relevant documents are listed first.
  • Phrase and proximity searching.
  • Relevance feedback, which improves ranking and can expand a query, find related documents, categorise documents etc.
  • Structured Boolean queries, e.g. "race AND condition NOT horse"
  • Wildcard search, e.g. "wiki*"
  • Spelling correction
  • Synonyms
  • Omega, a packaged solution for adding a search engine to a web site or intranet. Omega can easily be extended and adapted to fit changing requirements.

See also

References

  1. ^ https://xapian.org/docs/xapian-core-1.4.5/NEWS (accessed on 2018-03-31)
  2. ^ a b c "The Xapian Project : Features". Retrieved 2018-03-31.
  3. ^ "The Xapian Project". Retrieved 2018-03-31.
  4. ^ RXapian
  5. ^ "Version 1.0.6 of Xapian and Omega". freshmeat.net. Archived from the original on 2011-06-12. Retrieved 2018-03-31.
  6. ^ Debian built success report for GNU Hurd[permanent dead link]
  7. ^ "Debian -- Package Search Results -- xapian". Retrieved 2018-03-31.
  8. ^ "Xapian Users". Retrieved 2018-03-31.