Recoll

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Recoll
Developer(s) Jean-François Dockes
Stable release 1.19.12 / April 2, 2014; 4 months ago (2014-04-02)[1]
Development status Active
Written in C++ and Python
Operating system Unix-like
Type Search tool
License GPL
Website www.recoll.org

Recoll is a desktop search tool that provides efficient full text search (from single-word to arbitrarily complex boolean searches) in a friendly GUI, with minimum technical sophistication and few mandatory external dependencies. It runs under many Unix-like operating systems, and is mostly independent of the desktop environment.

Recoll was designed not to require a permanent daemon but on Linux systems Recoll can make use of inotify. Recoll updates its index at designed intervals (for example through Cron tasks) but if desired, the indexing task can run as a file-system monitoring daemon for real-time index updates.[2]

The Recoll document conversion and text extraction architecture makes it extremely easy to write new filters,[3] and many document types are supported.

Features[edit]

  • Qt GUI.
  • Xapian backend.
  • Indexes the contents of many document types: text, HTML, E-Mail stores of all kinds, OpenOffice.org, Microsoft Office and Office Open XML, AbiWord, KWord, Gaim, Lyx, Scribus, PDF, WordPerfect, PostScript, RTF, TeX, DVI, DjVu, MP3 and other audio file formats, JPEG and other image file formats.[4]
  • Recursively processes embedded documents (E-Mail attachments, Zip archives) to arbitrary depths.
  • Powerful query facilities, with boolean searches, wildcards, phrases, proximity, filter on file types and directory tree. GUI Boolean search build tool.
  • Xesam query language support
  • Word stemming is performed at query time (can switch stemming language after indexing).
  • Multiple indexes selectable at query time (i.e.: personal + system indexes).
  • Natively based on Unicode. Supports many languages and input character sets, including good support for east Asian texts (CJK).
  • MD5 document hashes for the elimination of duplicates in result lists.
  • Batch and real-time indexing modes.
  • Python API.
  • Kicker (KDE) applet for easy launching.
  • Easy installation. No database daemon, web server or exotic language necessary.

See also[edit]

References[edit]

External links[edit]