From Wikipedia, the free encyclopedia
DocFetcher is an open source desktop search application. It is written in Java and runs on Windows , Mac OS X and Linux .[ 1] The application has a graphical user interface , which is written using the Standard Widget Toolkits .[ 2] Indexing and search are based on Apache Lucene ,[ 2] a widely used open source search engine.
Features
Unicode support
Full text search for all major document file formats, including:
Office files (Microsoft Office , OpenOffice , Outlook (PST ), ...)
EPUB , PDF
RTF , SVG and any other plain text files
Audio metadata (MP3 , FLAC )
Picture metadata (JPEG )
Archive formats (ZIP , 7z , RAR , Tar ). Also supports nested archive files
HTML with pair detection. Which means that DocFetcher detects when an HTML file and a folder containing the resource files (Images, Scripts, ...) of the page belong together. (These resource files are usually downloaded when saving a Website)
Possibility to automatically detect file changes and update the index accordingly
Exclusion of files from indexing based on regular expressions
A query language supporting boolean operators (OR
, AND
, NOT
), wildcards , phrase search , fuzzy search and proximity search
Translations in Chinese, Italian, Ukrainian. Partly translated to French, Japanese, Spanish, and German.[ 3]
See also
References
External links