DocFetcher

DocFetcher
Developer(s)	DocFetcher project
Stable release	1.1.22 / Jul, 30, 2018
Written in	Java
Operating system	MS Windows, Mac OS X, Linux
License	Eclipse Public License
Website	https://sourceforge.net/projects/docfetcher/

DocFetcher is an open source desktop search application. It is written in Java and runs on Windows, Mac OS X and Linux.^[1] The application has a graphical user interface, which is written using the Standard Widget Toolkits.^[2] Indexing and search are based on Apache Lucene,^[2] a widely used open source search engine.

Features

Unicode support
Full text search for all major document file formats, including:
- Office files (Microsoft Office, OpenOffice, Outlook (PST), ...)
- EPUB, PDF
- RTF, SVG and any other plain text files
- Audio metadata (MP3, FLAC)
- Picture metadata (JPEG)
- Archive formats (ZIP, 7z, RAR, Tar). Also supports nested archive files
- HTML with pair detection. Which means that DocFetcher detects when an HTML file and a folder containing the resource files (Images, Scripts, ...) of the page belong together. (These resource files are usually downloaded when saving a Website)
Possibility to automatically detect file changes and update the index accordingly
Exclusion of files from indexing based on regular expressions
A query language supporting boolean operators (OR, AND, NOT), wildcards, phrase search, fuzzy search and proximity search
Translations in Chinese, Italian, Ukrainian. Partly translated to French, Japanese, Spanish, and German.^[3]

^ DocFetcher homepage, retrieved 2016-12-04
^ ^a ^b "start - DocFetcher-Wiki". docfetcher.sourceforge.net. Retrieved 2020-05-18.
^ "DocFetcher translation on Transifex". www.transifex.com. Retrieved 2020-02-11.{{cite web}}: CS1 maint: url-status (link)