Article snapshot taken from Wikipedia with creative commons attribution-sharealike license.
Give it a read and then ask your questions in the chat.
We can research this topic together.
The program is an indexing search tool, meaning it has a local database of file content that it checks, rather than looking over all files on your machine. This means the program must always be running to monitor changes, but search results are instant. Search tools are based on Apache Lucene software, a widely-used, open source search engine.
Features
Unicode support
Full text search for all major document file formats, including:
HTML with pair detection. Which means that DocFetcher detects when an HTML file and a folder containing the resource files (Images, Scripts, ...) of the page belong together. (These resource files are usually downloaded when saving a website)
Possibility to automatically detect file changes and update the index accordingly