DocSearcher is a document search tool. It uses the Open Source
Apache Lucene, Apache POI and Apache PDF Box
software to provide searching capabilities for
HTML,
MS Word (including docx),
MS Excel (including xlsx),
MS Powerpoint (including pptx),
RTF,
PDF,
LibreOffice, OpenOffice, StarOffice,
OpenDocument
and
text documents.
No other file format are currently supported.
To perform a search you simply type in the keywords or phrases you are looking for
and click the search button.
The search results will be presented in a table.
You can click on files as you would in a standard web browser.
There are 4 types of searches you can perform with DocSearcher:
Keyword, Phrase, Boolean, and Wild Card.
The example queries below describe each of these type:
Example search | What they do |
---|---|
summer vacation -with "keyword" as "type of search" |
Finds documents with the keywords "summer" and "vacation" This type of search is called a keyword search. |
summer vacation -with the "phrase" as "type of search" |
Finds documents with the phrase "summer vacation" This type of search is called a phrase search. |
"john smith" -"john brown" |
Finds documents with the "john smith" but excludes document with "john brown" This type of search is called a boolean search. |
virtu* |
Finds documents with words that start with "virtu...". For example virtuous, virtue, virtuosity, etc... This type of search is called a wild card search. |
For more information about Lucene you can go to the Lucene website:
http://lucene.apache.org/.
For more information and support you can go to the DocSearcher website:
http://www.docsearcher.de.