1

I am looking for a search engine that finds documents from my local database on the basis of a search query. The process does not involve any web pages. They include only a set of text documents(pdf and doc files). Please suggest a few efficient search engines for this process.

siddharth
  • 153
  • 9

3 Answers3

1

I would try Solr first (it is built on top of Lucene). Comes with a nice web based admin tool also.

Andy
  • 328
  • 3
  • 13
  • great. my requirement would just be to enter a query and obtain the best set of documents from a local database with a good accuracy. – siddharth Jan 02 '12 at 04:40
  • Try Ambar http://ambar.rdseventeen.com/, it's very easy to use and have simple REST API – Ilia P Jan 20 '17 at 09:09
1

Tools

  1. Solr : Powerful text search and dynamic clustering. Try the example docs present in it.

  2. Mahout : Popular clustering algos are already present in it

  3. Lucene : Full-featured text search engine library

In my opinion Solr will be the best option for Search purpose.

Debaditya
  • 2,223
  • 1
  • 22
  • 45
0

I would suggest Lucene /Lucene.Net, however alternatives can be found here and here.

Community
  • 1
  • 1
Kane
  • 15,657
  • 11
  • 56
  • 82
  • thanks. But i would like to know which of these is the most efficient? can you help me out with that ? – siddharth Jan 02 '12 at 04:28
  • Without knowing your requirements it will be impossible to answer which is the most efficient / best as each solution has its specific pros and cons. – Kane Jan 02 '12 at 04:30
  • my database will have around 200 text documents (pdfs mostly). And my task involves using a reference document to generate many queries and then use these queries to search for the document that matches the most with the reference document. So i will be having quite a lot of queries and I am expecting atleast an average accuracy. – siddharth Jan 02 '12 at 04:33