Questions tagged [lucene]

The term Lucene refers to the open source Java fulltext search engine library, but also to the entire eco-system that grew around it, including lucene.net, solr, elasticsearch and zend-search-lucene.

The term "Lucene" refers to the open source Java fulltext search engine library, and also to the entire eco-system that grew around it, including , , and . "Lucene" may also be used to refer to top-level projects like Nutch and Tika which were once sub-projects of Lucene.

Use the "Lucene" tag if either:

  • The question is about the Java library
  • The question is about a port of the library, but would make sense to people who know the Java library (many Lucene.NET questions match this criteria).
  • The question is so general it doesn't apply to a specific implementation (example).

References:

Basic Demo:

A basic "getting started" demo showing how to build and query an index is provided as part of the official documentation:

Basic Demo documentation - (this link is for Lucene v8.7.0. Newer versions may be available)

Links to the demo's source files are provided in the above documentation.

The source code can also be found here on GitHub.

Luke - a Lucene GUI Client:

Luke is a GUI client application which can be used to explore your Lucene indexes. Recent versions of Luke are now provided as part of each binary release, which can be downloaded from here.

After downloading the binary release, unzip it, and go to the luke directory. Launch the client using the provided luke.bat or luke.sh scripts.

11633 questions
745
votes
12 answers

Solr vs. ElasticSearch

What are the core architectural differences between these technologies? Also, what use cases are generally more appropriate for each?
Ben ODay
  • 19,926
  • 7
  • 41
  • 67
440
votes
9 answers

ElasticSearch, Sphinx, Lucene, Solr, Xapian. Which fits for which usage?

I'm currently looking at other search methods rather than having a huge SQL query. I saw elasticsearch recently and played with whoosh (a Python implementation of a search engine). Can you give reasons for your choice(s)?
dzen
  • 6,771
  • 5
  • 24
  • 31
318
votes
9 answers

Comparison of full text search engine - Lucene, Sphinx, Postgresql, MySQL?

I'm building a Django site and I am looking for a search engine. A few candidates: Lucene/Lucene with Compass/Solr Sphinx Postgresql built-in full text search MySQl built-in full text search Selection criteria: result relevance and…
Continuation
  • 11,664
  • 19
  • 77
  • 102
282
votes
10 answers

NoSQL (MongoDB) vs Lucene (or Solr) as your database

With the NoSQL movement growing based on document-based databases, I've looked at MongoDB lately. I have noticed a striking similarity with how to treat items as "Documents", just like Lucene does (and users of Solr). So, the question: Why would…
eduncan911
  • 16,051
  • 11
  • 61
  • 99
192
votes
5 answers

Choosing a stand-alone full-text search server: Sphinx or SOLR?

I'm looking for a stand-alone full-text search server with the following properties: Must operate as a stand-alone server that can serve search requests from multiple clients Must be able to do "bulk indexing" by indexing the result of an SQL…
knorv
  • 45,461
  • 71
  • 205
  • 289
171
votes
7 answers

Difference between solr and lucene

I know that Lucene and Solr are 2 differents Apache projects that are made to work together, but I don't understand what is the aim of each project. For what I understood until now is that Lucene is used to create a search index and Solr use this…
darkheir
  • 8,317
  • 6
  • 43
  • 64
168
votes
4 answers

What is the difference between Lucene and Elasticsearch

I know ElasticSearch is built upon Apache Lucene but I want to know the significant differences between the two.
Satish Madiwal
  • 2,048
  • 2
  • 17
  • 19
120
votes
8 answers

Elasticsearch vs Cassandra vs Elasticsearch with Cassandra

I am learning NoSQL and looking at different options for one of my client's requirements. I have gone through various resources before putting up this question (a person with little knowledge in NoSQL) I need to store data at faster rate and read…
Reddy
  • 16,522
  • 43
  • 127
  • 190
116
votes
7 answers

How to query SOLR for empty fields?

I have a large solr index, and I have noticed some fields are not updated correctly (the index is dynamic). This has resulted in some fields having an empty "id" field. I have tried these queries, but they didn't work: id:'' id:NULL id:null …
user188962
108
votes
2 answers

Comparison of Lucene Analyzers

Can someone please explain the difference between the different analyzers within Lucene? I am getting a maxClauseCount exception and I understand that I can avoid this by using a KeywordAnalyzer but I don't want to change from the StandardAnalyzer…
Journeyman
  • 9,339
  • 14
  • 73
  • 121
104
votes
4 answers

How does lucene index documents?

I read some document about Lucene; also I read the document in this link (http://lucene.sourceforge.net/talks/pisa). I don't really understand how Lucene indexes documents and don't understand which algorithms Lucene uses for indexing? On the above…
Mahdi Amrollahi
  • 2,602
  • 5
  • 23
  • 34
90
votes
4 answers

How does Lucene work

I would like to find out how lucene search works so fast. I can't find any useful docs on the web. If you have anything (short of lucene source code) to read, let me know. A text search query using mysql5 text search with index takes about 18…
Midhat
  • 16,422
  • 17
  • 84
  • 113
84
votes
7 answers

using OR and NOT in solr query

I'm working on a solr query similar to the following: ((myField:superneat AND myOtherField:somethingElse) OR NOT myField:superneat) When running this, no results are returned. Using criteria on either side of the OR NOT returns results that I'd…
stolenricecakes
  • 943
  • 1
  • 7
  • 7
78
votes
2 answers

What does percolator mean/do in elasticsearch?

Even though I read the documentation for Elasticsearch to understand what a percolator is. I still have difficulty understanding what it means and where it is used in simple terms. Can anyone provide me with more details?
Hossein
  • 33,283
  • 53
  • 126
  • 172
75
votes
4 answers

How to get a Token from a Lucene TokenStream?

I'm trying to use Apache Lucene for tokenizing, and I am baffled at the process to obtain Tokens from a TokenStream. The worst part is that I'm looking at the comments in the JavaDocs that address my…
Eric Wilson
  • 51,818
  • 71
  • 192
  • 262
1
2 3
99 100