Questions tagged [search-engine]

A search engine is program that searches documents for specified keywords and returns a list of the documents where the keywords were found.

A search engine is a program that searches documents for specified keywords and returns a list of the documents where the keywords were found.

Although search engine is really a general class of programs, the term is often used to specifically describe systems like Google, Yahoo!, Yandex and Excite that enable users to search for documents on the World Wide Web and USENET newsgroups.

2893 questions
32
votes
5 answers

Improving search result using Levenshtein distance in Java

I have following working Java code for searching for a word against a list of words and it works perfectly and as expected: public class Levenshtein { private int[][] wordMartix; public Set similarExists(String searchWord) { int…
30
votes
7 answers

Search engine solution for Django that actually works?

The story so far: Decided to go with Xapian as search backend because it has all search-engine features I was looking for, knows about Unicode, stemming, has few dependencies and requires no bloated app-server installation on top of it. Tried Django…
nikola
  • 2,151
  • 3
  • 26
  • 42
29
votes
9 answers

Which are the best alternatives to Lucene?

It may run on Unix and it will be used for email searching (Dovecot, Postfix and maildir). Lucene is not a problem, I'm just analyzing some alternatives.
Rui Carneiro
  • 5,299
  • 4
  • 31
  • 38
29
votes
2 answers

Elasticsearch - How to normalize score when combining regular query and function_score?

Idealy what I am trying to achieve is to assign weights to queries such that query1 constitutes 30% of the final score and query2 consitutes other 70%, so to achieve the maximum score a document has to have highest possible score on query1 and…
JohnnyM
  • 1,123
  • 1
  • 11
  • 25
27
votes
4 answers

Use of indexes for multi-word queries in full-text search (e.g. web search)

I understand that a fundamental aspect of full-text search is the use of inverted indexes. So, with an inverted index a one-word query becomes trivial to answer. Assuming the index is structured like this: some-word -> [doc385, doc211, doc39977,…
26
votes
4 answers

Marking up a search result list with HTML5 semantics

Making a search result list (like in Google) is not very hard, if you just need something that works. Now, however, I want to do it with perfection, using the benefits of HTML5 semantics. The goal is to define the defacto way of marking up a search…
Johan
  • 4,848
  • 3
  • 32
  • 48
25
votes
3 answers

Connect to SphinxQL through Linux command-line

I am trying to connect to SphinxQL server through Linux command-line this way: > mysql -P 9306 ERROR 1045 (28000): Access denied for user 'root'@'localhost' (using password: NO) My Sphinx config file has 2 listen entries: listen =…
snippetsofcode
  • 867
  • 2
  • 9
  • 10
24
votes
3 answers

List of JSON search engine APIs without quotas, like Bing?

I'd like to display some custom search results. I've looked at the JSON APIs of both Google and Microsoft (Bing). Unfortunately, Google has a limit on the amount of queries a day ($50 for a maximum of ten thousand queries). However, Bing allows an…
Tom
  • 8,437
  • 26
  • 122
  • 217
23
votes
9 answers

Google-like Search Engine in PHP/mySQL

We have OCRed thousands of pages of newspaper articles. The newspaper, issue, date, page number and OCRed text of each page has been put into a mySQL database. We now want to build a Google-like search engine in PHP to find the pages given a query.…
lkessler
  • 19,414
  • 31
  • 125
  • 196
22
votes
6 answers

Is there any free unlimited album artwork search API service?

Google's custom search API has a limitation up to 100 queries per day. That is far less than what I expected. I want to add that artwork-search function to my app. Thanks a lot.
Li Fumin
  • 1,195
  • 1
  • 11
  • 31
22
votes
5 answers

Can search engine spiders see content I add using jQuery?

I currently have something like this

Will search engines be able to spider the "hey" text? and if yes, what…
Matthew Hui
  • 3,241
  • 2
  • 23
  • 37
22
votes
11 answers

What's a good source code search engine?

The codebase I work on is huge, and grepping it takes about 20 minutes. I'm looking for a good web-based source code search engine.. something like an intranet version of koders.com. The only thing I've found is Krugle Enterprise Edition, which…
toohool
  • 1,007
  • 1
  • 9
  • 13
21
votes
10 answers

What are some Search Servers out there?

I'm looking to find alternatives to Solr from the Apache Software Foundation. For those that don't know, Solr is an enterprise search server. A client application uses a web-services like interface to submit documents for indexing and also to…
bpapa
  • 21,107
  • 23
  • 95
  • 147
21
votes
13 answers

How would you design a good search UI?

I want to provide my users with an 'advanced' search engine. I basically have a lot of search criteria to chose from : some are very simple/common and will be largely use (ie time period, item id) some are a bit less mainstream and some won't be…
Brann
  • 29,395
  • 31
  • 107
  • 159
21
votes
5 answers

How do I do a partial field match using Haystack?

I needed a simple search tool for my django-powered web site, so I went with Haystack and Solr. I have set everything up correctly and can find the correct search results when I type in the exact phrase, but I can't get any results when typing in a…
Ben S
  • 1,227
  • 1
  • 12
  • 25