Differences between revisions 9 and 10
Revision 9 as of 2008-03-10 17:48:40
Size: 1257
Editor: mpiat1403
Comment: Full text search definition
Revision 10 as of 2008-03-10 17:56:38
Size: 1319
Editor: mpiat1403
Comment: Stop word definition

This page is a glossary of the most important search engine terms.

A

  • Analyzer: An analyzer is a component that preprocesses input text at index time, at search time, or both. The analyzers used at index time and at query time should be the same, or should at least process text in a compatible manner. For example, if the indexing analyzer lowercases words, the query analyzer should do the same so that the indexed words can be found.
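
The lowercasing example can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names analyze, index, and search are hypothetical and are not taken from CompleteSearch or any other search engine.

```python
import re

def analyze(text):
    """Hypothetical analyzer: lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

# Index time: map each token to the set of documents that contain it.
documents = {1: "The Quick Brown Fox", 2: "A quick reply"}
index = {}
for doc_id, text in documents.items():
    for token in analyze(text):
        index.setdefault(token, set()).add(doc_id)

def search(query):
    """Query time: run the query through the same analyzer before the lookup."""
    token_sets = [index.get(token, set()) for token in analyze(query)]
    return set.intersection(*token_sets) if token_sets else set()
```

Because both sides use analyze, a query such as search("QUICK") finds documents that were indexed under "quick". Looking up the raw string "Quick" in index would find nothing, which is the mismatch the entry warns about.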

F

  • Full text search: In a full text search, the search engine examines all of the words in every stored document as it tries to match search words supplied by the user.

  • Free text:

P

  • Protected word: A word that is not modified by any stemming transformation.

S

  • Stemming: An algorithm that reduces the various forms of a word, such as "runs", "running", and "ran", to a common root ("run"). The term also covers the inverse operation, which takes a root word and expands it to all of its various forms.

  • Stop word: A word that is discarded as a token in indexing and querying.
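
As a rough sketch of how stemming, protected words, and stop words can interact, consider the following Python fragment. The word lists and the suffix rules are assumptions chosen for illustration. The stemmer is a toy, not a real stemming algorithm: it only strips "ing" and "s", and it does not handle irregular forms such as "ran", which would need a dictionary.

```python
STOP_WORDS = {"a", "the", "of", "is"}   # illustrative list, an assumption
PROTECTED_WORDS = {"news"}              # illustrative list, an assumption

def stem(token):
    """Toy suffix-stripping stemmer; protected words are returned unchanged."""
    if token in PROTECTED_WORDS:
        return token
    for suffix in ("ing", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            root = token[: -len(suffix)]
            # Undo consonant doubling after "ing", e.g. "runn" -> "run".
            if suffix == "ing" and len(root) >= 2 and root[-1] == root[-2]:
                root = root[:-1]
            return root
    return token

def normalize(tokens):
    """Discard stop words, then stem the remaining tokens."""
    return [stem(token) for token in tokens if token not in STOP_WORDS]
```

With these lists, normalize(["the", "news", "is", "running"]) yields ["news", "run"]: "the" and "is" are discarded as stop words, "news" is protected and therefore not reduced to "new", and "running" is stemmed to "run".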

T

  • Token: An analyzer splits an input text into a series of tokens. A token is a substring of the input text that is indexed or queried as a unit and is not split any further.
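
To make the "substring of the input text" point concrete, here is a minimal Python sketch of a tokenizer that also reports where each token was found. The function name tokenize and the choice of word characters as the splitting rule are assumptions; real analyzers may split text differently.

```python
import re

def tokenize(text):
    """Return (token, start, end) triples; each token equals text[start:end]."""
    return [(m.group(), m.start(), m.end()) for m in re.finditer(r"\w+", text)]
```

For the input "full-text search", this returns the tokens "full", "text", and "search" with the offsets (0, 4), (5, 9), and (10, 16), so every token can be traced back to its position in the original text.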

CompleteSearch: completesearch/Glossary (last edited 2008-09-29 15:49:39 by mpiat1403)