Search engines work with words, but some nouns are made up of several words. New York is one noun, not two words. Given a list of such nouns, a search engine can handle them well, and Wikipedia can help.
Tag - lucene
Looking for New York, the shingle way
By Mathieu Lecarme on Sunday, 6 April 2008, 14:17 - Search engine
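To make the shingle idea concrete, here is a minimal sketch in plain Python. It does not use Lucene: it only shows how runs of consecutive words (shingles) can be checked against a list of known compound nouns. The names `shingles`, `compound_terms` and `COMPOUND_NOUNS` are invented for this illustration, and the one-entry noun list stands in for a real list, which might be drawn from Wikipedia article titles.

```python
def shingles(tokens, max_size=2):
    """Return every run of 1..max_size consecutive tokens, joined by a space."""
    out = []
    for size in range(1, max_size + 1):
        for start in range(len(tokens) - size + 1):
            out.append(" ".join(tokens[start:start + size]))
    return out


# Hypothetical noun list, for illustration only.
COMPOUND_NOUNS = {"new york"}


def compound_terms(text, nouns=COMPOUND_NOUNS, max_size=2):
    """Return the shingles of `text` that appear in the noun list."""
    tokens = text.lower().split()
    return [s for s in shingles(tokens, max_size) if s in nouns]
```

Calling `compound_terms("Looking for New York")` keeps "new york" as a single term, which an indexer could then store next to the individual words.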
Indexing an MP3 database with Python and Lucene
By Mathieu Lecarme on Saturday, 15 March 2008, 14:11 - Computing
MP3 players use a database to manage thousands of songs. Here is a Python test that indexes the XML dump of common MP3 players (Rhythmbox and iTunes) into a Lucene index, via Goniometre, a Passerelle project.
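As a rough idea of the first half of that job, the sketch below reads a Rhythmbox-style XML dump with the standard library and turns each song into a dictionary of fields. It is not the code from the post and it does not call Goniometre or Lucene. The element names (`entry`, `title`, `artist`, `album`) are an assumption about the dump layout, the sample data is made up, and `songs_from_dump` is a name chosen for this example.

```python
import xml.etree.ElementTree as ET

# Made-up sample; the layout is an assumption about the Rhythmbox dump.
SAMPLE_DUMP = """<rhythmdb version="1.3">
  <entry type="song">
    <title>So What</title>
    <artist>Miles Davis</artist>
    <album>Kind of Blue</album>
  </entry>
  <entry type="song">
    <title>Blue in Green</title>
    <artist>Miles Davis</artist>
    <album>Kind of Blue</album>
  </entry>
</rhythmdb>"""

FIELDS = ("title", "artist", "album")


def songs_from_dump(xml_text):
    """Return one dict per song entry, with the fields worth indexing."""
    root = ET.fromstring(xml_text)
    songs = []
    for entry in root.iter("entry"):
        if entry.get("type") != "song":
            continue  # skip anything that is not marked as a song
        songs.append({f: (entry.findtext(f) or "").strip() for f in FIELDS})
    return songs
```

Each dictionary returned by `songs_from_dump(SAMPLE_DUMP)` is the kind of record that would then be handed to the indexer, one document per song.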
Using Compass without getting your hands dirty with Java
By Mathieu Lecarme on Tuesday, 11 March 2008, 20:35 - Search engine
A lexicon approach for the Lucene full-text search engine
By Mathieu Lecarme on Friday, 7 March 2008, 23:24 - Search engine
Lucene uses an index to find documents from their words. Storing more information with each word, i.e. building a lexicon, can extend Lucene search and help with query refinement.
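One way to picture such a lexicon is sketched below in plain Python, outside Lucene. It assumes the extra information kept per word is simply the number of documents containing it, and it uses the standard `difflib` module to propose close spellings when a query word is unknown. `build_lexicon` and `refine` are hypothetical names, and this is one possible reading of the idea rather than the approach of the post.

```python
from collections import Counter
from difflib import get_close_matches


def build_lexicon(documents):
    """Map each word to the number of documents that contain it."""
    lexicon = Counter()
    for text in documents:
        # set() so that a word is counted once per document
        lexicon.update(set(text.lower().split()))
    return lexicon


def refine(word, lexicon, limit=3):
    """Return the word itself if known, otherwise close spellings from the lexicon."""
    word = word.lower()
    if word in lexicon:
        return [word]
    return get_close_matches(word, list(lexicon), n=limit, cutoff=0.6)
```

With a lexicon built from "lucene index" and "lucene search", a mistyped query word such as "lucen" is refined to "lucene", and the stored document counts could be used to rank several candidates.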


