In search engines, when you submit a search string, the search engine first
tokenizes and then searches for each substring string separately and then
brings them together as your found set. So if I search for 1961 Chateau
Lafite, I'll get items with 1961, others with Chateau or Chateu Lafite, and
on top will be the found records with 1961 Chateau Lafite (I know, if you
put it in quotes, it forces it to find only the whole string. That part is
easy)

They will also rank a find of the full set of terms above ones with one or
two terms in the documents.

Questions:

What's the approach with witango that will enable the search of tokenized
strings. 

Any ideas on how to do a crude ranking, such that the full term comes up on
top of the found set?


________________________________________________________________________
TO UNSUBSCRIBE: Go to http://www.witango.com/developer/maillist.taf

Reply via email to