To Jerome,

> Just a silly question: Do you build your index with the analyzers turned on?
> (Was the document's language guessed correctly, and was the corresponding
> analyzer called?)

Yes, I build the index with the analyzers enabled.
For example, the page contains the following words (some in several
forms; the text is in Russian):

 - fish (different forms),
 - sea,
 - mission (only in main form),
 - electricity,
 - aquarium (different forms),
 - lighting (different forms).

 1) With stemming

 - fish (main form and other forms) - found (stemming works)
 - sea - not found
 - mission (only in main form) - not found
 - electricity - not found
 - aquarium - found only when the query contains the main form of the word
 - lighting (in different forms) - found only when the query contains the
   main form of the word

 2) Without stemming (queries contain the exact form of the word that
    appears on the page)

 - fish (main form and other forms) - found
 - sea - found
 - mission - found
 - electricity - found
 - aquarium - found
 - lighting - found
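
To make the comparison easier to reproduce, here is a minimal sketch of the kind of check involved. Everything in it is an assumption for illustration: `toy_stem` is a hypothetical suffix stripper on English stand-in words, not the Russian analyzer used by Nutch, and `build_index` / `found` are made-up helpers, not Nutch APIs. It only shows that if the index is stemmed but the query is not, a word is found only when the query happens to equal the stem, which resembles the "aquarium" and "lighting" cases above. It does not explain the words that are never found.

```python
# Hypothetical toy stemmer: strips a couple of English plural suffixes.
# It stands in for the real language analyzer, which is not shown here.
SUFFIXES = ("es", "s")


def toy_stem(word):
    """Lowercase the word and strip the first matching suffix, if any."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def identity(word):
    """Analyzer that only lowercases (no stemming)."""
    return word.lower()


def build_index(words, analyze):
    """Index = set of terms produced by the index-time analyzer."""
    return {analyze(w) for w in words}


def found(index, query, analyze):
    """A query matches if its query-time term is present in the index."""
    return analyze(query) in index


# English stand-ins for the inflected forms that appear on the page.
page_words = ["fishes", "sea", "mission", "aquariums"]
stemmed_index = build_index(page_words, toy_stem)

for query in ["fishes", "fish", "aquariums", "aquarium"]:
    both = found(stemmed_index, query, toy_stem)       # stemming on both sides
    index_only = found(stemmed_index, query, identity)  # query left unstemmed
    print(f"{query}: both sides stemmed={both}, index only={index_only}")
```

With this toy data, an unstemmed query for "aquariums" misses the stemmed index while "aquarium" hits it, so comparing the index-time term with the query-time term for each word may help narrow down where the mismatch is.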

 
-----------

Regards

Alexey


_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general