We have a slight problem with the default stemmer: some words are
stemmed in a way that makes them impossible to search for later.
For example, imagine a webpage contains the phrase "iron ore".
Nutch fetches the page and stores the stemmed form of every word in its
index, so "iron ore" becomes "iron or".

As a result we cannot search for "ore": Nutch returns pages that
contain the plain word "or", because "ore" and "or" are stemmed to
exactly the same term. So if we search for "Iron Ore", Nutch actually
shows the webpages containing "iron" and "or".
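
To make the collision concrete, here is a minimal sketch in Python. It
assumes the stemmer in question behaves like the Porter algorithm and
reimplements only Porter's final-"e" rule (step 5a), not the whole
stemmer. The function names are made up for illustration and are not
part of Nutch or Lucene.

```python
VOWELS = set("aeiou")


def is_consonant(word, i):
    """Porter's definition: 'y' is a consonant only at the start or after a vowel."""
    ch = word[i]
    if ch in VOWELS:
        return False
    if ch == "y":
        return i == 0 or not is_consonant(word, i - 1)
    return True


def measure(stem):
    """Count vowel-to-consonant transitions (Porter's 'm' value)."""
    m = 0
    prev_vowel = False
    for i in range(len(stem)):
        cons = is_consonant(stem, i)
        if cons and prev_vowel:
            m += 1
        prev_vowel = not cons
    return m


def ends_cvc(stem):
    """True if stem ends consonant-vowel-consonant and the last letter is not w, x or y."""
    n = len(stem)
    if n < 3:
        return False
    return (
        is_consonant(stem, n - 3)
        and not is_consonant(stem, n - 2)
        and is_consonant(stem, n - 1)
        and stem[-1] not in "wxy"
    )


def drop_final_e(word):
    """Simplified Porter step 5a: strip a trailing 'e' when the rule allows it."""
    if not word.endswith("e"):
        return word
    stem = word[:-1]
    m = measure(stem)
    if m > 1 or (m == 1 and not ends_cvc(stem)):
        return stem
    return word


def index_terms(text):
    """Hypothetical analysis step: lowercase, split on whitespace, apply the rule."""
    return [drop_final_e(w) for w in text.lower().split()]


print(index_terms("iron ore"))  # ['iron', 'or']
print(index_terms("Iron OR"))   # ['iron', 'or']
```

Both inputs produce the same terms, so with this rule applied at index
time and at query time the index cannot tell "ore" from "or".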

Does anybody know how to fix that?

