We have a slight problem using default stemmer. The problem is that some words are stemmed the way they cannot be used later while searching. For example imagine we have a phrase "iron ore" on some webpage. Nutch fetches the page and stores stemmed version of every word in its index so "iron ore" becomes "iron or".
The problem is that we cannot search for "ore" - Nutch shows the results of pages that contain simple "OR" word because "ORE" and "OR" are stemmed exactly the same way. So if we search for "Iron Ore" Nutch actually shows the webpages containing "Iron" and "Or". Does anybody know how to fix that ? ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
