hi everyone

i dont know if we're doing something wrong, but the quality of the text
in the Nutch search results is appauling. 


To give you an example:

the text outputted for http://www.gamingalmanac.com/ is the following:

 ... Gaming Industry Research Publications, Worldwide Gaming Almanacs,
 Bear Stearns Gaming Almanac, Gaming Revenue and Statistics PRODUCT
 OVERVIEW COMPLETE ANALYST PACKAGE NORTH AMERICAN ALMANAC INDIAN GAMING
 INDUSTRY REPORT NEVADA GAMING ALMANAC GLOBAL GAMING ALMANAC GLOBAL
 GAMBLING REPORT MARKET RESEARCH HANDBOOK MICROSOFT MAP POINT Save up to
 45% with a Gaming Analyst Package! The Gaming Almanac Family of
 Products Find every fact, figure, and trend you need on the gaming
 industry. With current property profiles and statistics, historical and
 forward-looking financial data, local, regional, and worldwide gaming
 market summaries, and key player profiles, the Gaming Almanac products
 from Casino City Press offer information essential to every gaming
 executive, supplier, and analyst. Titles Include: Casino City ... 

whereas Google outputs:

Gaming Industry Research Publications, Worldwide Gaming Almanacs ...
The Gaming Almanac products from Casino City Press serve as excellent
reference tools for anyone interested in the worldwide and domestic
gaming markets.
gamingalmanac.com/

Is there any easy way to fix this? The Nutch search results appear to
include text in the website menu's, etc. which affects the usability of
the search results.

Where in Nutch would I go about fixing this?

Thanks

Jamie



-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to