Stefan Groschupf wrote:
>> The idea to have something like this as a nutch-module (dropping pages
>> or ranking them very low) might come up :-)
>
> This will be a very long way.
> I collect some thoughts and a list of web spam related papers in my blog.
> http://www.find23.net/Web-Site/blog/521BA1CD-14C4-4E84-A072-F98E13CAEFE1.html
>
> Feedback is welcome.

Have a look also at the papers published at the (just finished) WWW2006 conference:
http://www2006.org/tracks/#session_paper03
Other papers related to search (the Search Engineering and Search tracks) are equally interesting ...

Enjoy the reading! :)

--
Best regards,
Andrzej Bialecki <><
Information Retrieval, Semantic Web
Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com

_______________________________________________
Nutch-developers mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-developers
