[ http://issues.apache.org/jira/browse/NUTCH-240?page=comments#action_12373256 ]
Shawn Gervais commented on NUTCH-240:
-------------------------------------

This change seems to have caused an error to be thrown:

060405 034711 Generator: Partitioning selected urls by host, for politeness.
Exception in thread "main" java.lang.RuntimeException: class org.apache.nutch.crawl.Generator$SelectorInverseMapper not org.apache.hadoop.mapred.Mapper
        at org.apache.hadoop.conf.Configuration.setClass(Configuration.java:262)
        at org.apache.hadoop.mapred.JobConf.setMapperClass(JobConf.java:249)
        at org.apache.nutch.crawl.Generator.generate(Generator.java:263)
        at org.apache.nutch.crawl.Generator.main(Generator.java:317)

Just FYI.

> Scoring API: extension point, scoring filters and an OPIC plugin
> ----------------------------------------------------------------
>
>          Key: NUTCH-240
>          URL: http://issues.apache.org/jira/browse/NUTCH-240
>      Project: Nutch
>         Type: Improvement
>     Versions: 0.8-dev
>     Reporter: Andrzej Bialecki
>     Assignee: Andrzej Bialecki
>  Attachments: Generator.patch.txt, patch.txt, patch1.txt
>
> This patch refactors all places where Nutch manipulates page scores, into a
> plugin-based API. Using this API it's possible to implement different scoring
> algorithms. It is also much easier to understand how scoring works.
> Multiple scoring plugins can be run in sequence, in a manner similar to
> URLFilters.
> Included is also an OPICScoringFilter plugin, which contains the current
> implementation of the scoring algorithm. Together with the scoring API it
> provides a fully backward-compatible scoring.
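
For readers unfamiliar with this kind of failure: the "class X not Y" message in the trace above is what a type check at registration time typically produces when the class handed to setMapperClass is not assignable to the expected interface. The sketch below reproduces that kind of check in isolation. It does not use Hadoop or Nutch; SetClassCheck, Mapper, GoodMapper and BadMapper are hypothetical stand-ins, and the message format is only modelled on the trace. It says nothing about why Generator$SelectorInverseMapper failed the check in this particular build.

```java
import java.util.HashMap;
import java.util.Map;

public class SetClassCheck {
    /** Hypothetical stand-in for the expected interface (not the Hadoop one). */
    public interface Mapper {}

    /** Implements the interface, so registration succeeds. */
    public static class GoodMapper implements Mapper {}

    /** Does not implement the interface, so registration is rejected. */
    public static class BadMapper {}

    private static final Map<String, Class<?>> REGISTRY = new HashMap<>();

    /**
     * Registers theClass under name, but only if it is assignable to xface.
     * This is an assumed shape for the check, modelled on the error message.
     */
    public static void setClass(String name, Class<?> theClass, Class<?> xface) {
        if (!xface.isAssignableFrom(theClass)) {
            throw new RuntimeException(theClass + " not " + xface.getName());
        }
        REGISTRY.put(name, theClass);
    }

    public static void main(String[] args) {
        // Accepted: GoodMapper implements Mapper.
        setClass("mapper", GoodMapper.class, Mapper.class);

        // Rejected: BadMapper does not implement Mapper.
        try {
            setClass("mapper", BadMapper.class, Mapper.class);
        } catch (RuntimeException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Under these assumptions, a check like this fails before any job is submitted, which is consistent with the trace ending in Generator.generate rather than inside a running task.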