[ 
http://issues.apache.org/jira/browse/NUTCH-240?page=comments#action_12373256 ] 

Shawn Gervais commented on NUTCH-240:
-------------------------------------

This change seems to have caused an error to be thrown:

060405 034711 Generator: Partitioning selected urls by host, for politeness.
Exception in thread "main" java.lang.RuntimeException: class 
org.apache.nutch.crawl.Generator$SelectorInverseMapper not 
org.apache.hadoop.mapred.Mapper
        at org.apache.hadoop.conf.Configuration.setClass(Configuration.java:262)
        at org.apache.hadoop.mapred.JobConf.setMapperClass(JobConf.java:249)
        at org.apache.nutch.crawl.Generator.generate(Generator.java:263)
        at org.apache.nutch.crawl.Generator.main(Generator.java:317)

Just FYI.

> Scoring API: extension point, scoring filters and an OPIC plugin
> ----------------------------------------------------------------
>
>          Key: NUTCH-240
>          URL: http://issues.apache.org/jira/browse/NUTCH-240
>      Project: Nutch
>         Type: Improvement

>     Versions: 0.8-dev
>     Reporter: Andrzej Bialecki 
>     Assignee: Andrzej Bialecki 
>  Attachments: Generator.patch.txt, patch.txt, patch1.txt
>
> This patch refactors all places where Nutch manipulates page scores, into a 
> plugin-based API. Using this API it's possible to implement different scoring 
> algorithms. It is also much easier to understand how scoring works.
> Multiple scoring plugins can be run in sequence, in a manner similar to 
> URLFilters.
> Included is also an OPICScoringFilter plugin, which contains the current 
> implementation of the scoring algorithm. Together with the scoring API it 
> provides a fully backward-compatible scoring.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira

Reply via email to