[ 
https://issues.apache.org/jira/browse/NUTCH-839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Julien Nioche closed NUTCH-839.
-------------------------------

       Resolution: Duplicate
    Fix Version/s: 1.4

https://issues.apache.org/jira/browse/NUTCH-937
                
> nutch doesnt run under 0.20.2+228-1~karmic-cdh3b1 version of hadoop
> -------------------------------------------------------------------
>
>                 Key: NUTCH-839
>                 URL: https://issues.apache.org/jira/browse/NUTCH-839
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 1.1
>         Environment: ubuntu linux version 2.6.31-14-server, x86_64 GNU/Linux
>            Reporter: Robert Gonzalez
>             Fix For: 1.4
>
>
> new versions of hadoop appear to put jars in a different format now, instead 
> of file:/a/b/c/d/job.jar, its now jar:file:/a/b/c/d/job.jar!, which breaks 
> nutch when its trying to load its plugins.  Specifically, the stack trace 
> looks like:
> Caused by: java.lang.RuntimeException: x point 
> org.apache.nutch.net.URLNormalizer not found.
>       at org.apache.nutch.net.URLNormalizers.<init>(URLNormalizers.java:124)
>       at 
> org.apache.nutch.crawl.Injector$InjectMapper.configure(Injector.java:57)
> A simple test class was written the used the URLFilters class, and the 
> following stack trace resulted:
> 10/07/01 14:25:25 INFO mapred.JobClient: Task Id : 
> attempt_201006171624_46525_m_000000_1, Status : FAILED
> java.lang.RuntimeException: org.apache.nutch.net.URLFilter not found.
>       at org.apache.nutch.net.URLFilters.<init>(URLFilters.java:52)
>       at com.maxpoint.crawl.BidSampler$BIdSMapper.setup(BidSampler.java:42)
>       at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
>       at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:621)
>       at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
>       at org.apache.hadoop.mapred.Child.main(Child.java:170)
> Running this on an older version of hadoop works.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to