Sean Dean wrote:
Hello,

In an attempt to run my regular scheduled fetch, and to also see if I could 
reproduce your error it seems I may have found another one. The following 
procedure was done with the most recent trunk version, including the Hadoop-0.9 
update.

bin/nutch generate crawl/crawldb crawl/segments -topN 1000000

This command was successful, fetch list was generated without error.

bin/nutch fetch crawl/segments/20061211105651

Actual fetch was successful, but the reduce stage failed. Error output was:

Fetcher: java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:393)
        at org.apache.nutch.fetcher.Fetcher.fetch(Fetcher.java:445)
        at org.apache.nutch.fetcher.Fetcher.run(Fetcher.java:480)
        at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:187)
        at org.apache.nutch.fetcher.Fetcher.main(Fetcher.java:452)



This stacktrace doesn't tell anything except that the job failed ... Please find the corresponding entries in tasktracker's log, they should provide more details regarding the reason for this failure. You may also wish to increase the log level to get more details in logs.

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Reply via email to