Hi, I am trying to modify the Fetcher code in Nutch.81 , but always get the exceptions below in the hadoop.log.
java.lang.RuntimeException: java.io.IOException: Version: 4 Status: 5 (fetch_success) Fetch time: Wed Mar 14 22:15:58 CST 2007 Modified time: Thu Jan 01 08:00:00 CST 1970 Retries since fetch: 0 Fetch interval: 0.5 days Score: 1.0 Signature: f4b0490606f416542246fb2bf625d0cb Metadata: null read 95 bytes, should read 159 at org.apache.hadoop.mapred.ReduceTask$ValuesIterator.next(ReduceTask.java:129) at org.apache.nutch.crawl.CrawlDbReducer.reduce(CrawlDbReducer.java:56) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:247) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:112) Why "159" bytes should be read here? Any hints for me to find the answer? Thanks -Qi
