Hi,
I am trying to modify the Fetcher code in Nutch.81 , but always get the
exceptions below in the hadoop.log.
java.lang.RuntimeException: java.io.IOException: Version: 4
Status: 5 (fetch_success)
Fetch time: Wed Mar 14 22:15:58 CST 2007
Modified time: Thu Jan 01 08:00:00 CST 1970
Retries since fetch: 0
Fetch interval: 0.5 days
Score: 1.0
Signature: f4b0490606f416542246fb2bf625d0cb
Metadata: null
read 95 bytes, should read 159
at org.apache.hadoop.mapred.ReduceTask$ValuesIterator.next(ReduceTask.java:129)
at org.apache.nutch.crawl.CrawlDbReducer.reduce(CrawlDbReducer.java:56)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:247)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:112)
Why "159" bytes should be read here? Any hints for me to find the answer?
Thanks
-Qi
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general