Doug
Luke Baker wrote:
Has anybody seen this error? I'm trying to update the db from scratch using some previously fetched segments. Both of my last two attempts to update the db with the data from one 2-million-page segment have failed with this error:
-----------------------------
050301 012722 Processing document 2002000
050301 012725 Finishing update
Exception in thread "main" java.io.IOException: db/webdb.new/tmp/pagesByURL.out.sorted.1 not a SequenceFile
at net.nutch.io.SequenceFile$Reader.init(SequenceFile.java:200)
at net.nutch.io.SequenceFile$Reader.<init>(SequenceFile.java:189)
at net.nutch.io.SequenceFile$Reader.<init>(SequenceFile.java:151)
at net.nutch.io.SequenceFile$Sorter$MergePass.run(SequenceFile.java:587)
at net.nutch.io.SequenceFile$Sorter.mergePass(SequenceFile.java:546)
at net.nutch.io.SequenceFile$Sorter.sort(SequenceFile.java:375)
at net.nutch.db.WebDBWriter$CloseProcessor.closeDown(WebDBWriter.java:522)
at net.nutch.db.WebDBWriter.close(WebDBWriter.java:1531)
at net.nutch.tools.UpdateDatabaseTool.close(UpdateDatabaseTool.java:305)
at net.nutch.tools.UpdateDatabaseTool.main(UpdateDatabaseTool.java:355)
-------------------
Has anybody seen this before? Any ideas on what is causing it?
Thanks,
Luke
------------------------------------------------------- SF email is sponsored by - The IT Product Guide Read honest & candid reviews on hundreds of IT Products from real users. Discover which products truly live up to the hype. Start reading now. http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click _______________________________________________ Nutch-developers mailing list Nutch-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nutch-developers