It looks like you have temporary results from previous run (probably killed or terminated not successfully). It shoudl be safe to remove db\webdb.new directory and start again.
regars
Piotr
EM wrote:
What does it mean if the bin/nutch analyze db 7 fails with:


050830 024914 Target pages from init(): 27419
050830 024914 Processing pagesByURL: Sorted 27419 instructions in 0.172
seconds.
050830 024914 Processing pagesByURL: Sorted 159412.79069767444
instructions/second
Finished at Tue Aug 30 02:49:14 EDT 2005
Exception in thread "main" java.io.IOException: already exists:
db\webdb.new\pagesByURL
        at org.apache.nutch.io.MapFile$Writer.<init>(MapFile.java:86)
        at
org.apache.nutch.db.WebDBWriter$CloseProcessor.closeDown(WebDBWriter.java:54
9)
        at org.apache.nutch.db.WebDBWriter.close(WebDBWriter.java:1544)
        at
org.apache.nutch.tools.DistributedAnalysisTool.completeRound(DistributedAnal
ysisTool.java:562)
        at
org.apache.nutch.tools.LinkAnalysisTool.iterate(LinkAnalysisTool.java:60)
        at
org.apache.nutch.tools.LinkAnalysisTool.main(LinkAnalysisTool.java:81)



Reply via email to