The problem is still there, maybe I'm doing something wrong?

1. 'rm -r db' 
2. 'mkdir db'
3. ' bin/nutch admin db -create'
4. I'll then updatedb db from a fetched segment, this should fill it up with
links?
5. 'bin/nutch analylze db 7'
And it fails here with three 'tmp<something>' directories and webdb.new 



-----Original Message-----
From: Piotr Kosiorowski [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, August 30, 2005 3:07 PM
To: [email protected]
Subject: Re: Analyser error

It looks like you have temporary results from previous run (probably 
killed or terminated not successfully). It shoudl be safe to remove 
db\webdb.new directory and start again.
regars
Piotr
EM wrote:
> What does it mean if the bin/nutch analyze db 7 fails with:
> 
> 
> 050830 024914 Target pages from init(): 27419
> 050830 024914 Processing pagesByURL: Sorted 27419 instructions in 0.172
> seconds.
> 050830 024914 Processing pagesByURL: Sorted 159412.79069767444
> instructions/second
> Finished at Tue Aug 30 02:49:14 EDT 2005
> Exception in thread "main" java.io.IOException: already exists:
> db\webdb.new\pagesByURL
>         at org.apache.nutch.io.MapFile$Writer.<init>(MapFile.java:86)
>         at
>
org.apache.nutch.db.WebDBWriter$CloseProcessor.closeDown(WebDBWriter.java:54
> 9)
>         at org.apache.nutch.db.WebDBWriter.close(WebDBWriter.java:1544)
>         at
>
org.apache.nutch.tools.DistributedAnalysisTool.completeRound(DistributedAnal
> ysisTool.java:562)
>         at
> org.apache.nutch.tools.LinkAnalysisTool.iterate(LinkAnalysisTool.java:60)
>         at
> org.apache.nutch.tools.LinkAnalysisTool.main(LinkAnalysisTool.java:81)
> 
> 



Reply via email to