Bugs item #858060, was opened at 2003-12-11 01:47
Message generated for change (Comment added) made by naam_jap
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=491356&aid=858060&group_id=59548

Category: link analysis
Group: mainline
Status: Open
Resolution: None
Priority: 5
Submitted By: Paolo Spadafora (korfut)
Assigned to: Michael Cafarella (mike_cafarella)
Summary: generate db segments fails

Initial Comment:
Following all the instructions of the tutorial
http://www.nutch.org/docs/en/tutorial.html I have an
error when run the analysis of the web db, the error
occurs when i run "nutch analyze db 5".
I run on windows xp, jdk 1.4.2_02, 512 mb ram, 1Ghz
Intel PIII.
Error is:

031127 020936 Processing pagesByMD5: Merged
32836.00493218249 records/second
031127 020936 Processing linksByMD5: Copied file (0
bytes) in 0.141 secs.
031127 020936 Processing linksByURL: Copied file (0
bytes) in 0.09 secs.
Finished at Thu Nov 27 02:09:36 EST 2003
Exception in thread "main" java.io.IOException: Could
not delete 
1dbwebdb.new    
mplinksByMD5.out
        at
net.nutch.util.FileUtil.fullyDelete(FileUtil.java:23)
        at
net.nutch.db.WebDBWriter.close(WebDBWriter.java:1626)
        at 
 net.nutch.tools.DistributedAnalysisTool.completeRound(DistributedAnal
ysisTool.java:521)
        at 
 net.nutch.tools.LinkAnalysisTool.iterate(LinkAnalysisTool.java:42)
        at
net.nutch.tools.LinkAnalysisTool.main(LinkAnalysisTool.java:62)
C:Codejava
utch-2003-11-17in>


----------------------------------------------------------------------

Comment By: Singh (naam_jap)
Date: 2004-03-28 00:34

Message:
Logged In: YES 
user_id=957081

Is this fixed yet?

----------------------------------------------------------------------

Comment By: Paolo Spadafora (korfut)
Date: 2003-12-12 01:17

Message:
Logged In: YES 
user_id=713420

i tryied again and it fails when "generate db segments"
Exception in thread "main" java.io.IOException: Could not
delete db\webdb.new\tm
p\linksByMD5.out
        at net.nutch.util.FileUtil.fullyDelete(FileUtil.java:23)
        at net.nutch.db.WebDBWriter.close(WebDBWriter.java:1626)
        at
net.nutch.tools.FetchListTool.emitFetchList(FetchListTool.java:555)
        at
net.nutch.tools.FetchListTool.emitFetchList(FetchListTool.java:303)
        at
net.nutch.tools.FetchListTool.main(FetchListTool.java:658)

I see the folder webdb.new contains a tmp folder that
contains a linksByMD5.out file of zero bytes.
Seems there is a MapFile Reader still open, linksByMD5.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=491356&aid=858060&group_id=59548


-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to