Bugs item #858060, was opened at 2003-12-11 01:47 Message generated for change (Comment added) made by naam_jap You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=491356&aid=858060&group_id=59548
Category: link analysis Group: mainline Status: Open Resolution: None Priority: 5 Submitted By: Paolo Spadafora (korfut) Assigned to: Michael Cafarella (mike_cafarella) Summary: generate db segments fails Initial Comment: Following all the instructions of the tutorial http://www.nutch.org/docs/en/tutorial.html I have an error when run the analysis of the web db, the error occurs when i run "nutch analyze db 5". I run on windows xp, jdk 1.4.2_02, 512 mb ram, 1Ghz Intel PIII. Error is: 031127 020936 Processing pagesByMD5: Merged 32836.00493218249 records/second 031127 020936 Processing linksByMD5: Copied file (0 bytes) in 0.141 secs. 031127 020936 Processing linksByURL: Copied file (0 bytes) in 0.09 secs. Finished at Thu Nov 27 02:09:36 EST 2003 Exception in thread "main" java.io.IOException: Could not delete 1dbwebdb.new mplinksByMD5.out at net.nutch.util.FileUtil.fullyDelete(FileUtil.java:23) at net.nutch.db.WebDBWriter.close(WebDBWriter.java:1626) at net.nutch.tools.DistributedAnalysisTool.completeRound(DistributedAnal ysisTool.java:521) at net.nutch.tools.LinkAnalysisTool.iterate(LinkAnalysisTool.java:42) at net.nutch.tools.LinkAnalysisTool.main(LinkAnalysisTool.java:62) C:Codejava utch-2003-11-17in> ---------------------------------------------------------------------- Comment By: Singh (naam_jap) Date: 2004-03-28 00:34 Message: Logged In: YES user_id=957081 Is this fixed yet? ---------------------------------------------------------------------- Comment By: Paolo Spadafora (korfut) Date: 2003-12-12 01:17 Message: Logged In: YES user_id=713420 i tryied again and it fails when "generate db segments" Exception in thread "main" java.io.IOException: Could not delete db\webdb.new\tm p\linksByMD5.out at net.nutch.util.FileUtil.fullyDelete(FileUtil.java:23) at net.nutch.db.WebDBWriter.close(WebDBWriter.java:1626) at net.nutch.tools.FetchListTool.emitFetchList(FetchListTool.java:555) at net.nutch.tools.FetchListTool.emitFetchList(FetchListTool.java:303) at net.nutch.tools.FetchListTool.main(FetchListTool.java:658) I see the folder webdb.new contains a tmp folder that contains a linksByMD5.out file of zero bytes. Seems there is a MapFile Reader still open, linksByMD5. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=491356&aid=858060&group_id=59548 ------------------------------------------------------- This SF.Net email is sponsored by: IBM Linux Tutorials Free Linux tutorial presented by Daniel Robbins, President and CEO of GenToo technologies. Learn everything from fundamentals to system administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click _______________________________________________ Nutch-developers mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-developers
