Re: [Nutch-general] Link db (traversal + modification)

Honda-Search Administrator Sat, 08 Jul 2006 20:07:29 -0700

Looks like you cancelled a previous crawl in the middle of it, or somethingelse did.

delete the D:\cygwin\home\nutch-0.7.2\bin\fqjoke2\db\webdb.old directory andrecrawl. You should be fine.

----- Original Message -----From: "kevin" <[EMAIL PROTECTED]>

To: <[email protected]>
Sent: Saturday, July 08, 2006 3:34 AM
Subject: Re: [Nutch-general] Link db (traversal + modification)

Hi,

I ran nutch using this command:
$ ./nutch crawl urlfq2.txt -dir fqjoke2 -depth 20 -threads 10 >& fq2.log
during the crawling ,the following exception occured:
060708 182413 status: segment 20060708181314, 471 pages, 69 errors,5655871 bytes, 657469 ms060708 182413 status: 0.7163836 pages/s, 67.20696 kb/s, 12008.219bytes/page
060708 182414 Updating D:\cygwin\home\nutch-0.7.2\bin\fqjoke2\db
Exception in thread "main" java.io.IOException: Impossible condition:directories D:\cygwin\home\nutch-0.7.2\bin\fqjoke2\db\webdb.old andD:\cygwin\home\nutch-0.7.2\bin\fqjoke2\db\webdb cannot existsimultaneously
   at org.apache.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1484)
   at org.apache.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1457)
atorg.apache.nutch.tools.UpdateDatabaseTool.main(UpdateDatabaseTool.java:360)
   at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:141)



why this happened ? any solution available? many thanks!

Re: [Nutch-general] Link db (traversal + modification)

Reply via email to