[ 
http://issues.apache.org/jira/browse/NUTCH-266?page=comments#action_12416958 ] 

KuroSaka TeruHiko commented on NUTCH-266:
-----------------------------------------

I noticed that the path quoted in the exception message has no drive letter 
(C:) in either case.  Since both cases were observed on Windows, the missing 
drive letter may cause the wrong drive to be accessed, which might be a cause 
of these fatal errors.
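
To make the observation concrete, here is a minimal sketch that checks whether 
a path carries a Windows drive letter.  The class and method names are 
hypothetical and are not part of Hadoop or Nutch; the two sample paths are 
copied from the log quoted below.  On Windows, a path that starts with a slash 
but has no drive letter is resolved against the current drive, so it need not 
end up on C:.

```java
// Hypothetical helper for illustration only; not part of Hadoop or Nutch.
final class DriveLetterCheck {

    // Returns true if the path starts with an ASCII letter followed by a colon,
    // e.g. "c:/nutch/..." or "C:\\tmp".
    static boolean hasDriveLetter(String path) {
        if (path == null || path.length() < 2) {
            return false;
        }
        char c = path.charAt(0);
        boolean asciiLetter = (c >= 'A' && c <= 'Z') || (c >= 'a' && c <= 'z');
        return asciiLetter && path.charAt(1) == ':';
    }

    public static void main(String[] args) {
        // Both paths are taken from the quoted log output.
        String[] paths = {
            "c:/nutch/crawl-20060508230625/crawldb/current/part-00000/data",
            "/tmp/hadoop/mapred/local/reduce_qnd5sx/map_qjp7tf.out"
        };
        for (String p : paths) {
            String label = hasDriveLetter(p) ? "drive letter:    " : "no drive letter: ";
            System.out.println(label + p);
        }
    }
}
```

The crawl input paths pass the check, while the temporary path named in the 
exception does not.  This only shows the discrepancy; it does not confirm that 
the missing drive letter is the actual cause of the failure.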




> hadoop bug when doing updatedb
> ------------------------------
>
>          Key: NUTCH-266
>          URL: http://issues.apache.org/jira/browse/NUTCH-266
>      Project: Nutch
>         Type: Bug
>     Versions: 0.8-dev
>  Environment: windows xp, JDK 1.4.2_04
>     Reporter: Eugen Kochuev
>
> I constantly get the following error message
> 060508 230637 Running job: job_pbhn3t
> 060508 230637 c:/nutch/crawl-20060508230625/crawldb/current/part-00000/data:0+245
> 060508 230637 c:/nutch/crawl-20060508230625/segments/20060508230628/crawl_fetch/part-00000/data:0+296
> 060508 230637 c:/nutch/crawl-20060508230625/segments/20060508230628/crawl_parse/part-00000:0+5258
> 060508 230637 job_pbhn3t
> java.io.IOException: Target /tmp/hadoop/mapred/local/reduce_qnd5sx/map_qjp7tf.out already exists
>         at org.apache.hadoop.fs.FileUtil.checkDest(FileUtil.java:162)
>         at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:62)
>         at org.apache.hadoop.fs.LocalFileSystem.renameRaw(LocalFileSystem.java:191)
>         at org.apache.hadoop.fs.FileSystem.rename(FileSystem.java:306)
>         at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:101)
> Exception in thread "main" java.io.IOException: Job failed!
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:341)
>         at org.apache.nutch.crawl.CrawlDb.update(CrawlDb.java:54)
>         at org.apache.nutch.crawl.Crawl.main(Crawl.java:114)

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira
