Hello all,

I'm having trouble getting a large mapping job to complete. Several of the
thousands of mappers are failing with this error:

java.io.FileNotFoundException: File does not exist: /data/hadoop/cache/mapred/mapred/staging/yuval/.staging/job_201011120027_144772/job.split
        at org.apache.hadoop.hdfs.DFSClient$DFSInputStream.openInfo(DFSClient.java:1586)
        at org.apache.hadoop.hdfs.DFSClient$DFSInputStream.<init>(DFSClient.java:1577)
        at org.apache.hadoop.hdfs.DFSClient.open(DFSClient.java:428)
        at org.apache.hadoop.hdfs.DistributedFileSystem.open(DistributedFileSystem.java:185)
        at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:431)
        at org.apache.hadoop.mapred.MapTask.getSplitDetails(MapTask.java:325)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:357)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:317)
        at org.apache.hadoop.mapred.Child$4.run(Child.java:217)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:396)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1063)
        at org.apache.hadoop.mapred.Child.main(Child.java:211)


This is on Cloudera's CDH3 release. Any ideas?


Thanks!

Yuval
