[ 
https://issues.apache.org/jira/browse/MAPREDUCE-115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12914379#action_12914379
 ] 

Torsten Curdt commented on MAPREDUCE-115:
-----------------------------------------

We were able reproduce it with every job run. Now we switched from lzo to gz 
compression for the input and turned off intermediate compression and that 
worked.

So I am not sure this was (also - like another issue) related the lzo 
compression or because of a bug in the intermediate compression.

> Map tasks are receiving FileNotFound Exceptions for spill files on a regular 
> basis and are getting killed
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-115
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-115
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 0.20.2
>            Reporter: Jothi Padmanabhan
>
> The following is the log -- Map tasks are unable to locate the spill files 
> when they are doing the final merge (mergeParts). 
> java.io.FileNotFoundException: File 
> /xxx/mapred-tt/mapred-local/taskTracker/jobcache/job_200808190959_0001/attempt_200808190959_0001_m_000000_0/output/spill23.out
>  does not exist.
>       at 
> org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:420)
>       at 
> org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:244)
>       at 
> org.apache.hadoop.fs.FileSystem.getContentSummary(FileSystem.java:682)
>       at 
> org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSInputChecker.getFileLength(ChecksumFileSystem.java:218)
>       at 
> org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSInputChecker.seek(ChecksumFileSystem.java:259)
>       at 
> org.apache.hadoop.fs.FSDataInputStream.seek(FSDataInputStream.java:37)
>       at 
> org.apache.hadoop.mapred.MapTask$MapOutputBuffer.mergeParts(MapTask.java:1102)
>       at 
> org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:769)
>       at org.apache.hadoop.mapred.MapTask.run(MapTask.java:255)
>       at 
> org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2208)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to