[jira] Commented: (HADOOP-5865) Shuffle's getMapOutput() fails with EofException, followed by IllegalStateException

Jothi Padmanabhan (JIRA) Mon, 18 May 2009 20:43:09 -0700

    [ 
https://issues.apache.org/jira/browse/HADOOP-5865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12710602#action_12710602
 ]


Jothi Padmanabhan commented on HADOOP-5865:
-------------------------------------------

The Jetty EofException is observed when the Reduce Task (RT) prematurely 
closes a connection to a Jetty server. The RT may do this when the expected 
map output is smaller than its memory cache limit, but it cannot fetch the 
output right away because it has already fetched several other map outputs 
into memory. In that case it triggers a flush of the existing in-memory map 
outputs to disk and, once the flush is complete, fetches the map output again 
by reopening the connection to Jetty. In short, it is OK to see such messages.
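
To make the sequence above concrete, here is a minimal sketch of the decision 
the RT makes for each map output. It is only a toy model under stated 
assumptions, not the actual Hadoop shuffle code: the class ShuffleBufferSketch, 
its fields, the Outcome enum and the byte limits are all hypothetical names 
and values chosen for illustration.

```java
// Hypothetical model of the reducer's in-memory shuffle buffer.
// This is NOT Hadoop's implementation; names and limits are assumptions.
class ShuffleBufferSketch {
    enum Outcome { FETCHED_TO_MEMORY, FETCHED_TO_DISK, CLOSED_FLUSHED_AND_RETRIED }

    private final long memoryLimit;      // total bytes allowed for in-memory map outputs
    private final long maxSingleOutput;  // largest single output eligible for memory
    private long bytesInMemory = 0;
    private int earlyCloses = 0;         // each one would surface as an EofException on the server side

    ShuffleBufferSketch(long memoryLimit, long maxSingleOutput) {
        this.memoryLimit = memoryLimit;
        this.maxSingleOutput = maxSingleOutput;
    }

    Outcome fetch(long mapOutputSize) {
        if (mapOutputSize > maxSingleOutput) {
            // Too large for the memory cache: stream straight to disk.
            return Outcome.FETCHED_TO_DISK;
        }
        if (bytesInMemory + mapOutputSize > memoryLimit) {
            // Small enough for memory, but the cache is currently full:
            // close the connection early, flush memory to disk, then refetch.
            earlyCloses++;
            bytesInMemory = 0;               // models the memory-to-disk flush
            bytesInMemory += mapOutputSize;  // models the retry on a new connection
            return Outcome.CLOSED_FLUSHED_AND_RETRIED;
        }
        bytesInMemory += mapOutputSize;
        return Outcome.FETCHED_TO_MEMORY;
    }

    int earlyCloses() { return earlyCloses; }

    long bytesInMemory() { return bytesInMemory; }
}
```

In this model, only the CLOSED_FLUSHED_AND_RETRIED path closes a connection 
early, and the fetch still succeeds afterwards, which is why the resulting log 
messages are harmless.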

This should cause neither a task failure nor a job failure. Are you seeing 
failures? Whether these error messages appear for a given job depends 
primarily on the size of its map outputs and on the heap size allocated to 
the reducer.

> Shuffle's getMapOutput() fails with EofException, followed by 
> IllegalStateException
> -----------------------------------------------------------------------------------
>
>                 Key: HADOOP-5865
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5865
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>         Environment: Sun Java 1.6.0_13, OpenSolaris, running on a SunFire 
> 4150 (x64) 10 node cluster
>            Reporter: George Porter
>
> During the shuffle phase, I'm seeing a large sequence of the following 
> actions:
> 1) WARN org.apache.hadoop.mapred.TaskTracker: 
> getMapOutput(attempt_200905181452_0002_m_000010_0,0) failed : 
> org.mortbay.jetty.EofException
> 2) WARN org.mortbay.log: Committed before 410 
> getMapOutput(attempt_200905181452_0002_m_000010_0,0) failed : 
> org.mortbay.jetty.EofException
> 3) ERROR org.mortbay.log: /mapOutput java.lang.IllegalStateException: 
> Committed
> The map phase completes with 100%, and then the reduce phase crawls along 
> with the above errors in each of the TaskTracker logs.  None of the 
> tasktrackers get lost.  When I run non-data jobs like the 'pi' test from the 
> example jar, everything works fine.

