[ https://issues.apache.org/jira/browse/MAPREDUCE-2666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13125185#comment-13125185 ]

Vinod Kumar Vavilapalli commented on MAPREDUCE-2666:
----------------------------------------------------

TaskAttemptStartedEvent is fine. If we can just log the shuffle port in
JobHistory here, we can do the recovery part after MAPREDUCE-2708 goes in, as
part of another JIRA.

The patch is heading in the right direction. In fact, it is almost there,
pending any tests you want to add.

> MR-279: Need to retrieve shuffle port number on ApplicationMaster restart
> -------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2666
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2666
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>          Components: mrv2
>    Affects Versions: 0.23.0
>            Reporter: Robert Joseph Evans
>            Assignee: Jonathan Eagles
>            Priority: Blocker
>             Fix For: 0.23.0
>
>         Attachments: MAPREDUCE-2666.patch
>
>
> MAPREDUCE-2652 allows ShuffleHandler to return the port it is operating on.
> If the ApplicationMaster crashes and has to be restarted, that information
> is lost. We either need to re-query it from each of the NodeManagers or
> persist it to the JobHistory logs and retrieve it again. Persisting it to
> the job history logs is probably the simpler solution.
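
To make the "persist it to the JobHistory logs and retrieve it again" option concrete, here is a minimal, self-contained sketch. It is not the attached patch and does not use the real Hadoop classes: the AttemptStarted type, the tab-separated line format, the attempt IDs, and the port numbers are all hypothetical stand-ins for whatever TaskAttemptStartedEvent actually records. It only illustrates the idea of writing the port at attempt start and rebuilding the attempt-to-port mapping by replaying the log after a restart.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: none of these names are real Hadoop classes or formats.
public class ShufflePortHistorySketch {

    /** Stand-in for an "attempt started" history event that carries the shuffle port. */
    static final class AttemptStarted {
        static final String TAG = "ATTEMPT_STARTED"; // assumed record tag

        final String attemptId;
        final String nodeHost;
        final int shufflePort;

        AttemptStarted(String attemptId, String nodeHost, int shufflePort) {
            this.attemptId = attemptId;
            this.nodeHost = nodeHost;
            this.shufflePort = shufflePort;
        }

        /** Serializes the event as one tab-separated history line (assumed format). */
        String toLogLine() {
            return TAG + "\t" + attemptId + "\t" + nodeHost + "\t" + shufflePort;
        }

        /** Parses a history line; returns null for any other kind of record. */
        static AttemptStarted fromLogLine(String line) {
            String[] fields = line.split("\t");
            if (fields.length != 4 || !TAG.equals(fields[0])) {
                return null;
            }
            return new AttemptStarted(fields[1], fields[2], Integer.parseInt(fields[3]));
        }
    }

    /** Replays history lines after a restart, rebuilding attemptId -> shuffle port. */
    static Map<String, Integer> recoverShufflePorts(List<String> historyLines) {
        Map<String, Integer> ports = new HashMap<>();
        for (String line : historyLines) {
            AttemptStarted event = AttemptStarted.fromLogLine(line);
            if (event != null) {
                ports.put(event.attemptId, event.shufflePort);
            }
        }
        return ports;
    }

    public static void main(String[] args) {
        // Before the crash: the port is logged when each attempt starts.
        List<String> history = new ArrayList<>();
        history.add(new AttemptStarted("attempt_m_0", "node1", 8080).toLogLine());
        history.add("SOME_OTHER_EVENT\tignored"); // unrelated records are skipped
        history.add(new AttemptStarted("attempt_m_1", "node2", 8081).toLogLine());

        // After the restart: the mapping is rebuilt from the log alone.
        Map<String, Integer> recovered = recoverShufflePorts(history);
        System.out.println("attempt_m_0 -> " + recovered.get("attempt_m_0"));
        System.out.println("attempt_m_1 -> " + recovered.get("attempt_m_1"));
    }
}
```

The appeal of this route over re-querying each NodeManager is that recovery needs only the log the ApplicationMaster already writes; no extra round trips to the nodes are required after a restart.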
