[
https://issues.apache.org/jira/browse/MAPREDUCE-2666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13125185#comment-13125185
]
Vinod Kumar Vavilapalli commented on MAPREDUCE-2666:
----------------------------------------------------
TaskAttemptStartedEvent is fine. If we just log it in JobHistory here, we can
do the recovery part in a separate JIRA after MAPREDUCE-2708 goes in.
The patch is headed in the right direction. In fact, it is almost there,
pending any tests you want to add.
> MR-279: Need to retrieve shuffle port number on ApplicationMaster restart
> -------------------------------------------------------------------------
>
> Key: MAPREDUCE-2666
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2666
> Project: Hadoop Map/Reduce
> Issue Type: Sub-task
> Components: mrv2
> Affects Versions: 0.23.0
> Reporter: Robert Joseph Evans
> Assignee: Jonathan Eagles
> Priority: Blocker
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2666.patch
>
>
> MAPREDUCE-2652 allows ShuffleHandler to return the port it is operating on.
> If the ApplicationMaster crashes and has to be restarted, that information
> is lost. We need to either re-query it from each of the NodeManagers or
> persist it to the JobHistory logs and read it back on restart.
> Persisting it to the job history logs is probably the simpler solution.
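
As an illustration of the job-history option described above, here is a
minimal Java sketch of the idea: write the shuffle port alongside each
task-attempt-started record, then replay the records after a restart to
rebuild the attempt-to-port mapping. It does not use the real Hadoop
JobHistory or TaskAttemptStartedEvent classes. The class name, the
one-line-per-event text format, the attempt IDs and the port numbers are all
hypothetical and chosen only for the example.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a job history log; not the Hadoop API.
public class ShufflePortHistorySketch {

    // Assumed record format: "TASK_ATTEMPT_STARTED <attemptId> <shufflePort>"
    public static final String EVENT = "TASK_ATTEMPT_STARTED";

    // Build the history line that would be written when an attempt starts.
    public static String formatStarted(String attemptId, int shufflePort) {
        return EVENT + " " + attemptId + " " + shufflePort;
    }

    // Replay history lines after a restart and rebuild attemptId -> port.
    // Lines that are not well-formed started-events are skipped.
    public static Map<String, Integer> recoverShufflePorts(List<String> historyLines) {
        Map<String, Integer> ports = new HashMap<>();
        for (String line : historyLines) {
            String[] fields = line.trim().split("\\s+");
            if (fields.length == 3 && EVENT.equals(fields[0])) {
                ports.put(fields[1], Integer.parseInt(fields[2]));
            }
        }
        return ports;
    }

    public static void main(String[] args) {
        // Before the crash: events are appended to the (in-memory) history.
        List<String> history = new ArrayList<>();
        history.add(formatStarted("attempt_0001_m_000000_0", 8080));
        history.add("JOB_SUBMITTED job_0001"); // unrelated record, ignored on replay
        history.add(formatStarted("attempt_0001_m_000001_0", 8081));

        // After the restart: the mapping is rebuilt from the history alone.
        Map<String, Integer> recovered = recoverShufflePorts(history);
        System.out.println(recovered.get("attempt_0001_m_000000_0"));
    }
}
```

The point of the sketch is that recovery needs nothing beyond the persisted
records, so no NodeManager has to be contacted again after the restart.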