[ https://issues.apache.org/jira/browse/MAPREDUCE-4838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Zhijie Shen updated MAPREDUCE-4838: ----------------------------------- Attachment: MAPREDUCE-4838_1.patch I've migrated the code to add more task-info to JH for Hadoop-1 to that for Hadoop-2. There're the following major differences: 1) There's no JobInProgress (actually nearly empty) and TaskInProgress, where the locality ant the avataar attributes are set and logged. Instead, "avataar" is now set in TaskImpl#addAndScheduleAttempt by judging whether there are other active task attempts, while "locality" is set in TaskAttemptImpl#ContainerAssignedTransition#transition by judging whether the assigned container's host is within the local host/rack list of the task attempt. 2) workflow related info is added in JobImpl. The function "getWorkflowAdjacencies" and its dependent functions are also imported in this class. 3) Locality has the same enum values as the NodeType of yarn, but I still created Loality because it infers one attribute of a task attempt. The current trunk can be built correctly with the patch applied. However, I still need some more work with the test cases. Arun and Sid, please have a look at the patch, and give some comments. Thank you! Zhijie > Add extra info to JH files > -------------------------- > > Key: MAPREDUCE-4838 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-4838 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Reporter: Arun C Murthy > Assignee: Zhijie Shen > Attachments: MAPREDUCE-4838_1.patch, MAPREDUCE-4838.patch > > > It will be useful to add more task-info to JH for analytics. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira