[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhijie Shen updated MAPREDUCE-4838:
-----------------------------------

    Attachment: MAPREDUCE-4838_1.patch

I've migrated the code to add more task-info to JH for Hadoop-1 to that for 
Hadoop-2. There're the following major differences:

1) There's no JobInProgress (actually nearly empty) and TaskInProgress, where 
the locality ant the avataar attributes are set and logged. Instead, "avataar" 
is now set in TaskImpl#addAndScheduleAttempt by judging whether there are other 
active task attempts, while "locality" is set in 
TaskAttemptImpl#ContainerAssignedTransition#transition by judging whether the 
assigned container's host is within the local host/rack list of the task 
attempt.

2) workflow related info is added in JobImpl. The function 
"getWorkflowAdjacencies" and its dependent functions are also imported in this 
class.

3) Locality has the same enum values as the NodeType of yarn, but I still 
created Loality because it infers one attribute of a task attempt.

The current trunk can be built correctly with the patch applied. However, I 
still need some more work with the test cases.

Arun and Sid, please have a look at the patch, and give some comments. Thank 
you!

Zhijie
                
> Add extra info to JH files
> --------------------------
>
>                 Key: MAPREDUCE-4838
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4838
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>            Reporter: Arun C Murthy
>            Assignee: Zhijie Shen
>         Attachments: MAPREDUCE-4838_1.patch, MAPREDUCE-4838.patch
>
>
> It will be useful to add more task-info to JH for analytics.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to