[ 
https://issues.apache.org/jira/browse/HADOOP-5931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718268#action_12718268
 ] 

Sharad Agarwal commented on HADOOP-5931:
----------------------------------------

bq. I am assuming the moving window mechanism would be flexible enough to add 
new bucket sizes as required. 
Yes. I am planning to use and extend the metrics framework available in core, 
through which custom window/bucket sizes can be defined.
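
As a rough illustration of the moving window idea, here is a minimal sketch of 
a counter with configurable window and bucket sizes. The class name 
WindowedTaskCounter and all of its methods are hypothetical and are not part of 
the core metrics framework; the sketch also assumes timestamps are 
non-negative and arrive in non-decreasing order.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Hypothetical sliding-window counter; not an existing Hadoop class. */
class WindowedTaskCounter {
  /** Counts for one fixed-width slice of time. */
  private static final class Bucket {
    final long startMillis;
    int tasks;
    int tasksSucceeded;
    Bucket(long startMillis) { this.startMillis = startMillis; }
  }

  private final long windowMillis;
  private final long bucketMillis;
  private final Deque<Bucket> buckets = new ArrayDeque<>();

  WindowedTaskCounter(long windowMillis, long bucketMillis) {
    if (bucketMillis <= 0 || windowMillis < bucketMillis) {
      throw new IllegalArgumentException("need 0 < bucket <= window");
    }
    this.windowMillis = windowMillis;
    this.bucketMillis = bucketMillis;
  }

  /** Records one finished task at the given time. */
  void record(long nowMillis, boolean succeeded) {
    expire(nowMillis);
    long start = nowMillis - (nowMillis % bucketMillis);
    Bucket last = buckets.peekLast();
    if (last == null || last.startMillis != start) {
      last = new Bucket(start);
      buckets.addLast(last);
    }
    last.tasks++;
    if (succeeded) {
      last.tasksSucceeded++;
    }
  }

  /** Total tasks in buckets still inside the window. */
  int tasks(long nowMillis) {
    expire(nowMillis);
    int sum = 0;
    for (Bucket b : buckets) {
      sum += b.tasks;
    }
    return sum;
  }

  /** Succeeded tasks in buckets still inside the window. */
  int tasksSucceeded(long nowMillis) {
    expire(nowMillis);
    int sum = 0;
    for (Bucket b : buckets) {
      sum += b.tasksSucceeded;
    }
    return sum;
  }

  /** Drops buckets that ended at or before the start of the window. */
  private void expire(long nowMillis) {
    long cutoff = nowMillis - windowMillis;
    while (!buckets.isEmpty()
        && buckets.peekFirst().startMillis + bucketMillis <= cutoff) {
      buckets.removeFirst();
    }
  }
}
```

Under these assumptions a "lasthour" metric would be one instance with a 
window of 3,600,000 ms and, for example, 60,000 ms buckets; a new window size 
is just another instance with different constructor arguments.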

bq. Regarding having the computation on the tasktracker, and reporting the 
status via status, one problem is that if we want to change the bucket size, it 
would involve a change in the status object.
To avoid that, instead of the above fields we can have, say, a 
List<MetricInfo> metrics field in TaskTrackerStatus, where MetricInfo could be:
class MetricInfo {
  String name;         // metric name, e.g. "lasthour"
  int tasks;           // total tasks in the window
  int tasksSucceeded;  // tasks that succeeded in the window
}
Here name would be the name of the metric, e.g. "lasthour", "lastday", etc., 
which could be configured in the metrics property file. 
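
To show why a list of named metrics keeps the status object stable, here is a 
minimal sketch that serializes such a list with plain java.io streams. 
TrackerStatusSketch is a hypothetical stand-in for the metrics part of 
TaskTrackerStatus, and the write/readFields methods only mimic the style of 
Hadoop's Writable without depending on it; the wire layout is an assumption.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

class MetricInfo {
  String name;
  int tasks;
  int tasksSucceeded;

  MetricInfo() {}

  MetricInfo(String name, int tasks, int tasksSucceeded) {
    this.name = name;
    this.tasks = tasks;
    this.tasksSucceeded = tasksSucceeded;
  }

  void write(DataOutput out) throws IOException {
    out.writeUTF(name);
    out.writeInt(tasks);
    out.writeInt(tasksSucceeded);
  }

  void readFields(DataInput in) throws IOException {
    name = in.readUTF();
    tasks = in.readInt();
    tasksSucceeded = in.readInt();
  }
}

/** Hypothetical stand-in for the metrics part of TaskTrackerStatus. */
class TrackerStatusSketch {
  final List<MetricInfo> metrics = new ArrayList<>();

  // The list is length-prefixed, so any number of windows fits the same format.
  void write(DataOutput out) throws IOException {
    out.writeInt(metrics.size());
    for (MetricInfo m : metrics) {
      m.write(out);
    }
  }

  void readFields(DataInput in) throws IOException {
    metrics.clear();
    int n = in.readInt();
    for (int i = 0; i < n; i++) {
      MetricInfo m = new MetricInfo();
      m.readFields(in);
      metrics.add(m);
    }
  }

  /** Serializes and deserializes in memory, as a heartbeat round trip would. */
  static TrackerStatusSketch roundTrip(TrackerStatusSketch status) {
    try {
      ByteArrayOutputStream bytes = new ByteArrayOutputStream();
      status.write(new DataOutputStream(bytes));
      TrackerStatusSketch copy = new TrackerStatusSketch();
      copy.readFields(
          new DataInputStream(new ByteArrayInputStream(bytes.toByteArray())));
      return copy;
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }
}
```

In this sketch, adding a window such as "lastweek" only adds one more list 
entry; neither class nor the serialized layout needs a new field.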

bq. Also, one requirement for this is to store this information on the 
JobTracker. Can you describe how this will be stored, mechanics with respect to 
lost tasktrackers etc ?
Currently the jobtracker doesn't store any information about lost 
tasktrackers. Storing info about lost trackers is not trivial and warrants a 
separate jira issue. Consider the case of a tracker getting lost and never 
coming back, or coming back on a different port. The jobtracker data 
structures would need to be cleaned up for such trackers; otherwise they would 
linger forever. 

bq. Will this information be available if the JobTracker restarts ?
Yes. Since this info is propagated from the tasktracker, it would be available 
after the jobtracker restarts.
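
A minimal sketch of that reasoning, assuming the jobtracker only holds the 
latest report from each tracker in memory: a fresh instance (standing in for a 
restarted jobtracker) starts empty and is repopulated by the next heartbeat. 
TrackerMetricsView and its methods are hypothetical names, and the sketch does 
not address cleanup for lost trackers.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical jobtracker-side view; holds no persistent state. */
class TrackerMetricsView {
  // tracker name -> (metric name -> {tasks, tasksSucceeded})
  private final Map<String, Map<String, int[]>> latest = new HashMap<>();

  /** Called per heartbeat: the tracker's report replaces whatever was held. */
  void onHeartbeat(String trackerName, Map<String, int[]> reported) {
    latest.put(trackerName, new HashMap<>(reported));
  }

  /** Returns {tasks, tasksSucceeded}, or null if nothing has been reported. */
  int[] get(String trackerName, String metricName) {
    Map<String, int[]> metrics = latest.get(trackerName);
    return metrics == null ? null : metrics.get(metricName);
  }
}
```

The window between a restart and the first heartbeat from each tracker is the 
only time the view has no data for it in this sketch.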

> Collect information about number of tasks succeeded / total per time unit for 
> a tasktracker. 
> ---------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-5931
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5931
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Hemanth Yamijala
>
> Collecting information of number of tasks succeeded / total per tasktracker 
> and being able to see these counts per hour, day and since start time will 
> help reason about things like the blacklisting strategy.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
