[ http://issues.apache.org/jira/browse/HADOOP-263?page=all ]
Sanjay Dahiya updated HADOOP-263:
---------------------------------
Attachment: patch.txt
A new patch, now shows start, finish times for tasks and shuffle/sort finished
time for reduce tasks.
few points, which should probably be separate issues.
- TaskStatus, TeskReport replicate some status data which is present as member
variables in TaskTracker.TaskInProgress, TaskInProgress. We can reduce the
duplication of data by maintaining objects of TaskStatus and get/set from that
directly, using simple composition.
- The status string passed around needs to be a public static final String
defined in TaskStatus and used everywhere else, so it can be compared against
reliably wrt future changes. Currently its reduce > copy (...), reduce > sort,
reduce > reduce.
> task status should include timestamps for when a job transitions
> ----------------------------------------------------------------
>
> Key: HADOOP-263
> URL: http://issues.apache.org/jira/browse/HADOOP-263
> Project: Hadoop
> Issue Type: Improvement
> Components: mapred
> Affects Versions: 0.2.1
> Reporter: Owen O'Malley
> Assigned To: Sanjay Dahiya
> Fix For: 0.6.0
>
> Attachments: patch.txt, patch.txt
>
>
> It would help users to understand what happened if the task status included
> information about when the task transitioned:
> Map:
> started
> finished
> Reduce:
> started
> shuffle finished
> sort finished
> finished
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira