[ 
https://issues.apache.org/jira/browse/TEZ-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14236544#comment-14236544
 ] 

Siddharth Seth commented on TEZ-1610:
-------------------------------------

I think both are useful metrics (when shuffle completed, and when the merger 
was closed). I think it's better to rename them though.
Minor:
- The TaskCounter is called MERGE_FINISH_TIME, it's referred to as 
MERGE_TIME_TAKEN in Shuffle.
- Make parameters final in Shuffle

Wondering if this should just be something along the lines of 
MERGED_INPUT_READY_DELTA (or a better name).
MERGE_TIME_TAKEN can be misleading - since the actual merge is likely to be in 
progress while the data is being read from the input (50 in memory segments for 
example - which don't trigger any limits will only be merged while being 
consumed).

SHUFFLE_TIME_TAKEN - similar. This isn't really absolute shuffle time taken, 
since a merge could happen while transferring data which would lead to the 
fetchers sitting idle. Also doesn't factor in the time waiting for events - in 
case of pre-starting tasks for example.

When per-IO counters are not enabled, and there's multiple Inputs, this will 
end up being the value for the latest merge / fetcher which completes, and may 
not be as useful.
Need some docs to clarify the behaviour in TaskCounters 

> additional task counters for fetchers
> -------------------------------------
>
>                 Key: TEZ-1610
>                 URL: https://issues.apache.org/jira/browse/TEZ-1610
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Rajesh Balamohan
>            Assignee: Rajesh Balamohan
>         Attachments: TEZ-1610.1.patch
>
>
> - ShuffleFinishTime (per source)
> - Merge time (depending on broadcast/scatter-gather shuffle)
> This would be helpful in determining when shuffle started/ended for different 
> sources in a task.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to