[ 
https://issues.apache.org/jira/browse/HADOOP-4714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12651400#action_12651400
 ] 

Christian Kunz commented on HADOOP-4714:
----------------------------------------

io.sort.factor is 100, number of reducers 18,000.

>From your comment I would conclude that recordCtr is reset for every reducer, 
>making progress reporting dependent on the number of reducers. That would not 
>be good.

In my opiniion progress reporting should be configured to be at the minimum 
once per a certain time period (e.g. 1 minute). Using number of processed 
records could be used for cnvenience (although decompression/compression slows 
processing down a lot), but it would have to be a counter independent of the 
number of reducers.

> map tasks timing out during merge phase
> ---------------------------------------
>
>                 Key: HADOOP-4714
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4714
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.18.1
>            Reporter: Christian Kunz
>
> With compression of transient data turned on some parts of the merge phase 
> seem to not report progress enough.
> We see a lot of task failures during the merge phase, most of them timing out 
> (even with a 20 min timeout)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to