[ 
https://issues.apache.org/jira/browse/HADOOP-3131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12615720#action_12615720
 ] 

Matei Zaharia commented on HADOOP-3131:
---------------------------------------

I'm not sure it's worth fixing that, because we don't need perfect progress 
reporting, just a rough guide to tell whether a task is doing something, and 
what rate it's working at. With compression enabled, it would also be  
difficult to figure out which spot in the buffer corresponds to which byte in 
the compressed file, if we were to use the position in the buffer to figure out 
progress.

> enabling BLOCK compression for map outputs breaks the reduce progress counters
> ------------------------------------------------------------------------------
>
>                 Key: HADOOP-3131
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3131
>             Project: Hadoop Core
>          Issue Type: Bug
>    Affects Versions: 0.17.0, 0.17.1, 0.17.2, 0.18.0, 0.19.0
>            Reporter: Colin Evans
>            Assignee: Matei Zaharia
>             Fix For: 0.19.0
>
>         Attachments: HADOOP-3131-v2.patch, HADOOP-3131-v3.patch, 
> HADOOP-3131-v4.patch, HADOOP-3131-v5.patch, merge-progress-trunk.patch, 
> merge-progress.patch, Picture 1.png
>
>
> Enabling map output compression and setting the compression type to BLOCK 
> causes the progress counters during the reduce to go crazy and report 
> progress counts over 100%.
> This is problematic for speculative execution because it thinks the tasks are 
> doing fine.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to