[ 
https://issues.apache.org/jira/browse/HADOOP-3131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12604294#action_12604294
 ] 

Matei Zaharia commented on HADOOP-3131:
---------------------------------------

Looking at the patch submitted for HADOOP-2095, it seems that it has the same 
problem (by doing totalBytesProcessed += (key.getLength()-key.getPosition()) + 
 (value.getLength()-value.getPosition())). I can submit a separate patch 
against 18 to fix that, but it would also be good to place this in 17 because 
18 is not getting released for a while.


> enabling BLOCK compression for map outputs breaks the reduce progress counters
> ------------------------------------------------------------------------------
>
>                 Key: HADOOP-3131
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3131
>             Project: Hadoop Core
>          Issue Type: Bug
>    Affects Versions: 0.17.0, 0.17.1, 0.18.0
>            Reporter: Colin Evans
>         Attachments: merge-progress.patch, Picture 1.png
>
>
> Enabling map output compression and setting the compression type to BLOCK 
> causes the progress counters during the reduce to go crazy and report 
> progress counts over 100%.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to