[
https://issues.apache.org/jira/browse/HADOOP-3131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12604294#action_12604294
]
Matei Zaharia commented on HADOOP-3131:
---------------------------------------
Looking at the patch submitted for HADOOP-2095, it seems that it has the same
problem (by doing totalBytesProcessed += (key.getLength()-key.getPosition()) +
(value.getLength()-value.getPosition())). I can submit a separate patch
against 18 to fix that, but it would also be good to place this in 17 because
18 is not getting released for a while.
> enabling BLOCK compression for map outputs breaks the reduce progress counters
> ------------------------------------------------------------------------------
>
> Key: HADOOP-3131
> URL: https://issues.apache.org/jira/browse/HADOOP-3131
> Project: Hadoop Core
> Issue Type: Bug
> Affects Versions: 0.17.0, 0.17.1, 0.18.0
> Reporter: Colin Evans
> Attachments: merge-progress.patch, Picture 1.png
>
>
> Enabling map output compression and setting the compression type to BLOCK
> causes the progress counters during the reduce to go crazy and report
> progress counts over 100%.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.