[
https://issues.apache.org/jira/browse/MAPREDUCE-2722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13069513#comment-13069513
]
Ravi Gummadi commented on MAPREDUCE-2722:
-----------------------------------------
To explain the problem further: with trunk, here is an example table of
counters for a map task of a job (assume the compression ratio Gridmix uses to
generate input data is 0.5):
Counter          Original job's map task   Gridmix simulated job's map task
HdfsBytesRead    100MB                     50MB
MapInputBytes    1000MB                    100MB
Because emulating disk I/O correctly matters more for Gridmix, hdfsBytesRead is
the counter that must be emulated correctly.
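The arithmetic behind the simulated column can be sketched as follows. This is only an illustration of the numbers in the table, not Gridmix code: the function names are hypothetical, and the second function shows one possible way to preserve hdfsBytesRead (an assumption, not a description of the actual fix).

```python
def simulated_counters_current(orig_hdfs_bytes_read_mb, compression_ratio):
    """Behavior described in this issue: the original hdfsBytesRead is
    treated as the uncompressed map input size."""
    map_input_mb = orig_hdfs_bytes_read_mb            # taken as uncompressed size
    hdfs_read_mb = map_input_mb * compression_ratio   # compressed data read from HDFS
    return {"HdfsBytesRead": hdfs_read_mb, "MapInputBytes": map_input_mb}


def simulated_counters_preserving_io(orig_hdfs_bytes_read_mb, compression_ratio):
    """Hypothetical alternative: keep hdfsBytesRead equal to the original and
    derive the uncompressed size from the configured compression ratio."""
    hdfs_read_mb = orig_hdfs_bytes_read_mb
    map_input_mb = hdfs_read_mb / compression_ratio
    return {"HdfsBytesRead": hdfs_read_mb, "MapInputBytes": map_input_mb}


# Original map task read 100MB from HDFS; Gridmix compression ratio is 0.5.
current = simulated_counters_current(100, 0.5)
preserving = simulated_counters_preserving_io(100, 0.5)
```

With the values from the table, the first function gives HdfsBytesRead = 50MB and MapInputBytes = 100MB, matching the simulated column. The second keeps HdfsBytesRead at 100MB, at the cost of MapInputBytes (200MB) still differing from the original 1000MB.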
> Gridmix simulated job's map's hdfsBytesRead counter is wrong when compressed
> input is used
> ------------------------------------------------------------------------------------------
>
> Key: MAPREDUCE-2722
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2722
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: contrib/gridmix
> Reporter: Ravi Gummadi
> Assignee: Ravi Gummadi
>
> When the original job's map task used compressed input, the simulated job's
> map task's hdfsBytesRead counter is wrong if compression emulation is
> enabled. This happens because Gridmix treats the hdfsBytesRead of the
> original job's map task as the uncompressed map input size.