[jira] [Commented] (MAPREDUCE-2722) Gridmix simulated job's map's hdfsBytesRead counter is wrong when compressed input is used

Ravi Gummadi (JIRA) Fri, 22 Jul 2011 04:45:31 -0700

    [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13069513#comment-13069513
 ]


Ravi Gummadi commented on MAPREDUCE-2722:
-----------------------------------------

To explain the problem further: with trunk, here is an example table of 
counters for a map task of a job (the compression ratio Gridmix uses to 
generate input data is, say, 0.5):

Counter          Original job's map task    Gridmix simulated job's map task
HdfsBytesRead    100MB                      50MB
MapInputBytes    1000MB                     100MB

Since emulating the correct disk IO is more important for Gridmix, 
hdfsBytesRead is the counter that needs to be emulated correctly.
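
To make the arithmetic behind the example concrete, here is a rough sketch 
in Java. It is not Gridmix code: the class and method names are made up for 
illustration, and the last method is only one possible way to size the 
input, assuming Gridmix keeps using its configured ratio of 0.5.

```java
public class GridmixCounterSketch {
    static final long MB = 1024L * 1024L;

    // Behaviour described in this issue: the original task's hdfsBytesRead
    // is taken as the UNCOMPRESSED map input size of the simulated task.
    static long simulatedMapInputBytes(long originalHdfsBytesRead) {
        return originalHdfsBytesRead;
    }

    // The simulated task then reads that input in compressed form, so its
    // hdfsBytesRead shrinks by the ratio Gridmix used to generate the data.
    static long simulatedHdfsBytesRead(long originalHdfsBytesRead, double ratio) {
        return Math.round(simulatedMapInputBytes(originalHdfsBytesRead) * ratio);
    }

    // Hypothetical alternative (an assumption, not the committed fix):
    // uncompressed size whose compressed form equals the original
    // hdfsBytesRead at the given ratio.
    static long uncompressedSizeToMatch(long originalHdfsBytesRead, double ratio) {
        return Math.round(originalHdfsBytesRead / ratio);
    }

    public static void main(String[] args) {
        long originalHdfsBytesRead = 100 * MB; // from the table above
        double ratio = 0.5;                    // ratio assumed by Gridmix

        System.out.println("simulated MapInputBytes (MB): "
                + simulatedMapInputBytes(originalHdfsBytesRead) / MB);
        System.out.println("simulated HdfsBytesRead (MB): "
                + simulatedHdfsBytesRead(originalHdfsBytesRead, ratio) / MB);
        System.out.println("uncompressed size needed to read 100MB from HDFS (MB): "
                + uncompressedSizeToMatch(originalHdfsBytesRead, ratio) / MB);
    }
}
```

The first two values reproduce the simulated column of the table (100MB and 
50MB); the third only shows how large the uncompressed input would have to be 
for the simulated task to read the same 100MB from HDFS at that ratio.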

> Gridmix simulated job's map's hdfsBytesRead counter is wrong when compressed 
> input is used
> ------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2722
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2722
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/gridmix
>            Reporter: Ravi Gummadi
>            Assignee: Ravi Gummadi
>
> When compressed input was used by original job's map task, then the simulated 
> job's map task's hdfsBytesRead counter is wrong if compression emulation is 
> enabled. This issue is because hdfsBytesRead of map task of original job is 
> considered as uncompressed map input size by Gridmix.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
