[
https://issues.apache.org/jira/browse/MAPREDUCE-3008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Amar Kamat updated MAPREDUCE-3008:
----------------------------------
Attachment: mapreduce-2591-v1.4.2.patch
Attaching a patch that improves CPU emulation for short running tasks. Areas of
improvements:
1. Sorter/Comparator now is CPU emulation aware
2. For tasks with no spills/merges, aggressive CPU emulation is done.
tets-patch and JUnit tests for Gridmix passed.
> [Gridmix] Improve cumulative CPU usage emulation for short running tasks
> ------------------------------------------------------------------------
>
> Key: MAPREDUCE-3008
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3008
> Project: Hadoop Map/Reduce
> Issue Type: Sub-task
> Components: contrib/gridmix
> Affects Versions: 0.24.0
> Reporter: Amar Kamat
> Labels: cpu-emulation, gridmix
> Fix For: 0.24.0
>
> Attachments: mapreduce-2591-v1.4.2.patch
>
>
> CPU emulation in Gridmix fails to meet the expected target if the map has no
> data to sort/spill/merge. There are 2 major reasons for this:
> 1. The map task end immediately ends soon after the map task. The map
> progress is 67% while the map phase ends.
> 2. Currently, the sort (comparator) doesnt emulate CPU. If the map is short
> lived, the CPU emulation thread (spawned from the map task in cleanup)
> doesn't get a chance to emulate.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira