[ https://issues.apache.org/jira/browse/MAPREDUCE-5947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
jaehoon ko updated MAPREDUCE-5947:
----------------------------------
Component/s: task
> Map phase merge can better utilize memory
> -----------------------------------------
>
> Key: MAPREDUCE-5947
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-5947
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: performance, task
> Affects Versions: 2.4.0
> Reporter: jaehoon ko
> Assignee: jaehoon ko
> Labels: newbie, performance
>
> The map-phase merge reads spills from disk, writes intermediate results back
> to disk, and repeats this on each merge pass. I think intermediate results
> could be kept in memory instead, thereby reducing disk I/O. Because kvbuffer
> is set to null right before the merge, at least io.sort.mb megabytes of heap
> are available. MAPREDUCE-4511 can be seen as an effort to use memory better
> through read-ahead, but it leaves the number of disk I/O operations unchanged.
> Please share your thoughts. I'd like to take up this issue.
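
To make the proposal concrete, here is a rough sketch of the accounting involved. It is not Hadoop's Merger code: the class SpillMergeSketch, the method intermediateDiskBytes, and the smallest-runs-first merge policy are all hypothetical and exist only for illustration. It simulates a multi-pass merge with a fixed merge factor and counts how many bytes of intermediate output go to disk when merged runs are kept on the heap whenever they fit within a memory budget (for example, the space freed by kvbuffer).

```java
import java.util.PriorityQueue;

// Hypothetical illustration only; names and policy are assumptions,
// not part of Hadoop's MapTask or Merger.
final class SpillMergeSketch {

    /** A sorted run, either a spill on disk or an intermediate result. */
    static final class Run {
        final long bytes;
        final boolean inMemory;

        Run(long bytes, boolean inMemory) {
            this.bytes = bytes;
            this.inMemory = inMemory;
        }
    }

    /**
     * Simulates intermediate merge passes and returns the number of bytes
     * written to disk by them. The final merge (once at most `factor` runs
     * remain) writes the map output file and is not counted.
     *
     * @param spillBytes   sizes of the on-disk spills
     * @param factor       maximum number of runs merged at once (at least 2)
     * @param memoryBudget heap bytes available for intermediate results
     */
    static long intermediateDiskBytes(long[] spillBytes, int factor, long memoryBudget) {
        if (factor < 2) {
            throw new IllegalArgumentException("factor must be at least 2");
        }
        // Assumed policy: always merge the smallest runs first.
        PriorityQueue<Run> runs =
            new PriorityQueue<>((a, b) -> Long.compare(a.bytes, b.bytes));
        for (long size : spillBytes) {
            runs.add(new Run(size, false));
        }

        long memoryUsed = 0;
        long diskWritten = 0;
        while (runs.size() > factor) {
            long merged = 0;
            long freed = 0;
            for (int i = 0; i < factor; i++) {
                Run r = runs.poll();
                merged += r.bytes;
                if (r.inMemory) {
                    freed += r.bytes;
                }
            }
            // Conservative check: inputs held in memory are released only
            // after the merge, so the output must fit alongside them.
            boolean fits = memoryUsed + merged <= memoryBudget;
            if (fits) {
                memoryUsed += merged;
            } else {
                diskWritten += merged;
            }
            memoryUsed -= freed;
            runs.add(new Run(merged, fits));
        }
        return diskWritten;
    }

    public static void main(String[] args) {
        long[] spills = {10, 10, 10, 10, 10, 10, 10, 10, 10, 10};
        System.out.println(intermediateDiskBytes(spills, 4, 0));
        System.out.println(intermediateDiskBytes(spills, 4, 50));
        System.out.println(intermediateDiskBytes(spills, 4, 100));
    }
}
```

With a budget of zero the sketch behaves like a purely disk-based merge; raising the budget moves intermediate runs onto the heap and lowers the count. A real change would also need to account for record framing, compression, and the actual pass-planning logic, none of which is modeled here.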