[ https://issues.apache.org/jira/browse/HADOOP-1965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Amar Kamat updated HADOOP-1965:
-------------------------------

    Attachment: HADOOP-1965-Benchmark.patch

The earlier two patches work around the problem of generating random data in the benchmark. Resubmitting the final patch (the only change in this patch is that the key size can be configured).

> Handle map output buffers better
> --------------------------------
>
>                 Key: HADOOP-1965
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1965
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.16.0
>            Reporter: Devaraj Das
>            Assignee: Amar Kamat
>             Fix For: 0.16.0
>
>         Attachments: 1965_single_proc_150mb_gziped.jpeg, 1965_single_proc_150mb_gziped.pdf, 1965_single_proc_150mb_gziped_breakup.png, HADOOP-1965-1.patch, HADOOP-1965-Benchmark.patch, HADOOP-1965-Benchmark.patch, HADOOP-1965-Benchmark.patch, HADOOP-1965-Benchmark.patch
>
>
> Today, the map task stops calling the map method while sort/spill is using the (single instance of) map output buffer. One improvement that can be done to improve performance of the map task is to have another buffer for writing the map outputs to, while sort/spill is using the first buffer.
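
For illustration only, the two-buffer idea in the issue description could be sketched roughly as below. This is not the code in any of the attached patches; the class `DoubleBufferSketch` and its methods `collect` and `close` are hypothetical names, records are plain strings, and a "spill" is just a sorted in-memory copy rather than a file on disk.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch: the map side keeps writing into one buffer while a
// background thread sorts and "spills" the other one.
public class DoubleBufferSketch {
    private final int capacity;
    private List<String> active;     // buffer the map side currently writes to
    private List<String> standby;    // buffer being spilled, or free for reuse
    private Future<?> pendingSpill;  // in-flight spill of `standby`, if any
    private final ExecutorService spiller = Executors.newSingleThreadExecutor();
    private final List<List<String>> spills =
            Collections.synchronizedList(new ArrayList<>());

    public DoubleBufferSketch(int capacity) {
        this.capacity = capacity;
        this.active = new ArrayList<>(capacity);
        this.standby = new ArrayList<>(capacity);
    }

    // Called by the (single) map thread for every output record.
    public void collect(String record)
            throws InterruptedException, ExecutionException {
        active.add(record);
        if (active.size() >= capacity) {
            swapAndSpill();
        }
    }

    private void swapAndSpill()
            throws InterruptedException, ExecutionException {
        // Block only if the previous spill has not finished yet;
        // otherwise the map thread continues immediately.
        if (pendingSpill != null) {
            pendingSpill.get();
        }
        standby.clear();
        final List<String> full = active;
        active = standby;
        standby = full;
        pendingSpill = spiller.submit(() -> {
            List<String> sorted = new ArrayList<>(full);
            Collections.sort(sorted);   // stand-in for sort + write to disk
            spills.add(sorted);
        });
    }

    // Flushes any remaining records and returns the spills in order.
    public List<List<String>> close()
            throws InterruptedException, ExecutionException {
        if (!active.isEmpty()) {
            swapAndSpill();
        }
        if (pendingSpill != null) {
            pendingSpill.get();
        }
        spiller.shutdown();
        return spills;
    }
}
```

With a capacity of 2, collecting "d", "a", "c", "b", "e" and then calling `close()` yields three sorted spills. The map thread stalls only when both buffers are busy, which is the case the issue aims to make rare.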