Implement a memory-to-memory sort in the map task
-------------------------------------------------

                 Key: HADOOP-5870
                 URL: https://issues.apache.org/jira/browse/HADOOP-5870
             Project: Hadoop Core
          Issue Type: Improvement
          Components: mapred
            Reporter: Arun C Murthy


The motivation is similar to HADOOP-5831...

Currently we collect map outputs in the sort buffer (sized by io.sort.mb), 
which we eventually sort and spill to disk. For latency-sensitive 
applications with sufficient memory, e.g. terasort, we could do better by 
doing a memory-to-memory sort followed by a final memory-to-disk merge.
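
To make the proposal concrete, here is a rough Java sketch of the idea. It
is not the MapTask code and not a proposed patch. The class
MemToMemSortSketch, its methods, and the record-count capacity are all
hypothetical; keys are plain strings, and a List stands in for the on-disk
map output. Each full chunk of the buffer is sorted and kept in memory as a
sorted run, and the only "disk" write is the final k-way merge of the runs.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.PriorityQueue;

/** Hypothetical illustration only; names and sizes are assumptions, not Hadoop APIs. */
class MemToMemSortSketch {
    // Records per in-memory run; a stand-in for a slice of the io.sort.mb buffer.
    private final int runCapacity;
    // Sorted runs that stay in memory instead of being spilled to disk.
    private final List<List<String>> sortedRuns = new ArrayList<>();
    private List<String> current = new ArrayList<>();

    MemToMemSortSketch(int runCapacity) {
        if (runCapacity < 1) {
            throw new IllegalArgumentException("runCapacity must be >= 1");
        }
        this.runCapacity = runCapacity;
    }

    /** Collect one map output key; seal the run once it is full. */
    void collect(String key) {
        current.add(key);
        if (current.size() >= runCapacity) {
            sealRun();
        }
    }

    /** Memory-to-memory step: sort the current run and keep it in memory. */
    private void sealRun() {
        if (current.isEmpty()) {
            return;
        }
        Collections.sort(current);
        sortedRuns.add(current);
        current = new ArrayList<>();
    }

    /** Final step: k-way merge of all in-memory runs; 'out' stands in for the disk file. */
    List<String> mergeToOutput() {
        sealRun();
        // Heap entries are {runIndex, offsetInRun}, ordered by the key they point at.
        PriorityQueue<int[]> heap = new PriorityQueue<>(
            (a, b) -> sortedRuns.get(a[0]).get(a[1])
                          .compareTo(sortedRuns.get(b[0]).get(b[1])));
        for (int r = 0; r < sortedRuns.size(); r++) {
            heap.add(new int[] {r, 0});
        }
        List<String> out = new ArrayList<>();
        while (!heap.isEmpty()) {
            int[] head = heap.poll();
            List<String> run = sortedRuns.get(head[0]);
            out.add(run.get(head[1]));
            if (head[1] + 1 < run.size()) {
                heap.add(new int[] {head[0], head[1] + 1});
            }
        }
        return out;
    }
}
```

The sketch ignores partitioning, values, combiners and byte-level
accounting, all of which a real change to the map task would have to deal
with. It only shows the ordering of the two phases.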

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
