MapOutputBuffer allocates 4x as much space to record capacity as intended
-------------------------------------------------------------------------

                 Key: HADOOP-3475
                 URL: https://issues.apache.org/jira/browse/HADOOP-3475
             Project: Hadoop Core
          Issue Type: Bug
    Affects Versions: 0.17.0
            Reporter: Chris Douglas
            Assignee: Chris Douglas
             Fix For: 0.18.0


When setting up its accounting tables, MapTask.MapOutputBuffer calculates the 
size of each of its buffers as:
{noformat}
Where #total_bytes is $io.sort.mb * 2^20 and #accounting_bytes is 
$io.sort.record.percent * #total_bytes
kvbuffer: #total_bytes - #accounting_bytes
kvoffsets: 1/4 of #accounting_bytes
kvindices: 3/4 of #accounting_bytes
{noformat}

kvoffsets and kvindices are ints, not bytes; the size of both arrays is 
incorrectly calculated.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to