Merge result file size should honor hive.merge.size.per.task
------------------------------------------------------------

                 Key: HIVE-2037
                 URL: https://issues.apache.org/jira/browse/HIVE-2037
             Project: Hive
          Issue Type: Bug
            Reporter: Ning Zhang
            Assignee: Ning Zhang
         Attachments: HIVE-2037.patch

The merge job sets mapred.min.split.size to the value of
hive.merge.size.per.task, which is roughly the intended output file size.
However, the input split size is also determined by
mapred.min.split.size.per.node, mapred.min.split.size.per.rack, and
mapred.max.split.size. These should also be set to the value of
hive.merge.size.per.task.
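
A minimal sketch of the proposed behavior, not the contents of
HIVE-2037.patch: all four split-size properties receive the value of
hive.merge.size.per.task. The class MergeSplitSizes, the helper
mergeJobOverrides, and the size used in main are hypothetical and only
illustrate the idea. A plain Map stands in for the job configuration so the
example runs without Hadoop on the classpath.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class MergeSplitSizes {
    // The four split-size properties named in this issue.
    static final String[] SPLIT_SIZE_KEYS = {
        "mapred.min.split.size",
        "mapred.min.split.size.per.node",
        "mapred.min.split.size.per.rack",
        "mapred.max.split.size"
    };

    // Hypothetical helper: builds the overrides a merge job would apply so
    // that every split-size knob matches hive.merge.size.per.task.
    static Map<String, String> mergeJobOverrides(long mergeSizePerTask) {
        Map<String, String> overrides = new LinkedHashMap<>();
        for (String key : SPLIT_SIZE_KEYS) {
            overrides.put(key, Long.toString(mergeSizePerTask));
        }
        return overrides;
    }

    public static void main(String[] args) {
        // Illustrative value for hive.merge.size.per.task, in bytes.
        long mergeSizePerTask = 256_000_000L;
        mergeJobOverrides(mergeSizePerTask)
            .forEach((key, value) -> System.out.println(key + "=" + value));
    }
}
```

If only mapred.min.split.size is set, the other three properties keep
whatever values the cluster configured, so the resulting splits (and thus the
merged files) can still differ from the requested size.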
