[
https://issues.apache.org/jira/browse/HIVE-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ning Zhang updated HIVE-2037:
-----------------------------
Attachment: HIVE-2037.patch
> Merge result file size should honor hive.merge.size.per.task
> ------------------------------------------------------------
>
> Key: HIVE-2037
> URL: https://issues.apache.org/jira/browse/HIVE-2037
> Project: Hive
> Issue Type: Bug
> Reporter: Ning Zhang
> Assignee: Ning Zhang
> Attachments: HIVE-2037.patch
>
>
> The merge job set mapred.min.split.size to the value of
> hive.merge.size.per.task, which roughly equals to the output file size.
> However the input split size is also determined by
> mapred.min.split.size.per.node, mapred.min.split.size.per.rack, and
> mapred.max.split.size. They should be set the same as
> hive.merge.size.per.task as well.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira