[
https://issues.apache.org/jira/browse/TEZ-3552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ming Ma updated TEZ-3552:
-------------------------
Attachment: (was: TEZ-3552-2.patch)
> Shuffle split array when size-based sorting is turned off
> ---------------------------------------------------------
>
> Key: TEZ-3552
> URL: https://issues.apache.org/jira/browse/TEZ-3552
> Project: Apache Tez
> Issue Type: Improvement
> Reporter: Ming Ma
> Assignee: Zhiyuan Yang
> Attachments: TEZ-3552-2.patch, TEZ-3552.1.patch
>
>
> TEZ-3430 adds the functionality to skip size-based split sorting to help with
> job runtime. During further testing, the original split array for certain
> inputs before sorting aren't randomly distributed in size. So when the spit
> sorting is turned off, we should shuffle the split instead of doing nothing.
> That will make the size distribution more even.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)