[ 
https://issues.apache.org/jira/browse/TEZ-4130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

László Bodor updated TEZ-4130:
------------------------------
    Summary: Config for hard limiting the number of splits  (was: Config for 
max task parallelism in shuffle - 
tez.shuffle-vertex-manager.max-task-parallelism)

> Config for hard limiting the number of splits
> ---------------------------------------------
>
>                 Key: TEZ-4130
>                 URL: https://issues.apache.org/jira/browse/TEZ-4130
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: László Bodor
>            Assignee: László Bodor
>            Priority: Major
>         Attachments: TEZ-4130.01.patch
>
>
> During the investigation of a customer issue, I found that tez generated a 
> dag plan containing >4k tasks. It failed for hive because of bucket number 
> limitations (4k). It can be configured properly, e.g. bigger splits 
> (tez.grouping.min-size), but maybe it would be more convenient for users to 
> config a hard limit for the number of splits.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to