[ https://issues.apache.org/jira/browse/TEZ-3168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15200619#comment-15200619 ]
Bikas Saha commented on TEZ-3168:
---------------------------------

For all of the problems with queue capacity, IMO cluster capacity is a more stable metric to look at. Logically, the data is distributed across the cluster, so the split calculation should account for that dispersion. This also solves the current immediate problem of creating splits that are too small. Essentially the job wants to run tasks across all cluster nodes. The queue capacity determines how the job gets waves/windows of tasks that move around the cluster to read that data locally.

> Provide a more predictable approach for total resource guidance for
> wave/split calculation
> -------------------------------------------------------------------------------------------
>
>                 Key: TEZ-3168
>                 URL: https://issues.apache.org/jira/browse/TEZ-3168
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Hitesh Shah
>            Assignee: Hitesh Shah
>         Attachments: TEZ-3168.wip.2.patch, TEZ-3168.wip.patch
>
>
> Currently, Tez uses headroom for checking total available resources. This is
> flaky as it ends up causing the split count to be determined by a
> point-in-time lookup at what is available in the cluster. A better approach
> would be either the queue size or even cluster size to get a more
> predictable count.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
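
To illustrate the point made in the comment, here is a rough sketch of why the choice of "total resource" matters for the split count. It is not taken from the attached patches. The class `SplitGuidance`, the method `desiredSplits`, the waves multiplier of 1.5, and all memory figures are assumptions made up for this example. The only idea borrowed from the discussion is that the split count is derived from a total resource figure, and that headroom is a point-in-time value while cluster capacity is stable.

```java
// Hypothetical sketch: none of these names come from the Tez code base.
final class SplitGuidance {

    /**
     * Derives a target split count from a total resource figure.
     *
     * @param totalResourceMb resource used as guidance (headroom, queue
     *                        capacity, or cluster capacity), in MB
     * @param taskResourceMb  memory requested per task, in MB
     * @param waves           assumed multiplier for how many waves of tasks
     *                        the job should run over that resource
     */
    static int desiredSplits(long totalResourceMb, long taskResourceMb, double waves) {
        if (taskResourceMb <= 0) {
            throw new IllegalArgumentException("taskResourceMb must be positive");
        }
        // Number of tasks that fit into the resource at the same time.
        long tasksPerWave = totalResourceMb / taskResourceMb;
        // Never return fewer than one split, even if nothing fits right now.
        return (int) Math.max(1, Math.floor(tasksPerWave * waves));
    }

    public static void main(String[] args) {
        long taskMb = 4096L;      // assumed task size
        double waves = 1.5;       // assumed multiplier, chosen for round numbers

        long clusterMb = 409600L; // assumed cluster capacity (400 GB)
        long busyHeadroomMb = 8192L;   // headroom sampled on a busy cluster
        long idleHeadroomMb = 204800L; // headroom sampled on a quieter cluster

        // Headroom gives a different answer depending on when it is sampled.
        System.out.println("headroom (busy): " + desiredSplits(busyHeadroomMb, taskMb, waves));
        System.out.println("headroom (idle): " + desiredSplits(idleHeadroomMb, taskMb, waves));
        // Cluster capacity gives the same answer on every run.
        System.out.println("cluster:         " + desiredSplits(clusterMb, taskMb, waves));
    }
}
```

With these made-up numbers the headroom-based count swings between 3 and 75 depending on the sampling moment, while the cluster-based count stays at 150. Under this sketch, queue capacity would then only affect how many of those tasks run concurrently, not how many splits are created.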