[
https://issues.apache.org/jira/browse/TEZ-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16832940#comment-16832940
]
Todd Lipcon commented on TEZ-3310:
----------------------------------
Sure enough, this is causing problems for pseudo-distributed-cluster testing.
The "min split length" config gets ignored because all of the splits are on
localhost, and thus queries have different behavior on this cluster than on a
remote one.
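
To illustrate the kind of behavior one might expect when locality is not
useful, here is a minimal sketch of size-only grouping that honors a minimum
group length no matter which hosts the splits report. This is not Tez's
grouping code: the class name, the method, and the minGroupBytes parameter
are all hypothetical and exist only for this example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of Tez.
final class SizeBasedSplitGrouper {

    // Packs split lengths, in order, into groups. A group is closed as soon as
    // its total length reaches minGroupBytes. Host locations are deliberately
    // ignored, so an all-localhost cluster and a remote cluster group the same.
    static List<List<Long>> group(List<Long> splitLengths, long minGroupBytes) {
        List<List<Long>> groups = new ArrayList<>();
        List<Long> current = new ArrayList<>();
        long currentBytes = 0;
        for (long length : splitLengths) {
            current.add(length);
            currentBytes += length;
            if (currentBytes >= minGroupBytes) {
                groups.add(current);
                current = new ArrayList<>();
                currentBytes = 0;
            }
        }
        // Any leftover splits form a final group that may be below the minimum.
        if (!current.isEmpty()) {
            groups.add(current);
        }
        return groups;
    }
}
```

With split lengths of 40, 40, 40, 40 and 10 and a minimum of 100, this sketch
yields two groups: the first three splits, then the remaining two.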
> Handle splits grouping better when locality information is not available (or
> only when localhost is available)
> --------------------------------------------------------------------------------------------------------------
>
> Key: TEZ-3310
> URL: https://issues.apache.org/jira/browse/TEZ-3310
> Project: Apache Tez
> Issue Type: Improvement
> Reporter: Rajesh Balamohan
> Priority: Minor
>
> This is a follow-up JIRA to TEZ-3291. TEZ-3291 tries to handle the case when
> only localhost is specified in the locations. It would be good to improve
> handling of splits grouping when Tez does not have enough information about
> the locality.