[
https://issues.apache.org/jira/browse/IMPALA-9317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Armstrong resolved IMPALA-9317.
-----------------------------------
Fix Version/s: Impala 4.0
Resolution: Fixed
This was fixed as part of IMPALA-9156 in HdfsScanNode.java
> Improve number of instances estimate for scans in planner
> ---------------------------------------------------------
>
> Key: IMPALA-9317
> URL: https://issues.apache.org/jira/browse/IMPALA-9317
> Project: IMPALA
> Issue Type: Improvement
> Components: Frontend
> Reporter: Tim Armstrong
> Assignee: Tim Armstrong
> Priority: Major
> Labels: multithreading
> Fix For: Impala 4.0
>
>
> Currently the number of instances (used for various estimates) is calculated
> as estimated # hosts * mt_dop. This is suboptimal, because in many cases
> there isn't enough input data to allow creating that many instances per host.
> The per-host estimate for scans actually already handles this correctly by
> capping the hosts estimate at the number of input splits - we should use the
> same approach for number of instance estimates.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)