Riza Suminto has uploaded this change for review. ( http://gerrit.cloudera.org:8080/20406
Change subject: IMPALA-12395: Override scan cardinality for optimized count star ...................................................................... IMPALA-12395: Override scan cardinality for optimized count star The cardinality estimate in HdfsScanNode.java for count queries does not account for the fact that the count optimization only scans metadata and not the actual columns. Optimized count star scan will return only 1 row per parquet row group. This patch override the scan cardinality with total number of files, which is the closest estimate to number of row group. Similar override already exist in IcebergScanNode.java. Testing: - Add count query testcases in test_query_cpu_count_divisor_default - Pass core tests Change-Id: Id5ce967657208057d50bd80adadac29ebb51cbc5 --- M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java M testdata/workloads/functional-planner/queries/PlannerTest/resource-requirements.test M tests/custom_cluster/test_executor_groups.py M tests/custom_cluster/test_query_retries.py 4 files changed, 73 insertions(+), 23 deletions(-) git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/06/20406/1 -- To view, visit http://gerrit.cloudera.org:8080/20406 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: newchange Gerrit-Change-Id: Id5ce967657208057d50bd80adadac29ebb51cbc5 Gerrit-Change-Number: 20406 Gerrit-PatchSet: 1 Gerrit-Owner: Riza Suminto <[email protected]>
