Qifan Chen has uploaded this change for review. ( http://gerrit.cloudera.org:8080/17478 )

Change subject: [WIP] IMPALA-10709: Min/max filters should be enabled for joins into sorted columns in Parquet tables
......................................................................

[WIP] IMPALA-10709: Min/max filters should be enabled for joins into sorted columns in Parquet tables

This change set enables min/max filters by default for equi-joins into
sort by columns in a Parquet table. This takes advantage of the min/max
values being fully sorted in each data file of the table.

When the table has multiple sort by columns, only the leading column is
assigned a min/max filter.

When the query option minmax_filter_threshold is 0, the patch
automatically assigns a reasonable value for it. Otherwise, the
threshold is used as is.

The patch also automatically selects PAGE as the filtering level when
the query option minmax_filter_threshold is 0. Otherwise, the filtering
level is taken from the query option minmax_filtering_level as is.

Change-Id: I28c19c4b39b01ffa7d275fb245be85c28e9b2963
---
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/ImpalaService.thrift
M common/thrift/Query.thrift
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M fe/src/main/java/org/apache/impala/planner/RuntimeFilterGenerator.java
7 files changed, 53 insertions(+), 4 deletions(-)

  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/78/17478/4
--
To view, visit http://gerrit.cloudera.org:8080/17478
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I28c19c4b39b01ffa7d275fb245be85c28e9b2963
Gerrit-Change-Number: 17478
Gerrit-PatchSet: 4
Gerrit-Owner: Qifan Chen <[email protected]>
