Norbert Luksa has uploaded this change for review. ( http://gerrit.cloudera.org:8080/15403 )

Change subject: IMPALA-6505: Min-Max predicate push down in ORC scanner
......................................................................

IMPALA-6505: Min-Max predicate push down in ORC scanner

This commit implements min/max predicate pushdown for the ORC scanner,
leveraging the external ORC library's search arguments. We build the
search arguments when we open the scanner, since they do not need to be
modified later.

Also added a query option, orc_read_statistics. If the option is set to
true (the default), predicate pushdown takes effect; otherwise it is
skipped.

Tests:
- Run scanner tests on ORC files.
- TODO: test performance
- TODO: run test_scanners_fuzz
- Bump ORC version since predicate pushdown is not yet implemented in
  the upstream ORC lib (in review).

Change-Id: I136622413db21e0941d238ab6aeea901a6464845
---
M be/src/exec/hdfs-orc-scanner.cc
M be/src/exec/hdfs-orc-scanner.h
M be/src/exec/orc-metadata-utils.cc
M be/src/exec/orc-metadata-utils.h
M be/src/exprs/scalar-expr.h
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
10 files changed, 228 insertions(+), 8 deletions(-)

  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/03/15403/1
--
To view, visit http://gerrit.cloudera.org:8080/15403
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I136622413db21e0941d238ab6aeea901a6464845
Gerrit-Change-Number: 15403
Gerrit-PatchSet: 1
Gerrit-Owner: Norbert Luksa <[email protected]>
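
For readers unfamiliar with the idea, the min/max pushdown described in the commit message can be illustrated with a minimal, standalone C++ sketch. It is not the code in this change and does not use the ORC library's search argument API. The names MinMaxStats, CompareOp, MayContainMatch and SelectStripes are hypothetical, the statistics are assumed to cover a single non-null integer column, and per-stripe granularity is assumed for simplicity. The read_statistics flag only mimics the effect of the orc_read_statistics query option.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical min/max statistics for one integer column in one stripe.
struct MinMaxStats {
  int64_t min;
  int64_t max;
};

// Hypothetical set of comparison operators eligible for pushdown.
enum class CompareOp { kLt, kLe, kEq, kGe, kGt };

// Returns false only when the statistics prove that no row can satisfy
// `column <op> literal`. A true result means the stripe must still be read.
bool MayContainMatch(const MinMaxStats& stats, CompareOp op, int64_t literal) {
  assert(stats.min <= stats.max);
  switch (op) {
    case CompareOp::kLt: return stats.min < literal;
    case CompareOp::kLe: return stats.min <= literal;
    case CompareOp::kEq: return stats.min <= literal && literal <= stats.max;
    case CompareOp::kGe: return stats.max >= literal;
    case CompareOp::kGt: return stats.max > literal;
  }
  return true;  // Unknown operator: stay conservative and read the stripe.
}

// Returns the indexes of the stripes that cannot be skipped. With
// read_statistics == false (assumed to mirror orc_read_statistics=false),
// the statistics are ignored and every stripe is selected.
std::vector<size_t> SelectStripes(const std::vector<MinMaxStats>& stripes,
                                  CompareOp op, int64_t literal,
                                  bool read_statistics) {
  std::vector<size_t> selected;
  for (size_t i = 0; i < stripes.size(); ++i) {
    if (!read_statistics || MayContainMatch(stripes[i], op, literal)) {
      selected.push_back(i);
    }
  }
  return selected;
}
```

In this sketch the check is deliberately one-sided: it may keep a stripe that turns out to have no matching rows, but it never drops one that could, so the predicate still has to be evaluated on the rows that are read.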
