Hello Henry Robinson, Tim Armstrong, Dan Hecht,

I'd like you to reexamine a change. Please visit

    http://gerrit.cloudera.org:8080/8684

to look at the new patch set (#2).

Change subject: IMPALA-3804: Re-enable per-scan filtering for sequence-based scanners
......................................................................

IMPALA-3804: Re-enable per-scan filtering for sequence-based scanners

IMPALA-3798 disabled per-scan filtering for sequence-based
scanners due to a race between runtime filter arrival and
header split processing.

This commit re-enables per-scan filtering for sequence-based
files.

In HdfsScanNode::ProcessSplit() we check whether the current range
is the header of a sequence file. If it is, and the filters reject
the file, the whole file is skipped. If it is not a sequence header,
but the filters reject the partition, we call RangeComplete() on
the current scan range.

Change-Id: I4b38c26bcbe67f83efcc65a1723d766626ae3d3e
---
M be/src/exec/base-sequence-scanner.h
M be/src/exec/hdfs-scan-node-base.cc
M be/src/exec/hdfs-scan-node-base.h
M be/src/exec/hdfs-scan-node.cc
M be/src/exec/hdfs-scanner.cc
5 files changed, 39 insertions(+), 22 deletions(-)

  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/84/8684/2
--
To view, visit http://gerrit.cloudera.org:8080/8684
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I4b38c26bcbe67f83efcc65a1723d766626ae3d3e
Gerrit-Change-Number: 8684
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Reviewer: Dan Hecht <dhe...@cloudera.com>
Gerrit-Reviewer: Henry Robinson <he...@cloudera.com>
Gerrit-Reviewer: Tim Armstrong <tarmstr...@cloudera.com>
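
For readers following the commit message, the split-handling rule it describes for HdfsScanNode::ProcessSplit() can be sketched roughly as below. This is only an illustrative model, not the code in the patch: SplitInfo, SplitAction and DecideSplitAction are hypothetical names invented for this sketch, and the real change operates on Impala's own scan-range and filter types.

```cpp
// Hypothetical stand-ins for illustration; these are NOT Impala's real types.
enum class SplitAction {
  kSkipWholeFile,      // header range rejected: skip every range of the file
  kCompleteRangeOnly,  // non-header range rejected: mark only this range done
  kScan                // filters accept: process the range normally
};

struct SplitInfo {
  bool is_sequence_header;  // assumed flag: range is a sequence file header
  bool filters_reject;      // assumed flag: runtime filters reject the partition
};

// Models the decision described in the commit message:
//  - filters accept               -> scan the range
//  - rejected and header range    -> skip the whole file
//  - rejected and ordinary range  -> complete just the current range
constexpr SplitAction DecideSplitAction(const SplitInfo& split) {
  return !split.filters_reject      ? SplitAction::kScan
         : split.is_sequence_header ? SplitAction::kSkipWholeFile
                                    : SplitAction::kCompleteRangeOnly;
}
```

In this model, kCompleteRangeOnly corresponds to the RangeComplete() call on the current scan range mentioned above, while kSkipWholeFile corresponds to skipping the file when its header range is rejected.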