GitHub user dprofeta opened a pull request:
https://github.com/apache/drill/pull/949
DRILL-5795: Parquet Filter push down at rowgroup level
Before this commit, the filter was pruning complete files. When a file
is composed of multiple rowgroups, it was not able to prune one
rowgroup from the file. Now, when the filter find that a rowgroup
doesn't match it will be remove from the scan.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/dprofeta/drill drill-5795
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/drill/pull/949.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #949
----
commit eed3395647b10d06edf86ba4378995e9fd8da83d
Author: Damien Profeta <[email protected]>
Date: 2017-09-15T18:01:58Z
Parquet Filter push down now work at rowgroup level
Before this commit, the filter was pruning complete files. When a file
is composed of multiple rowgroups, it was not able to prune one
rowgroup from the file. Now, when the filter find that a rowgroup
doesn't match it will be remove from the scan.
----
---