GitHub user jinfengni opened a pull request:
https://github.com/apache/drill/pull/637
Drill 1950 : Parquet row group filter pushdown.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jinfengni/incubator-drill DRILL-1950
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/drill/pull/637.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #637
----
commit e87f6c616bf8921867a283a2a46865e2eca5c917
Author: AdamPD <[email protected]>
Date: 2016-06-30T21:09:53Z
DRILL-1950: Use three new classes from Adam's patch : the rule and filter
builder.
commit 86fd14b19d8d1ac06a01361b8173d19e15541afd
Author: Jinfeng Ni <[email protected]>
Date: 2016-10-31T04:30:12Z
DRILL-1950: Update parquet metadata cache format to include both min/max
and additional column type information.
Parquet meta cache format change:
1. include both min/max in ColumnMetaData if column statistics is
available,
2. include precision/scale/repetionLevel in ColumnTypeMetaData
(precision/scale is for future use).
commit 242c45e1b1b531a1c7ec3c50438e6a33fe97a711
Author: Jinfeng Ni <[email protected]>
Date: 2016-10-31T04:30:37Z
DRILL-1950: Parquet rowgroup level filter pushdown.
Implement Parquet rowgroup level filter pushdown. The filter pushdown is
performed in
query planning time, in Drill physical planning phase.
Only a local filter, which refers to columns in a single table, is
qualified for filter pushdown.
A filter may be elligible if it is a simple compare filter, or a compound
"and/or" filter consists of
simple compare filter. Data types allowed in compare filter are int,
bigint, float, double, date,
timestamp, time. Compare operators are =, !=, <, <=, >, >=. Operands have
to be a column of the above
data types, or an explicit cast or implicit cast function, or a constant
expressions.
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---