[ 
https://issues.apache.org/jira/browse/DRILL-6744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arina Ielchiieva updated DRILL-6744:
------------------------------------
    Description: Since now Drill is using Apache Parquet 1.10.0 where issue 
with incorrectly stored varchar / decimal min / max statistics is resolved, we 
should add support for varchar / decimal filter push down. Only files created 
with parquet lib 1.9.1 (1.10.0)) and later will be subjected to push down. In 
cases if user knows that prior created files have correct min / max statistics 
(i.e. user exactly knows that data in binary columns in ASCII (not UTF-8)) than 
parquet.strings.signed-min-max.enabled can be set to true to enable filter push 
down.  (was: Since now Drill is using Apache Parquet 1.10.0 where issue with 
incorrectly stored varchar / decimal min / max statistics is resolved, we 
should add support for varchar / decimal filter push down. Only files created 
with parquet lib 1.9.1 and later will be subjected to push down. In cases if 
user knows that prior created files have correct min / max statistics (i.e. 
user exactly knows that data in binary columns in ASCII (not UTF-8)) than 
parquet.strings.signed-min-max.enabled can be set to true to enable filter push 
down.)

> Support filter push down for varchar / decimal data types
> ---------------------------------------------------------
>
>                 Key: DRILL-6744
>                 URL: https://issues.apache.org/jira/browse/DRILL-6744
>             Project: Apache Drill
>          Issue Type: Improvement
>    Affects Versions: 1.14.0
>            Reporter: Arina Ielchiieva
>            Assignee: Arina Ielchiieva
>            Priority: Major
>              Labels: doc-impacting
>             Fix For: 1.15.0
>
>
> Since now Drill is using Apache Parquet 1.10.0 where issue with incorrectly 
> stored varchar / decimal min / max statistics is resolved, we should add 
> support for varchar / decimal filter push down. Only files created with 
> parquet lib 1.9.1 (1.10.0)) and later will be subjected to push down. In 
> cases if user knows that prior created files have correct min / max 
> statistics (i.e. user exactly knows that data in binary columns in ASCII (not 
> UTF-8)) than parquet.strings.signed-min-max.enabled can be set to true to 
> enable filter push down.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to