Nils Schlicher commented on DRILL-7399:

I got the same problem in 1.17.0 with my parquet data


The following query returns wrong results. Using the new reader solves the 
problem, but the new reader is much slower than the default one.

 {{SELECT COUNT( * ) FROM data WHERE measurement_point_valid= false;}}

> Querying parquet file with boolean data type return wrong results
> -----------------------------------------------------------------
>                 Key: DRILL-7399
>                 URL: https://issues.apache.org/jira/browse/DRILL-7399
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>    Affects Versions: 1.16.0
>            Reporter: Fabian Barreiro
>            Priority: Critical
>         Attachments: newrule22_3_1.parquet
> The following query return a wrong value for the boolean column press_run_1:
>  SELECT * FROM dfs.root.`/tmp/newrule22_3_1.parquet` WHERE cycle_id=23435119
> The query return press_run_1 = 'false'
> the parquet file contain pess_run_1 = 'true' value for this record.
> You can find many records with this problem if try different selects.
> ATTACHED:  newrule22_3_1.parquet file.

This message was sent by Atlassian Jira

Reply via email to