[jira] [Commented] (DRILL-5797) Use more often the new parquet reader

Oleksandr Kalinin (JIRA) Thu, 07 Jun 2018 04:56:23 -0700


    [ 
https://issues.apache.org/jira/browse/DRILL-5797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16504566#comment-16504566
 ]


Oleksandr Kalinin commented on DRILL-5797:
------------------------------------------

Current state:

- several ParquetReaderUtility classes have been updated for nested schema file 
(that required also minor addition to SchemaPath - a method to get full path as 
String)
- fieldSelected fixed
- case sensitivity issue fixed
- Unittests and complex.q queries are OK

All in all it has become more complex change that original PR, I am currently 
cleaning up the code for github upload and initial review.

> Use more often the new parquet reader
> -------------------------------------
>
>                 Key: DRILL-5797
>                 URL: https://issues.apache.org/jira/browse/DRILL-5797
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - Parquet
>            Reporter: Damien Profeta
>            Assignee: Oleksandr Kalinin
>            Priority: Major
>             Fix For: 1.14.0
>
>
> The choice of using the regular parquet reader of the optimized one is based 
> of what type of columns is in the file. But the columns that are read by the 
> query doesn't matter. We can increase a little bit the cases where the 
> optimized reader is used by checking is the projected column are simple or 
> not.
> This is an optimization waiting for the fast parquet reader to handle complex 
> structure.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (DRILL-5797) Use more often the new parquet reader

Reply via email to