[
https://issues.apache.org/jira/browse/DRILL-5797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16504566#comment-16504566
]
Oleksandr Kalinin commented on DRILL-5797:
------------------------------------------
Current state:
- several ParquetReaderUtility classes have been updated for nested schema file
(that required also minor addition to SchemaPath - a method to get full path as
String)
- fieldSelected fixed
- case sensitivity issue fixed
- Unittests and complex.q queries are OK
All in all it has become more complex change that original PR, I am currently
cleaning up the code for github upload and initial review.
> Use more often the new parquet reader
> -------------------------------------
>
> Key: DRILL-5797
> URL: https://issues.apache.org/jira/browse/DRILL-5797
> Project: Apache Drill
> Issue Type: Improvement
> Components: Storage - Parquet
> Reporter: Damien Profeta
> Assignee: Oleksandr Kalinin
> Priority: Major
> Fix For: 1.14.0
>
>
> The choice of using the regular parquet reader of the optimized one is based
> of what type of columns is in the file. But the columns that are read by the
> query doesn't matter. We can increase a little bit the cases where the
> optimized reader is used by checking is the projected column are simple or
> not.
> This is an optimization waiting for the fast parquet reader to handle complex
> structure.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)