[
https://issues.apache.org/jira/browse/DRILL-6570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16528514#comment-16528514
]
ASF GitHub Bot commented on DRILL-6570:
---------------------------------------
sachouche opened a new pull request #1354: DRILL-6570: Fixed
IndexOutofBoundException in Parquet Reader
URL: https://github.com/apache/drill/pull/1354
Reserving same size intermediary buffers to handle the case of
false-positive; that is, a column is first thought to be fixed length (after
sampling), then reverted to variable length as we read more values.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
> IndexOutOfBoundsException when using Flat Parquet Reader
> ---------------------------------------------------------
>
> Key: DRILL-6570
> URL: https://issues.apache.org/jira/browse/DRILL-6570
> Project: Apache Drill
> Issue Type: Improvement
> Components: Storage - Parquet
> Reporter: salim achouche
> Assignee: salim achouche
> Priority: Major
> Fix For: 1.14.0
>
>
> * The Parquet Reader creates a reusable bulk entry based on the column
> precision
> * It uses the column precision for optimizing the intermediary heap buffers
> * It first detected the column was fixed length but then it reverted this
> assumption when the column changed precision
> * This step was fine except the bulk entry memory requirement changed though
> the code didn't update the bulk entry intermediary buffers
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)