[
https://issues.apache.org/jira/browse/PARQUET-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16510053#comment-16510053
]
ASF GitHub Bot commented on PARQUET-1309:
-----------------------------------------
rdblue commented on issue #490: PARQUET-1309: Parquet Java uses incorrect stats
and dictionary filter properties
URL: https://github.com/apache/parquet-mr/pull/490#issuecomment-396701452
Thank you for fixing this, @gszadovszky!
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
> Parquet Java uses incorrect stats and dictionary filter properties
> ------------------------------------------------------------------
>
> Key: PARQUET-1309
> URL: https://issues.apache.org/jira/browse/PARQUET-1309
> Project: Parquet
> Issue Type: Bug
> Components: parquet-mr
> Reporter: Ryan Blue
> Assignee: Gabor Szadovszky
> Priority: Major
> Fix For: 1.10.1
>
>
> In SPARK-24251, we found that the changes to use HadoopReadOptions
> accidentally switched the [properties that enable stats and dictionary
> filters|https://github.com/apache/parquet-mr/blob/8bbc6cb95fd9b4b9e86c924ca1e40fd555ecac1d/parquet-hadoop/src/main/java/org/apache/parquet/HadoopReadOptions.java#L83].
> Both are enabled by default so it is unlikely that anyone will need to turn
> them off and there is an easy work-around, but we should fix the properties
> for 1.10.1. This doesn't affect the 1.8.x or 1.9.x releases (Spark 2.3.x is
> on 1.8.x).
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)