[
https://issues.apache.org/jira/browse/PARQUET-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gabor Szadovszky updated PARQUET-1365:
--------------------------------------
Description: Page level statistics are never used in production and after
adding column indexes they are completely useless. Fortunately, statistics are
optional in both the v1 and v2 pages therefore, we can safely stop writing
them. (was: Page level statistics are never used in production and after
adding column indexes they are completely useless. In page_v2 we are not able
to remove the unnecessary statistics as they are required fields in the page
header but in page_v1 it is optional. Let's get rid of the unnecessary and
optional fields in the page headers.)
> Don't write page level statistics for v1
> ----------------------------------------
>
> Key: PARQUET-1365
> URL: https://issues.apache.org/jira/browse/PARQUET-1365
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-mr
> Reporter: Gabor Szadovszky
> Assignee: Gabor Szadovszky
> Priority: Major
> Fix For: 1.11.0
>
>
> Page level statistics are never used in production and after adding column
> indexes they are completely useless. Fortunately, statistics are optional in
> both the v1 and v2 pages therefore, we can safely stop writing them.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)