[ https://issues.apache.org/jira/browse/PARQUET-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gabor Szadovszky updated PARQUET-1365:
--------------------------------------
    Description: Page-level statistics are never used in production, and after 
adding column indexes they are completely useless. Fortunately, statistics are 
optional in both v1 and v2 pages; therefore, we can safely stop writing 
them.  (was: Page-level statistics are never used in production, and after 
adding column indexes they are completely useless. In page_v2 we are not able 
to remove the unnecessary statistics, as they are required fields in the page 
header, but in page_v1 they are optional. Let's get rid of the unnecessary and 
optional fields in the page headers.)

> Don't write page level statistics for v1
> ----------------------------------------
>
>                 Key: PARQUET-1365
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1365
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-mr
>            Reporter: Gabor Szadovszky
>            Assignee: Gabor Szadovszky
>            Priority: Major
>             Fix For: 1.11.0
>
>
> Page-level statistics are never used in production, and after adding column 
> indexes they are completely useless. Fortunately, statistics are optional in 
> both v1 and v2 pages; therefore, we can safely stop writing them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
