[ 
https://issues.apache.org/jira/browse/ARROW-1983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated ARROW-1983:
--------------------------------
    Fix Version/s:     (was: 0.9.0)
                   0.10.0

> [Python] Add ability to write parquet `_metadata` file
> ------------------------------------------------------
>
>                 Key: ARROW-1983
>                 URL: https://issues.apache.org/jira/browse/ARROW-1983
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Jim Crist
>            Priority: Major
>             Fix For: 0.10.0
>
>
> Currently `pyarrow.parquet` can only write the `_common_metadata` file 
> (mostly just schema information). It would be useful to add the ability to 
> write a `_metadata` file as well. This should include information about each 
> row group in the dataset, including summary statistics. Having this summary 
> file would allow filtering of row groups without needing to access each file 
> beforehand.
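
To illustrate the row-group filtering the description refers to, here is a minimal sketch in plain Python. It does not use the pyarrow API. The `RowGroupStats` record, the `prune_row_groups` helper, and the file names and min/max values are all hypothetical, made up to show how per-row-group statistics collected in one summary file could let a reader skip row groups without opening each data file.

```python
from dataclasses import dataclass


@dataclass
class RowGroupStats:
    """Hypothetical per-row-group summary for one column (not a pyarrow type)."""
    file_path: str
    row_group: int
    num_rows: int
    col_min: int
    col_max: int


def prune_row_groups(stats, lo, hi):
    """Keep only row groups whose [col_min, col_max] range overlaps [lo, hi]."""
    return [s for s in stats if s.col_max >= lo and s.col_min <= hi]


# Illustrative values only; a real summary file would carry these per column.
metadata = [
    RowGroupStats("part-0.parquet", 0, 1000, 0, 99),
    RowGroupStats("part-0.parquet", 1, 1000, 100, 199),
    RowGroupStats("part-1.parquet", 0, 1000, 200, 299),
]

# A query for values between 150 and 250 only needs the overlapping row groups.
selected = prune_row_groups(metadata, 150, 250)
for s in selected:
    print(s.file_path, s.row_group)
```

In this sketch the first row group of `part-0.parquet` is skipped using the summary alone, since its maximum is below the requested range.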



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
