Jim Crist created ARROW-1983:
--------------------------------
Summary: [Python] Add ability to write parquet `_metadata` file
Key: ARROW-1983
URL: https://issues.apache.org/jira/browse/ARROW-1983
Project: Apache Arrow
Issue Type: Improvement
Components: Python
Reporter: Jim Crist
Currently `pyarrow.parquet` can only write the `_common_metadata` file (mostly
just schema information). It would be useful to add the ability to write a
`_metadata` file as well. This should include information about each row group
in the dataset, including summary statistics. Having this summary file would
allow filtering of row groups without needing to access each file beforehand.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)