Jim Crist created ARROW-1983:
--------------------------------

             Summary: [Python] Add ability to write parquet `_metadata` file
                 Key: ARROW-1983
                 URL: https://issues.apache.org/jira/browse/ARROW-1983
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Jim Crist


Currently `pyarrow.parquet` can only write the `_common_metadata` file (mostly 
just schema information). It would be useful to add the ability to write a 
`_metadata` file as well. This should include information about each row group 
in the dataset, including summary statistics. Having this summary file would 
allow filtering of row groups without needing to access each file beforehand.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to