[
https://issues.apache.org/jira/browse/ARROW-5258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-5258:
-----------------------------------------
Labels: parquet (was: )
> [C++/Python] Expose file metadata of dataset pieces to caller
> -------------------------------------------------------------
>
> Key: ARROW-5258
> URL: https://issues.apache.org/jira/browse/ARROW-5258
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++, Python
> Reporter: Pearu Peterson
> Assignee: Pearu Peterson
> Priority: Major
> Labels: parquet
>
> This issue partly resolves ARROW-1983 by providing a way to collect the
> FileMetaData objects of the dataset pieces written by the
> `write_to_dataset` function:
> {noformat}
> metadata_list = []
> write_to_dataset(table, ..., metadata_collector=metadata_list)
> {noformat}
> After the call, `metadata_list` contains the FileMetaData objects of the
> dataset pieces that were written.