[
https://issues.apache.org/jira/browse/ARROW-5258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17662281#comment-17662281
]
Rok Mihevc commented on ARROW-5258:
-----------------------------------
This issue has been migrated to [issue
#21729|https://github.com/apache/arrow/issues/21729] on GitHub. Please see the
[migration documentation|https://github.com/apache/arrow/issues/14542] for
further details.
> [C++/Python] Expose file metadata of dataset pieces to caller
> -------------------------------------------------------------
>
> Key: ARROW-5258
> URL: https://issues.apache.org/jira/browse/ARROW-5258
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++, Python
> Reporter: Pearu Peterson
> Assignee: Pearu Peterson
> Priority: Major
> Labels: parquet, pull-request-available
> Fix For: 0.14.0
>
> Time Spent: 1h
> Remaining Estimate: 0h
>
> This issue resolves partly the issue raised in ARROW-1983 by providing a way
> to collect FileMetaData objects of dataset pieces created via
> `write_to_dataset` function:
> {noformat}
> metadata_list = []
> write_to_dataset(table, ..., metadata_collector=metadata_list)
> {noformat}
> where `metadata_list` will contain instances of file metadata objects of
> dataset pieces.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)