[ 
https://issues.apache.org/jira/browse/ARROW-16430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17611680#comment-17611680
 ] 

Steve M. Kim edited comment on ARROW-16430 at 9/30/22 4:25 PM:
---------------------------------------------------------------

After further thought and re-reading the flatbuffer definitions and c++ 
implementation, I rescind my statement about custom metadata on a RecordBatch. 
The {{RecordBatch}} table doesn't have a {{custom_metadata}} field.

However, I still think that there is a confusion between the metadata of a 
{{pyarrow.Schema}} versus the metadata of a {{pyarrow.Message}} that contains a 
{{{}Schema{}}}. It ought to be possible to specify custom metadata when 
serializing a {{Schema}} as a IPC message, in exactly the same way that it 
ought to be possible to specify custom metadata when serializing a 
{{RecordBatch}} or {{DictionaryBatch}} as a IPC message. 


was (Author: chairmank):
After further thought and re-reading the flatbuffer definitions and c++ 
implementation, I rescind my statement about custom metadata on a RecordBatch. 
The {{RecordBatch}} table doesn't have a {{custom_metadata}} field.

However, I still think that there is a confusion between the metadata of a 
{{pyarrow.Schema}} versus the metadata of a {{pyarrow.Message}} that contains 
a{{ Schema.}} It ought to be possible to specify custom metadata when 
serializing a {{Schema}} as a IPC message, in exactly the same way that it 
ought to be possible to specify custom metadata when serializing a 
{{RecordBatch}} or {{DictionaryBatch}} as a IPC message. 

> [Python] Read/Write record batch custom metadata API in pyarrow
> ---------------------------------------------------------------
>
>                 Key: ARROW-16430
>                 URL: https://issues.apache.org/jira/browse/ARROW-16430
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>    Affects Versions: 7.0.0
>            Reporter: Yue Ni
>            Assignee: Yue Ni
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 7h 20m
>  Remaining Estimate: 0h
>
> In https://issues.apache.org/jira/browse/ARROW-16131, Arrow C++ APIs were 
> added so that users can read/write record batch custom metadata for IPC file. 
> But pyarrow still lacks corresponding APIs for doing this.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to