[ 
https://issues.apache.org/jira/browse/ARROW-15552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Antoine Pitrou resolved ARROW-15552.
------------------------------------
    Fix Version/s: 8.0.0
       Resolution: Fixed

Issue resolved by pull request 12361
[https://github.com/apache/arrow/pull/12361]

> [Docs][Format] Unclear wording about base64 encoding requirement of metadata 
> values
> -----------------------------------------------------------------------------------
>
>                 Key: ARROW-15552
>                 URL: https://issues.apache.org/jira/browse/ARROW-15552
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Documentation, Format
>            Reporter: Joris Van den Bossche
>            Assignee: Antoine Pitrou
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 8.0.0
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> The C Data Interface docs state that the values in key-value metadata 
> should be base64 encoded. The statement appears in the section describing 
> which key-value metadata to use for extension types 
> (https://arrow.apache.org/docs/format/CDataInterface.html#extension-arrays):
> bq. The base64 encoding of metadata values ensures that any possible 
> serialization is representable.
> This might not be fully correct, though (or at least base64 encoding is not 
> required, contrary to what the current wording implies). A binary blob (like 
> a serialized schema) can be base64 encoded, as we do when putting the Arrow 
> schema in the Parquet metadata, but is this actually required?
> cc [~apitrou]
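
To illustrate the question raised above, here is a minimal sketch, assuming the length-prefixed key-value layout described in the C Data Interface docs (an int32 pair count, then an int32 length followed by the bytes for each key and each value, in native byte order). The helper names `encode_metadata` and `decode_metadata` are hypothetical and not part of any Arrow API, and the 256-byte blob is only a stand-in for a serialized schema. Under these assumptions, arbitrary bytes survive a round trip without base64; base64 is shown separately for containers that only accept text.

```python
import base64
import struct


def encode_metadata(pairs):
    """Pack (key, value) byte strings with int32 length prefixes.

    Assumed layout: int32 pair count, then for each pair an int32 key
    length, the key bytes, an int32 value length, and the value bytes.
    """
    out = [struct.pack("=i", len(pairs))]
    for key, value in pairs:
        out.append(struct.pack("=i", len(key)) + key)
        out.append(struct.pack("=i", len(value)) + value)
    return b"".join(out)


def decode_metadata(buf):
    """Inverse of encode_metadata: return a list of (key, value) tuples."""
    (count,) = struct.unpack_from("=i", buf, 0)
    offset = 4
    pairs = []
    for _ in range(count):
        fields = []
        for _ in range(2):  # first the key, then the value
            (length,) = struct.unpack_from("=i", buf, offset)
            offset += 4
            fields.append(buf[offset:offset + length])
            offset += length
        pairs.append(tuple(fields))
    return pairs


# Stand-in for a serialized schema: every possible byte value, including NUL.
blob = bytes(range(256))

# Length prefixes make the raw bytes representable as-is.
encoded = encode_metadata([(b"ARROW:extension:metadata", blob)])
assert decode_metadata(encoded) == [(b"ARROW:extension:metadata", blob)]

# base64 is only needed when the surrounding container requires text.
text_value = base64.b64encode(blob)
assert base64.b64decode(text_value) == blob
```

Because each value carries its own length, embedded NUL bytes or non-UTF-8 data do not need escaping in this layout, which is why base64 looks optional rather than required here.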



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
