[ https://issues.apache.org/jira/browse/ARROW-15552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17488330#comment-17488330 ]
Antoine Pitrou commented on ARROW-15552:
----------------------------------------
This is probably a leftover from a draft version of the C data interface where
key-value metadata had a different encoding (perhaps JSON with base64-encoded
values).
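
For illustration, here is a minimal sketch of why base64 should not be needed,
assuming the length-prefixed binary layout that the current C data interface
spec describes for the metadata field (an int32 pair count, then an int32 byte
length followed by the raw bytes for each key and each value, with integers in
native endianness). The helper names encode_metadata and decode_metadata are
hypothetical and not part of any Arrow library.

```python
import struct


def encode_metadata(pairs):
    """Serialize (key, value) byte pairs using length-prefixed fields.

    Assumed layout: int32 pair count, then for each pair an int32 key
    length, the key bytes, an int32 value length, and the value bytes.
    "=i" means a 4-byte signed integer in native byte order.
    """
    out = [struct.pack("=i", len(pairs))]
    for key, value in pairs:
        out.append(struct.pack("=i", len(key)))
        out.append(key)
        out.append(struct.pack("=i", len(value)))
        out.append(value)
    return b"".join(out)


def decode_metadata(buf):
    """Inverse of encode_metadata: return a list of (key, value) byte pairs."""
    (count,) = struct.unpack_from("=i", buf, 0)
    offset = 4
    pairs = []
    for _ in range(count):
        fields = []
        for _ in range(2):  # first the key, then the value
            (length,) = struct.unpack_from("=i", buf, offset)
            offset += 4
            fields.append(buf[offset:offset + length])
            offset += length
        pairs.append((fields[0], fields[1]))
    return pairs


# A value containing every possible byte, including NUL, with no base64 step.
raw_value = bytes(range(256))
pairs = [
    (b"ARROW:extension:name", b"example.type"),
    (b"ARROW:extension:metadata", raw_value),
]
encoded = encode_metadata(pairs)
round_trip = decode_metadata(encoded)
```

Because every field carries an explicit byte length, arbitrary binary values
survive the round trip unchanged, so base64 would only be a choice made by a
particular producer, not something the layout itself demands.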
> [Docs][Format] Unclear wording about base64 encoding requirement of metadata
> values
> -----------------------------------------------------------------------------------
>
> Key: ARROW-15552
> URL: https://issues.apache.org/jira/browse/ARROW-15552
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Documentation, Format
> Reporter: Joris Van den Bossche
> Priority: Major
>
> The C Data Interface docs indicate that the values in key-value metadata
> should be base64 encoded, which is mentioned in the section about which
> key-value metadata to use for extension types
> (https://arrow.apache.org/docs/format/CDataInterface.html#extension-arrays):
> "The base64 encoding of metadata values ensures that any possible
> serialization is representable."
> This might not be fully correct, though (or at least base64 encoding is not
> required, contrary to what the current wording implies). A binary blob (like a
> serialized schema) can be base64 encoded, as we do when putting the Arrow
> schema in the Parquet metadata, but is that actually required here?
> cc [~apitrou]