Wes McKinney created ARROW-14303:
------------------------------------
Summary: [C++][Parquet] Do not duplicate Schema metadata in
Parquet schema metadata and serialized ARROW:schema value
Key: ARROW-14303
URL: https://issues.apache.org/jira/browse/ARROW-14303
Project: Apache Arrow
Issue Type: Bug
Components: C++
Reporter: Wes McKinney
Fix For: 6.0.0
Metadata values are being duplicated in the Parquet file footer — we should
either only store them in ARROW:schema or the Parquet schema metadata. Removing
them from the Parquet schema metadata may break applications that are expecting
that metadata to be there when serialized from Arrow, so dropping the keys from
ARROW:schema is probably a safer choice
--
This message was sent by Atlassian Jira
(v8.3.4#803005)