Jörn Horstmann created ARROW-8231:
-------------------------------------
Summary: Parse key_value_metadata from parquet FileMetaData into
arrow schema metadata
Key: ARROW-8231
URL: https://issues.apache.org/jira/browse/ARROW-8231
Project: Apache Arrow
Issue Type: Improvement
Components: Rust
Reporter: Jörn Horstmann
The parquet-format FileMetaData struct contains optional key value pairs with
additional metadata about the schema:
[https://docs.rs/parquet-format/2.6.0/src/parquet_format/parquet_format.rs.html#3821]
When the parquet file was generated using the java avro parquet writer, this
for example contains the original avro schema under the `parquet.avro.schema`
or `avro.schema` keys.
It would be nice if this metadata was accessible through the
`arrow::datatypes::Schema.metadata` field.
I'm willing to implement and create a pull request for this feature.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)