Jörn Horstmann created ARROW-8231:
-------------------------------------

             Summary: Parse key_value_metadata from parquet FileMetaData into arrow schema metadata
                 Key: ARROW-8231
                 URL: https://issues.apache.org/jira/browse/ARROW-8231
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Rust
            Reporter: Jörn Horstmann


The parquet-format FileMetaData struct contains an optional list of key-value 
pairs (`key_value_metadata`) with additional metadata about the schema:

[https://docs.rs/parquet-format/2.6.0/src/parquet_format/parquet_format.rs.html#3821]

For example, when the parquet file was written with the Java Avro parquet 
writer, these pairs contain the original Avro schema under the 
`parquet.avro.schema` or `avro.schema` key.

It would be nice if this metadata were accessible through the 
`arrow::datatypes::Schema.metadata` field.
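
As a rough sketch of the conversion I have in mind (not the final 
implementation): the optional list of pairs is collected into a 
`HashMap<String, String>`, which I assume is the shape of the schema metadata 
map. The `KeyValue` struct below is a local stand-in for the parquet-format 
type, and the function name `key_value_metadata_to_map` is hypothetical. 
Skipping pairs that have no value is also an assumption.

```rust
use std::collections::HashMap;

/// Local stand-in for the parquet-format `KeyValue` struct (assumed shape:
/// a string key with an optional string value).
#[derive(Debug, Clone)]
pub struct KeyValue {
    pub key: String,
    pub value: Option<String>,
}

/// Hypothetical helper: collects the optional key-value pairs from the file
/// metadata into a string map. A missing list yields an empty map, and
/// pairs without a value are skipped (an assumption, not settled behavior).
pub fn key_value_metadata_to_map(pairs: Option<&[KeyValue]>) -> HashMap<String, String> {
    pairs
        .unwrap_or(&[])
        .iter()
        .filter_map(|pair| {
            pair.value
                .as_ref()
                .map(|value| (pair.key.clone(), value.clone()))
        })
        .collect()
}
```

With a mapping like this, a caller could look up `parquet.avro.schema` or 
`avro.schema` in the resulting map to recover the original Avro schema.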

I'm willing to implement and create a pull request for this feature.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)