[ 
https://issues.apache.org/jira/browse/PARQUET-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Wu resolved PARQUET-2413.
------------------------------
    Fix Version/s: 1.14.0
         Assignee: Claire McGinty
       Resolution: Fixed

> Support custom file footer metadata via ParquetWriter
> -----------------------------------------------------
>
>                 Key: PARQUET-2413
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2413
>             Project: Parquet
>          Issue Type: Improvement
>            Reporter: Claire McGinty
>            Assignee: Claire McGinty
>            Priority: Minor
>             Fix For: 1.14.0
>
>
> Currently Parquet supports extra metadata keys passed via 
> [WriteContext|https://github.com/apache/parquet-mr/blob/945836c79b5bd3003512ace9e2d30d4cd03422f3/parquet-hadoop/src/main/java/org/apache/parquet/hadoop/api/WriteSupport.java#L49-L54].
>  However, in practice these are limited to implementation-specific values 
> (i.e. AvroWriteSupport adds a 
> [key|https://github.com/apache/parquet-mr/blob/945836c79b5bd3003512ace9e2d30d4cd03422f3/parquet-avro/src/main/java/org/apache/parquet/avro/AvroWriteSupport.java#L153-L154]
>  containing the serialized Avro schema), and cannot be overridden or extended 
> by the end user.
>  
> Maybe we could extend the ParquetWriter builder to support this? For example:
>  
> ```
> AvroParquetWriter.Builder<T> builder = 
> AvroParquetWriter.<T>builder(outputFile)
>     .withConf(...)
>     .withPageSize(...)
> +  .withExtraMetadata(Map.of(..))
>     .build()
> ```



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to