[
https://issues.apache.org/jira/browse/HUDI-7300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rohit Mittapalli updated HUDI-7300:
-----------------------------------
Status: In Progress (was: Open)
> Parquet DFS source should support merging schemas
> -------------------------------------------------
>
> Key: HUDI-7300
> URL: https://issues.apache.org/jira/browse/HUDI-7300
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: Rohit Mittapalli
> Priority: Minor
> Original Estimate: 24h
> Remaining Estimate: 24h
>
> We should surface the option to merge schema across the parquet files in a
> single commit. when using ParquetDFSSource.
>
> When false the schema is randomly picked from a parquet file (current
> behavior). When set to true the schema across a commit is merged.
>
> https://spark.apache.org/docs/latest/sql-data-sources-parquet.html#schema-merging
--
This message was sent by Atlassian Jira
(v8.20.10#820010)