[
https://issues.apache.org/jira/browse/HUDI-7300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-7300:
---------------------------------
Labels: pull-request-available (was: )
> Parquet DFS source should support merging schemas
> -------------------------------------------------
>
> Key: HUDI-7300
> URL: https://issues.apache.org/jira/browse/HUDI-7300
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: Rohit Mittapalli
> Priority: Minor
> Labels: pull-request-available
> Original Estimate: 24h
> Remaining Estimate: 24h
>
> We should surface the option to merge schema across the parquet files in a
> single commit. when using ParquetDFSSource.
>
> When false the schema is randomly picked from a parquet file (current
> behavior). When set to true the schema across a commit is merged.
>
> https://spark.apache.org/docs/latest/sql-data-sources-parquet.html#schema-merging
--
This message was sent by Atlassian Jira
(v8.20.10#820010)