[I] Revisit dropped Partition Columns handling [hudi]

via GitHub Sat, 29 Nov 2025 21:49:00 -0800


hudi-bot opened a new issue, #15407:
URL: https://github.com/apache/hudi/issues/15407


   Currently, dropping partition columns (controlled by 
"hoodie.datasource.write.drop.partition.columns") is handled in a piecemeal 
fashion, which unfortunately may to lead to very subtle and hard to 
troubleshoot issues when used.
   
   For ex, currently in HoodieSparkSqlWriter this would affect what will be 
persisted as writer's schema – in case partition columns are dropped from the 
data file we will persist "reduced" schema as the one that was used by the 
Writer, which is invalid since Writer was using the full schema, however 
partition columns weren't persisted in the Data Files (ie dropped, since 
they're already encoded into the partition path)
   
   ## JIRA info
   
   - Link: https://issues.apache.org/jira/browse/HUDI-4772
   - Type: Bug
   - Affects version(s):
     - 0.13.0


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[I] Revisit dropped Partition Columns handling [hudi]

Reply via email to