Dear community,

I am pleased to share the Hudi community bi-weekly update for 2022-07-18 ~ 2022-07-31, covering new features and bug fixes.
=======================================
Features

[Core] Add FileBasedLockProvider [1]
[Spark] Allow loading external configs while querying Hudi tables with Spark [2]
[Spark] Add sync validate procedure [3]
[Spark] Support Hudi with Spark 3.3.0 [4]

[1] https://issues.apache.org/jira/browse/HUDI-4065
[2] https://issues.apache.org/jira/browse/HUDI-3764
[3] https://issues.apache.org/jira/browse/HUDI-3510
[4] https://issues.apache.org/jira/browse/HUDI-4186

=======================================
Bugs

[Spark] Porting Nested Schema Pruning optimization for Hudi's custom Relations [1]
[Spark] Replacing UDF in Bulk Insert w/ RDD transformation [2]
[Spark] Fix missing bloom filters in metadata table in non-partitioned table [3]
[Spark] Fix insert into dynamic partition write misalignment [4]
[Spark] Make NONE sort mode as default for bulk insert [5]
[Spark] fix merge into sql data quality in concurrent scene [6]
[Core] Optimize performance of Column Stats Index reading in Data Skipping [7]
[Spark] Addressing Spark SQL vs Spark DS performance gap [8]

[1] https://issues.apache.org/jira/browse/HUDI-3896
[2] https://issues.apache.org/jira/browse/HUDI-3993
[3] https://issues.apache.org/jira/browse/HUDI-4400
[4] https://issues.apache.org/jira/browse/HUDI-4404
[5] https://issues.apache.org/jira/browse/HUDI-4071
[6] https://issues.apache.org/jira/browse/HUDI-4348
[7] https://issues.apache.org/jira/browse/HUDI-4250
[8] https://issues.apache.org/jira/browse/HUDI-4081

Best,
Leesf