Dear community, Nice to share Hudi community bi-weekly updates for 2020-01-17 ~ 2021-01-31 with updates on features, bug fixes and tests.
======================================= [Release] Apache Hudi 0.7.0 Released, this is a major release with many features, you would check the release notes for more details [1] [1] http://hudi.apache.org/docs/0.7.0-quick-start-guide.html ======================================= Features [Metadata] Add block size to the FileStatus objects returned from metadata table to avoid too many file splits [1] [Metadata] Harden RFC-15 Implementation based on production testing [2] [Flink Integration] InstantGenerateOperator support multiple parallelism [3] [Flink Integration] Introduce FlinkHoodieBloomIndex to hudi-flink-client [4] [Core] Insert new records to data files without merging for "Insert" operation [5] [Flink Integration] Add a new pipeline for Flink writer [6] [Clustering] Remove isEmpty to improve clustering execution performance [7] [1] https://issues.apache.org/jira/browse/HUDI-1529 [2] https://issues.apache.org/jira/browse/HUDI-1308 [3] https://issues.apache.org/jira/browse/HUDI-1511 [4] https://issues.apache.org/jira/browse/HUDI-1332 [5] https://issues.apache.org/jira/browse/HUDI-1234 [6] https://issues.apache.org/jira/browse/HUDI-1522 [7] https://issues.apache.org/jira/browse/HUDI-1555 ======================================= Bugs [Core] Make SerializableSchema work for large schemas and add ability to sortBy numeric values [1] [Core] Fixed suboptimal implementation of a magic sequence search [2] [Spark Integration] Fixing commons codec shading in spark bundle [3] [Flink Integration] Fix NPE using HoodieFlinkStreamer to etl data from kafka to hudi [4] [Core] Remove UpgradePayloadFromUberToApache [5] [1] https://issues.apache.org/jira/browse/HUDI-1553 [2] https://issues.apache.org/jira/browse/HUDI-1532 [3] https://issues.apache.org/jira/browse/HUDI-1540 [4] https://issues.apache.org/jira/browse/HUDI-1453 [5] https://issues.apache.org/jira/browse/HUDI-623 ======================================= Tests [Tests] Fix spark 2 unit tests failure with Spark 3 [1] [Tests] Introduce unit test infra for java client [2] [Tests] Add unit test for validating replacecommit rollback [3] [1] https://issues.apache.org/jira/browse/HUDI-1512 [2] https://issues.apache.org/jira/browse/HUDI-1476 [3] https://issues.apache.org/jira/browse/HUDI-1266 Best, Leesf