Thanks for sharing, Raymond! This is a very useful initiative for the community.

On Tue, Jul 9, 2024 at 1:42 AM Shiyan Xu <xu.shiyan.raym...@gmail.com> wrote:

> Hi all,
>
> As some of you may have already noticed, a new repository for Hudi has been
> created: https://github.com/apache/hudi-rs/
>
> This repository hosts a native Rust implementation of Hudi and provides
> Python APIs on top of it. The goal is to enable Hudi to integrate with a
> broader range of frameworks and better serve users' needs, specifically in
> the Rust/Python/AI/ML ecosystem.
>
> In its debut, the functionalities are limited to:
>
> - COW table snapshots and time-travel reads
> - Rust and Python APIs for retrieving table schemas and reading tables
>   into RecordBatch (Arrow)
> - Integration with Apache DataFusion
> - Support for AWS, Azure, and GCP cloud storage schemes
>
> Leveraging this foundation, we have more integration work planned within
> the ecosystem. In the meantime, it's important to establish a release
> cadence and share it with the community.
> The immediate next few steps are:
> - follow ASF guidelines and define release process for both Rust and Python
> - upload artifacts as per the process
> - start a VOTE thread for the first release candidate 0.1.0-rc1
>
> Cheers
>
> --
> Best,
> Shiyan