There is a discussion [1] about moving the Ray SQL project [2] from the `datafusion-contrib/ray-sql` GitHub repository into a new `apache/datafusion-ray` repository.
The new DataFusion Ray project will provide a Python-first distributed version of DataFusion that can scale out on Ray clusters, providing a lighter-weight alternative to the DataFusion Ballista project, which is no longer actively maintained. The DataFusion Ray code base is ~1.7k lines of code, compared to ~27k lines of code in Ballista, and has very similar performance for the TPC-H benchmarks [3]. Only votes from PMC members are binding, but all members of the community are encouraged to participate and vote with "(non-binding)". The vote will run for at least seven days. [ ] +1 Accept donation of Ray SQL code base [ ] +0 [ ] -1 Do not accept donation of Ray SQL code base because... Here is my vote: +1 [1] https://github.com/apache/datafusion-python/issues/872 [2] https://github.com/datafusion-contrib/ray-sql [3] https://github.com/apache/datafusion-python/issues/872#issuecomment-2362554275