Thank you to everyone who contributed to this quarters report. I have submitted to the ASF board and filed a ticket for the next report in June [1]
[1]: https://github.com/apache/datafusion/issues/15182 On Mon, Mar 10, 2025 at 8:52 AM Andrew Lamb <al...@influxdata.com> wrote: > I have incorporated feedback left on github. The current draft is below. I > plan to submit this on Wednesday so please add any suggestions via email / > github or on the google doc > > Andrew > > ## Description: > > The mission of Apache DataFusion is the creation and maintenance of > software > > related to an extensible query engine > > ## Project Status: > > Current project status: New + Ongoing (high activity) > > Issues for the board: None > > ## Membership Data: > > Apache DataFusion was founded 2024-04-16 (10 months ago) > > There are currently 43 committers and 15 PMC members in this project. > > The Committer-to-PMC ratio is roughly 3:1. > > Community changes, past quarter: > > - Jonah Gao was added to the PMC on 2024-12-16 > > - Piotr Findeisen was added as committer on 2024-12-03 > > - Ruiqiu Cao was added as committer on 2024-12-10 > > - Yongting You was added as committer on 2025-01-18 > > Note that almost all communication for DataFusion and its subprojects > happens on github and so our dev mailing list traffic is fairly light. > > ## Project Activity: > > ### Overall > > DataFusion is participating in Google Summer of Code with a number of ideas > for projects with mentors already selected[1][2][3]. Additionally, some > ideas on how to make DataFusion an ideal selection for university database > projects such as the CMU database classes have been put forward. > > [1]: https://github.com/apache/datafusion/issues/14577 > > [2]: > > https://summerofcode.withgoogle.com/programs/2025/organizations/apache-datafusion > > [3]: > > https://datafusion.apache.org/contributor-guide/gsoc_application_guidelines.html > > > ### DataFusion core > > https://github.com/apache/datafusion > > - 46.0.0 was released on 2025-03-07. > > - 45.0.0 was released on 2025-02-07. > > - 44.0.0 was released on 2024-12-31. > > Releases continue monthly and the project has been very active with many > commits a day. It seems more new projects have been using DataFusion for > query processing, which brings more contributors but also means we are > spending more time fielding questions and figuring out how many more > features to accept. > > Bruce Ritchie recently authored a [blog] about some of the features and the > outlook for the next 6 months. A relevant quote: > > > In the core DataFusion repo alone we reviewed and accepted almost 1600 > PRs from 206 different committers, created over 1100 issues and closed 751 > of them 🚀. > > We have been focusing more recently on pre-release testing and making it > easier for downstream consumers to use DataFusion, which is still a > challenge given how fast the project is moving. > > [blog]: https://datafusion.apache.org/blog/2025/02/20/datafusion-45.0.0/ > > ### Sub project: DataFusion Python > > https://github.com/apache/datafusion-python > > - PYTHON-45.2.0 was released on 2025-02-23. > > - PYTHON-44.0.0 was released on 2025-02-07. > > - PYTHON-43.1.0 was released on 2024-12-12. > > We have been working on making it easier to interoperate with other > systems, including support for FFI TableProvider ([#12920]) and new user > documentation on FFI [#1031] > > [#12920]: https://github.com/apache/datafusion/pull/12920 > > [#1031]: https://github.com/apache/datafusion-python/pull/1031 > > ### Sub project: DataFusion Comet > > https://github.com/apache/datafusion-comet > > - COMET-0.6.0 was released on 2025-02-17. > > - COMET-0.5.0 was released on 2025-01-17. > > You can read about the recent happenings in Comet in the [0.6.0 blog] > > > [0.6.0 blog]: > https://datafusion.apache.org/blog/2025/02/17/datafusion-comet-0.6.0/ > > > ### Sub project: DataFusion Ballista > > https://github.com/apache/datafusion-ballista > > - BALLISTA-44.0.0 was released on 2025-03-05. > > There has been some renewed interest in this project as the foundation for > distributed query engines, and we made a new release recently. > > ### (New!) Sub project: DataFusion Ray > > https://github.com/apache/datafusion-ray > > This is a new project aims to make it easier to run DataFusion in a > distributed environment using the https://www.ray.io/ compute engine > > Contributors are working hard at the moment to get DataFusionRay 0.1.0 out! > Hopefully we can do that before the announcement and then there should be > plenty to add. > > ### Sub project: sqlparser-rs > > https://github.com/apache/datafusion-sqlparser-rs > > We have made two releases since sqlparser became part of DataFusion. > > - SQLPARSER-0.55.0 was released on 2025-03-05. > > - SQLPARSER-0.54.0 was released on 2025-01-23. > > - SQLPARSER-0.53.0 was released on 2024-12-18. > > Ifeanyi Ubah (iffyio) is doing a great job reviewing PRs to keep the code > consistent and flowing. > > ## Community Health: > > While we as always struggle with code review capacity, we have many > > active committers, and the community in general helps each other out with > > reviews. We continue to actively grow our committer and PMC ranks. > > We had several in person meetups in Chicago, Boston, and Amsterdam, and are > working on organizing one in London in April 2025[1]. > > [1]: https://github.com/apache/datafusion/discussions/14647 > > > > On Sat, Mar 1, 2025 at 7:06 AM Andrew Lamb <al...@influxdata.com> wrote: > > > Hello Fearless DataFusion(iers)! > > > > We have an ASF board report due in 2 weeks and I have started a draft. > > > > Please feel free to post comments to the doc[1] or the ticket[2] or this > > thread and I will incorporate them > > > > [1]: > > > https://docs.google.com/document/d/11b2GEmPh5gblWWegeZi3G38e97vRqHSRElkLTwZHrjY > > [2]: https://github.com/apache/datafusion/issues/13713 > > > > Current draft is below. > > > > > > ``` > > ## Description: > > The mission of Apache DataFusion is the creation and maintenance of > > software > > related to an extensible query engine > > > > ## Project Status: > > Current project status: New + Ongoing (high activity) > > Issues for the board: None > > > > ## Membership Data: > > Apache DataFusion was founded 2024-04-16 (10 months ago) > > There are currently 43 committers and 15 PMC members in this project. > > The Committer-to-PMC ratio is roughly 3:1. > > > > Community changes, past quarter: > > - Jonah Gao was added to the PMC on 2024-12-16 > > - Piotr Findeisen was added as committer on 2024-12-03 > > - Ruiqiu Cao was added as committer on 2024-12-10 > > - Yongting You was added as committer on 2025-01-18 > > > > Note that almost all communication for DataFusion and its subprojects > > happens on github and so our dev mailing list traffic is fairly light. > > > > ## Project Activity: > > > > ### Overall > > > > ### DataFusion core > > > > 45.0.0 was released on 2025-02-07. > > 44.0.0 was released on 2024-12-31. > > > > https://github.com/apache/datafusion > > > > Releases continue monthly and the project has been very active with many > > commits a day. It seems many new projects have been using DataFusion for > > query processing which brings more contributors but also means we are > > spending more time fielding questions and figuring out how many more > > features to accept. > > > > Bruce Ritchie recently authored a [blog] about some of the features and > > the outlook for the next 6 months. > > > > We have been focusing more recently on pre-release testing and making it > > easier for downstream consumers to use DataFusion, which is still a > > challenge given how fast the project is moving. > > > > [blog]: https://datafusion.apache.org/blog/2025/02/20/datafusion-45.0.0/ > > > > ### Sub project: DataFusion Python > > > > https://github.com/apache/datafusion-python > > > > PYTHON-45.2.0 was released on 2025-02-23. > > PYTHON-44.0.0 was released on 2025-02-07. > > PYTHON-43.1.0 was released on 2024-12-12. > > > > > > ### Sub project: DataFusion Comet > > > > https://github.com/apache/datafusion-comet > > > > > > COMET-0.6.0 was released on 2025-02-17. > > COMET-0.5.0 was released on 2025-01-17. > > > > > > ### Sub project: DataFusion Ballista > > > > https://github.com/apache/datafusion-ballista > > > > ### (New!) Sub project: DataFusion Ray > > > > https://github.com/apache/datafusion-ray > > > > This is a new project aims to make it easier to run DataFusion in a > > distributed environment using the https://www.ray.io/ compute engine > > > > ### Sub project: Sqlparser > > > > We have made two releases since sqlparser became part of DataFusion. > > > > - SQLPARSER-0.54.0 was released on 2025-01-23. > > - SQLPARSER-0.53.0 was released on 2024-12-18. > > > > Ifeanyi / iffyio is doing a great job reviewing PRs to keep the code > > consistent and flowing. > > > > ## Community Health: > > > > While we as always struggle to get enough code review capacity, we have > > many > > active committers, and the community in general helps each other out with > > reviews. We continue to actively grow our committer and PMC ranks. > > > > We had several in person meetups in Chicago, Boston, and Amsterdam, > though > > we don’t have any more > > > > ``` > > >