Thank you to everyone who contributed to the board report. Please find the report I submitted to the board below.
Andrew ## Description: The mission of Apache DataFusion is the creation and maintenance of software related to an extensible query engine ## Project Status: Current project status: New + Ongoing (high activity) Issues for the board: None ## Membership Data: Apache DataFusion was founded 2024-04-16 (2 years ago) There are currently 58 committers and 22 PMC members in this project. The Committer-to-PMC ratio is roughly 8:3. Community changes, past quarter: - No new PMC members. Last addition was Adrian Garcia Badaracco on 2026-02-01. - Bhargava Vadlamani was added as committer on 2026-04-28 - Kumar Ujjawal was added as committer on 2026-04-28 ## Project Activity: Note that almost all communication for DataFusion and its subprojects happens on github and so our dev mailing list traffic is fairly light. ### DataFusion core https://github.com/apache/datafusion 54.0.0 was released on 2026-06-09. 53.1.0 was released on 2026-04-16. 53.0.0 was released on 2026-03-23. 52.5.0 was released on 2026-04-11. 52.4.0 was released on 2026-03-22. 52.3.0 was released on 2026-03-12. Our releases now consist of contributions from over 120 distinct contributors (was 100), and we average around [9.2 commits per day] to the main repo (up from [7.8 commits per day]) [9.2 commits per day]: git rev-list --count apache/main --since='2026-03-10 00:00:00' --until='2026-06-08 23:59:59' [7.8 commits per day]: git rev-list --count apache/main --since='2026-02-09 00:00:00' --until='2026-03-09 23:59:59' The community continues to write blogs highlighting our work, see https://datafusion.apache.org/blog/ We continue to hold small scale in person meetups in various locations, which have been successful in bringing together contributors. We had events in Portland, Seattle, NYC, and Stockholm, and are trying to hold more in Asia, such as in China. See a list here: https://datafusion.apache.org/user-guide/concepts-readings-events.html#community-events The overall number of PRs in need of review has been growing, likely due to increasing use of AI coding tools and the overall growth of the community. As the project matures, time is extending between major releases, likely due to increased testing and attention to quality. ### Sub project: DataFusion Python https://github.com/apache/datafusion-python DATAFUSION-PYTHON-53.0.0 was released on 2026-04-12. DATAFUSION-PYTHON-52.3.0 was released on 2026-03-16. In version 53.0.0 we introduced new AI workflows into the project. The primary outcome of this is to provide a method to ensure we have consistent coverage between the exposed datafusion-python APIs and the upstream functions in the core repository. This workflow exposed 55 function gaps between the two repositories that were then corrected. Additionally the datafusion-python project went through a massive overhaul in the documentation of the API surface area to include usage docstrings directly aimed at improving the ability for LLM agents to write effective datafusion-python code. We have additionally released an agent skill that improves the ability of LLMs to write idiomatic datafusion-python code. This has been tested against the TPC-H queries where agents can now faithfully reproduce queries to pass these tests using only the text description of the query. Since the release of 53.0.0 we have added two new LLM skills to complement the above work. First we added a skill that ensures all of the newly exposed functions are “pythonic” in nature rather than just exposing the Rust interface directly. Second we have a skill that verifies that the user facing skill to write idiomatic code is kept up to date with the API surface area of the project. We have published a blog based on the experience of writing these agent skills. You can read it here: https://datafusion.apache.org/blog/2026/05/28/writing-agent-skills/ ### New sub project: DataFusion Java We have added Java Bindings as a subproject. You can read about it here: https://datafusion.apache.org/blog/output/2026/05/26/datafusion-java-0.1.0/ ### Sub project: DataFusion Comet COMET-0.16.0 was released on 2026-01-29. https://github.com/apache/datafusion-comet You can read about the recent happenings in Comet in the blogs: https://datafusion.apache.org/blog/2026/05/07/datafusion-comet-0.16.0 ### Sub project: DataFusion Ballista https://github.com/apache/datafusion-ballista BALLISTA-53.0.0 was released on 2026-05-24 BALLISTA-52.0.0 was released on 2026-03-07. BALLISTA-51.0.0 was released on 2026-01-19. The community has published new post outlining changes to ballista in last 12 months https://datafusion.apache.org/blog/output/2026/05/24/datafusion-ballista-53.0.0/ There has been an increase of number contributions to Ballista, and PR reviews, which is very positive. Efforts were focused on improving observability of running jobs and usability. With hope to improve ballista robustness and performance for SF1000+ workloads. I hope this trend of increased contributions is going to persist in the future. ### Sub project: sqlparser-rs SQLPARSER-0.62.0 was released on 2026-05-27. https://github.com/apache/datafusion-sqlparser-rs Ifeanyi Ubah (iffyio) continues to review most PRs in this repo. ## Community Health: While we as always struggle with code review capacity, we have many active committers, and the community in general helps each other out with reviews. We continue to actively grow our committer and PMC ranks. We continue to merge multiple PRs a day from multiple committers and have contributions from a wide variety of individuals with a wide variety of employers, organizations, and backgrounds. On Mon, Jun 8, 2026 at 7:31 AM Andrew Lamb <[email protected]> wrote: > It is that time again -- the ASF board report is due June 10 (in 2 days). > Sorry for the short notice. > > As is our custom, I try and gather input from the community on any issues > they think we should raise to the board, and I will submit the report. > > If anyone has content they would like to add to the report, please respond > to this email, add it as a comment on the ticket, or a suggestion directly > into the document. > > Tracking ticket: https://github.com/apache/datafusion/issues/20874 > > Google Doc: > > https://docs.google.com/document/d/152NPdyW7hExjzdYI4bhVsWHrV1AYAa1mmZq_90bgwSs > > Thanks, > Andrew > > > > > --------- > > 2026-06-10 DataFusion ASF Board Report > https://github.com/apache/datafusion/issues/20874 > > DataFusion PMC Chair Note: Please add any relevant comments / content to > this document. I (Andrew Lamb) will submit to the ASF board on June 10, > 2026 (about one week prior to the scheduled board meeting). > > The format of this report and the metrics are from > https://reporter.apache.org/wizard/?datafusion > > The rationale and process for this report: > https://www.apache.org/foundation/board/reporting > Past examples: 2026-03-11 DataFusion ASF Board Report > > ## Description: > The mission of Apache DataFusion is the creation and maintenance of > software > related to an extensible query engine > > ## Project Status: > Current project status: New + Ongoing (high activity) > Issues for the board: None > > ## Membership Data: > > Apache DataFusion was founded 2024-04-16 (2 years ago) > There are currently 58 committers and 22 PMC members in this project. > The Committer-to-PMC ratio is roughly 8:3. > > Community changes, past quarter: > - No new PMC members. Last addition was Adrian Garcia Badaracco on > 2026-02-01. > - Bhargava Vadlamani was added as committer on 2026-04-28 > - Kumar Ujjawal was added as committer on 2026-04-28 > > > ## Project Activity: > Note that almost all communication for DataFusion and its subprojects > happens > on github and so our dev mailing list traffic is fairly light. > > ### DataFusion core > https://github.com/apache/datafusion > 53.1.0 was released on 2026-04-16. > 53.0.0 was released on 2026-03-23. > 52.5.0 was released on 2026-04-11. > 52.4.0 was released on 2026-03-22. > 52.3.0 was released on 2026-03-12. > > > Our releases now consist of contributions from over 120 distinct > contributors (was 100), and we average around [9.2 commits per day] to the > main repo (up from [7.8 commits per day]) > > [9.2 commits per day]: git rev-list --count apache/main --since='2026-03-10 > 00:00:00' --until='2026-06-08 23:59:59' > [7.8 commits per day]: git rev-list --count apache/main --since='2026-02-09 > 00:00:00' --until='2026-03-09 23:59:59' > > The community continues to write blogs highlighting our work > https://datafusion.apache.org/blog/2026/02/02/datafusion_case > https://datafusion.apache.org/blog/2026/01/30/datafusion-comet-0.13.0 > https://datafusion.apache.org/blog/2026/01/12/datafusion-52.0.0 > https://datafusion.apache.org/blog/2026/01/12/extending-sql > > https://datafusion.apache.org/blog/2025/12/15/avoid-consecutive-repartitions > > We continue to hold small scale in person meetups in various locations, > which have been successful in bringing together contributors. We had events > in Portland, Seattle, NYC, and Stockholm, and are trying to hold more in > Asia, such as in China. See a list here: > > https://datafusion.apache.org/user-guide/concepts-readings-events.html#community-events > > The overall number of PRs in need of review has been growing, likely due to > increasing use of AI coding tools and the overall growth of the community. > > As the project matures, time is extending between major releases, likely > due to increased testing and attention to quality. > > > ### Sub project: DataFusion Python > > https://github.com/apache/datafusion-python > DATAFUSION-PYTHON-53.0.0 was released on 2026-04-12. > DATAFUSION-PYTHON-52.3.0 was released on 2026-03-16. > > ### New sub project: DataFusion Java > > We have added Java Bindings as a subproject. You can read about it here: > > https://datafusion.apache.org/blog/output/2026/05/26/datafusion-java-0.1.0/ > > > > ### Sub project: DataFusion Comet > > COMET-0.16.0 was released on 2026-01-29. > > https://github.com/apache/datafusion-comet > > > You can read about the recent happenings in Comet in the blogs: > https://datafusion.apache.org/blog/2026/05/07/datafusion-comet-0.16.0 > > > > ### Sub project: DataFusion Ballista > > https://github.com/apache/datafusion-ballista > BALLISTA-52.0.0 was released on 2026-03-07. > BALLISTA-51.0.0 was released on 2026-01-19. > > There has been an increase of number contributions to ballista, and PR > reviews, which is very positive. I hope this trend is going to persist in > the future. > > > ### Sub project: sqlparser-rs > > SQLPARSER-0.62.0 was released on 2026-05-27. > > https://github.com/apache/datafusion-sqlparser-rs > > Ifeanyi Ubah (iffyio) continues to review most PRs in this repo. > > > ## Community Health: > > While we as always struggle with code review capacity, > we have many active committers, and the community in general helps each > other out with reviews. We continue to actively grow our committer > and PMC ranks. > > We continue to merge multiple PRs a day from multiple committers and > have contributions from a wide variety of individuals with a wide > variety of employers, organizations, and backgrounds. >
