Thank you to everyone who contributed to this quarters report. I have
submitted to the ASF board and filed a ticket for the next report in June
[1]

[1]: https://github.com/apache/datafusion/issues/15182

On Mon, Mar 10, 2025 at 8:52 AM Andrew Lamb <al...@influxdata.com> wrote:

> I have incorporated feedback left on github. The current draft is below. I
> plan to submit this on Wednesday so please add any suggestions via email /
> github  or on the google doc
>
> Andrew
>
> ## Description:
>
> The mission of Apache DataFusion is the creation and maintenance of
> software
>
> related to an extensible query engine
>
> ## Project Status:
>
> Current project status: New + Ongoing (high activity)
>
> Issues for the board: None
>
> ## Membership Data:
>
> Apache DataFusion was founded 2024-04-16 (10 months ago)
>
> There are currently 43 committers and 15 PMC members in this project.
>
> The Committer-to-PMC ratio is roughly 3:1.
>
> Community changes, past quarter:
>
> - Jonah Gao was added to the PMC on 2024-12-16
>
> - Piotr Findeisen was added as committer on 2024-12-03
>
> - Ruiqiu Cao was added as committer on 2024-12-10
>
> - Yongting You was added as committer on 2025-01-18
>
> Note that almost all communication for DataFusion and its subprojects
> happens on github and so our dev mailing list traffic is fairly light.
>
> ## Project Activity:
>
> ### Overall
>
> DataFusion is participating in Google Summer of Code with a number of ideas
> for projects with mentors already selected[1][2][3]. Additionally, some
> ideas on how to make DataFusion an ideal selection for university database
> projects such as the CMU database classes have been put forward.
>
> [1]: https://github.com/apache/datafusion/issues/14577
>
> [2]:
>
> https://summerofcode.withgoogle.com/programs/2025/organizations/apache-datafusion
>
> [3]:
>
> https://datafusion.apache.org/contributor-guide/gsoc_application_guidelines.html
>
>
> ### DataFusion core
>
> https://github.com/apache/datafusion
>
> - 46.0.0 was released on 2025-03-07.
>
> - 45.0.0 was released on 2025-02-07.
>
> - 44.0.0 was released on 2024-12-31.
>
> Releases continue monthly and the project has been very active with many
> commits a day. It seems more new projects have been using DataFusion for
> query processing, which brings more contributors but also means we are
> spending more time fielding questions and figuring out how many more
> features to accept.
>
> Bruce Ritchie recently authored a [blog] about some of the features and the
> outlook for the next 6 months. A relevant quote:
>
> > In the core DataFusion repo alone we reviewed and accepted almost 1600
> PRs from 206 different committers, created over 1100 issues and closed 751
> of them 🚀.
>
> We have been focusing more recently on pre-release testing and making it
> easier for downstream consumers to use DataFusion, which is still a
> challenge given how fast the project is moving.
>
> [blog]: https://datafusion.apache.org/blog/2025/02/20/datafusion-45.0.0/
>
> ### Sub project: DataFusion Python
>
> https://github.com/apache/datafusion-python
>
> - PYTHON-45.2.0 was released on 2025-02-23.
>
> - PYTHON-44.0.0 was released on 2025-02-07.
>
> - PYTHON-43.1.0 was released on 2024-12-12.
>
> We have been working on making it easier to interoperate with other
> systems, including support for FFI TableProvider ([#12920]) and new user
> documentation on FFI [#1031]
>
> [#12920]: https://github.com/apache/datafusion/pull/12920
>
> [#1031]: https://github.com/apache/datafusion-python/pull/1031
>
> ### Sub project: DataFusion Comet
>
> https://github.com/apache/datafusion-comet
>
> - COMET-0.6.0 was released on 2025-02-17.
>
> - COMET-0.5.0 was released on 2025-01-17.
>
> You can read about the recent happenings in Comet in the [0.6.0 blog]
>
>
> [0.6.0 blog]:
> https://datafusion.apache.org/blog/2025/02/17/datafusion-comet-0.6.0/
>
>
> ### Sub project: DataFusion Ballista
>
> https://github.com/apache/datafusion-ballista
>
> - BALLISTA-44.0.0 was released on 2025-03-05.
>
> There has been some renewed interest in this project as the foundation for
> distributed query engines, and we made a new release recently.
>
> ### (New!) Sub project: DataFusion Ray
>
> https://github.com/apache/datafusion-ray
>
> This is a new project aims to make it easier to run DataFusion in a
> distributed environment using the https://www.ray.io/ compute engine
>
> Contributors are working hard at the moment to get DataFusionRay 0.1.0 out!
> Hopefully we can do that before the announcement and then there should be
> plenty to add.
>
> ### Sub project: sqlparser-rs
>
> https://github.com/apache/datafusion-sqlparser-rs
>
> We have made two releases since sqlparser became part of DataFusion.
>
> - SQLPARSER-0.55.0 was released on 2025-03-05.
>
> - SQLPARSER-0.54.0 was released on 2025-01-23.
>
> - SQLPARSER-0.53.0 was released on 2024-12-18.
>
> Ifeanyi Ubah (iffyio) is doing a great job reviewing PRs to keep the code
> consistent and flowing.
>
> ## Community Health:
>
> While we as always struggle with code review capacity, we have many
>
> active committers, and the community in general helps each other out with
>
> reviews. We continue to actively grow our committer and PMC ranks.
>
> We had several in person meetups in Chicago, Boston, and Amsterdam, and are
> working on organizing one in London in April 2025[1].
>
> [1]: https://github.com/apache/datafusion/discussions/14647
>
>
>
> On Sat, Mar 1, 2025 at 7:06 AM Andrew Lamb <al...@influxdata.com> wrote:
>
> > Hello Fearless DataFusion(iers)!
> >
> > We have an ASF board report due in 2 weeks and I have started a draft.
> >
> > Please feel free to post comments to the doc[1] or the ticket[2] or this
> > thread and I will incorporate them
> >
> > [1]:
> >
> https://docs.google.com/document/d/11b2GEmPh5gblWWegeZi3G38e97vRqHSRElkLTwZHrjY
> > [2]: https://github.com/apache/datafusion/issues/13713
> >
> > Current draft is below.
> >
> >
> > ```
> > ## Description:
> > The mission of Apache DataFusion is the creation and maintenance of
> > software
> > related to an extensible query engine
> >
> > ## Project Status:
> > Current project status: New + Ongoing (high activity)
> > Issues for the board: None
> >
> > ## Membership Data:
> > Apache DataFusion was founded 2024-04-16 (10 months ago)
> > There are currently 43 committers and 15 PMC members in this project.
> > The Committer-to-PMC ratio is roughly 3:1.
> >
> > Community changes, past quarter:
> > - Jonah Gao was added to the PMC on 2024-12-16
> > - Piotr Findeisen was added as committer on 2024-12-03
> > - Ruiqiu Cao was added as committer on 2024-12-10
> > - Yongting You was added as committer on 2025-01-18
> >
> > Note that almost all communication for DataFusion and its subprojects
> > happens on github and so our dev mailing list traffic is fairly light.
> >
> > ## Project Activity:
> >
> > ### Overall
> >
> > ### DataFusion core
> >
> > 45.0.0 was released on 2025-02-07.
> > 44.0.0 was released on 2024-12-31.
> >
> > https://github.com/apache/datafusion
> >
> > Releases continue monthly and the project has been very active with many
> > commits a day. It seems many new projects have been using DataFusion for
> > query processing which brings more contributors but also means we are
> > spending more time fielding questions and figuring out how many more
> > features to accept.
> >
> > Bruce Ritchie recently authored a [blog] about some of the features and
> > the outlook for the next 6 months.
> >
> > We have been focusing more recently on pre-release testing and making it
> > easier for downstream consumers to use DataFusion, which is still a
> > challenge given how fast the project is moving.
> >
> > [blog]: https://datafusion.apache.org/blog/2025/02/20/datafusion-45.0.0/
> >
> > ### Sub project: DataFusion Python
> >
> > https://github.com/apache/datafusion-python
> >
> > PYTHON-45.2.0 was released on 2025-02-23.
> > PYTHON-44.0.0 was released on 2025-02-07.
> > PYTHON-43.1.0 was released on 2024-12-12.
> >
> >
> > ### Sub project: DataFusion Comet
> >
> > https://github.com/apache/datafusion-comet
> >
> >
> > COMET-0.6.0 was released on 2025-02-17.
> > COMET-0.5.0 was released on 2025-01-17.
> >
> >
> > ### Sub project: DataFusion Ballista
> >
> > https://github.com/apache/datafusion-ballista
> >
> > ### (New!) Sub project: DataFusion Ray
> >
> > https://github.com/apache/datafusion-ray
> >
> > This is a new project aims to make it easier to run DataFusion in a
> > distributed environment using the https://www.ray.io/ compute engine
> >
> > ### Sub project: Sqlparser
> >
> > We have made two releases since sqlparser became part of DataFusion.
> >
> > - SQLPARSER-0.54.0 was released on 2025-01-23.
> > - SQLPARSER-0.53.0 was released on 2024-12-18.
> >
> > Ifeanyi / iffyio is doing a great job reviewing PRs to keep the code
> > consistent and flowing.
> >
> > ## Community Health:
> >
> > While we as always struggle to get enough code review capacity, we have
> > many
> > active committers, and the community in general helps each other out with
> > reviews. We continue to actively grow our committer and PMC ranks.
> >
> > We had several in person meetups in Chicago, Boston, and Amsterdam,
> though
> > we don’t have any more
> >
> > ```
> >
>

Reply via email to