I have incorporated feedback left on github. The current draft is below. I
plan to submit this on Wednesday so please add any suggestions via email /
github  or on the google doc

Andrew

## Description:

The mission of Apache DataFusion is the creation and maintenance of
software

related to an extensible query engine

## Project Status:

Current project status: New + Ongoing (high activity)

Issues for the board: None

## Membership Data:

Apache DataFusion was founded 2024-04-16 (10 months ago)

There are currently 43 committers and 15 PMC members in this project.

The Committer-to-PMC ratio is roughly 3:1.

Community changes, past quarter:

- Jonah Gao was added to the PMC on 2024-12-16

- Piotr Findeisen was added as committer on 2024-12-03

- Ruiqiu Cao was added as committer on 2024-12-10

- Yongting You was added as committer on 2025-01-18

Note that almost all communication for DataFusion and its subprojects
happens on github and so our dev mailing list traffic is fairly light.

## Project Activity:

### Overall

DataFusion is participating in Google Summer of Code with a number of ideas
for projects with mentors already selected[1][2][3]. Additionally, some
ideas on how to make DataFusion an ideal selection for university database
projects such as the CMU database classes have been put forward.

[1]: https://github.com/apache/datafusion/issues/14577

[2]:
https://summerofcode.withgoogle.com/programs/2025/organizations/apache-datafusion

[3]:
https://datafusion.apache.org/contributor-guide/gsoc_application_guidelines.html


### DataFusion core

https://github.com/apache/datafusion

- 46.0.0 was released on 2025-03-07.

- 45.0.0 was released on 2025-02-07.

- 44.0.0 was released on 2024-12-31.

Releases continue monthly and the project has been very active with many
commits a day. It seems more new projects have been using DataFusion for
query processing, which brings more contributors but also means we are
spending more time fielding questions and figuring out how many more
features to accept.

Bruce Ritchie recently authored a [blog] about some of the features and the
outlook for the next 6 months. A relevant quote:

> In the core DataFusion repo alone we reviewed and accepted almost 1600
PRs from 206 different committers, created over 1100 issues and closed 751
of them 🚀.

We have been focusing more recently on pre-release testing and making it
easier for downstream consumers to use DataFusion, which is still a
challenge given how fast the project is moving.

[blog]: https://datafusion.apache.org/blog/2025/02/20/datafusion-45.0.0/

### Sub project: DataFusion Python

https://github.com/apache/datafusion-python

- PYTHON-45.2.0 was released on 2025-02-23.

- PYTHON-44.0.0 was released on 2025-02-07.

- PYTHON-43.1.0 was released on 2024-12-12.

We have been working on making it easier to interoperate with other
systems, including support for FFI TableProvider ([#12920]) and new user
documentation on FFI [#1031]

[#12920]: https://github.com/apache/datafusion/pull/12920

[#1031]: https://github.com/apache/datafusion-python/pull/1031

### Sub project: DataFusion Comet

https://github.com/apache/datafusion-comet

- COMET-0.6.0 was released on 2025-02-17.

- COMET-0.5.0 was released on 2025-01-17.

You can read about the recent happenings in Comet in the [0.6.0 blog]


[0.6.0 blog]:
https://datafusion.apache.org/blog/2025/02/17/datafusion-comet-0.6.0/


### Sub project: DataFusion Ballista

https://github.com/apache/datafusion-ballista

- BALLISTA-44.0.0 was released on 2025-03-05.

There has been some renewed interest in this project as the foundation for
distributed query engines, and we made a new release recently.

### (New!) Sub project: DataFusion Ray

https://github.com/apache/datafusion-ray

This is a new project aims to make it easier to run DataFusion in a
distributed environment using the https://www.ray.io/ compute engine

Contributors are working hard at the moment to get DataFusionRay 0.1.0 out!
Hopefully we can do that before the announcement and then there should be
plenty to add.

### Sub project: sqlparser-rs

https://github.com/apache/datafusion-sqlparser-rs

We have made two releases since sqlparser became part of DataFusion.

- SQLPARSER-0.55.0 was released on 2025-03-05.

- SQLPARSER-0.54.0 was released on 2025-01-23.

- SQLPARSER-0.53.0 was released on 2024-12-18.

Ifeanyi Ubah (iffyio) is doing a great job reviewing PRs to keep the code
consistent and flowing.

## Community Health:

While we as always struggle with code review capacity, we have many

active committers, and the community in general helps each other out with

reviews. We continue to actively grow our committer and PMC ranks.

We had several in person meetups in Chicago, Boston, and Amsterdam, and are
working on organizing one in London in April 2025[1].

[1]: https://github.com/apache/datafusion/discussions/14647



On Sat, Mar 1, 2025 at 7:06 AM Andrew Lamb <al...@influxdata.com> wrote:

> Hello Fearless DataFusion(iers)!
>
> We have an ASF board report due in 2 weeks and I have started a draft.
>
> Please feel free to post comments to the doc[1] or the ticket[2] or this
> thread and I will incorporate them
>
> [1]:
> https://docs.google.com/document/d/11b2GEmPh5gblWWegeZi3G38e97vRqHSRElkLTwZHrjY
> [2]: https://github.com/apache/datafusion/issues/13713
>
> Current draft is below.
>
>
> ```
> ## Description:
> The mission of Apache DataFusion is the creation and maintenance of
> software
> related to an extensible query engine
>
> ## Project Status:
> Current project status: New + Ongoing (high activity)
> Issues for the board: None
>
> ## Membership Data:
> Apache DataFusion was founded 2024-04-16 (10 months ago)
> There are currently 43 committers and 15 PMC members in this project.
> The Committer-to-PMC ratio is roughly 3:1.
>
> Community changes, past quarter:
> - Jonah Gao was added to the PMC on 2024-12-16
> - Piotr Findeisen was added as committer on 2024-12-03
> - Ruiqiu Cao was added as committer on 2024-12-10
> - Yongting You was added as committer on 2025-01-18
>
> Note that almost all communication for DataFusion and its subprojects
> happens on github and so our dev mailing list traffic is fairly light.
>
> ## Project Activity:
>
> ### Overall
>
> ### DataFusion core
>
> 45.0.0 was released on 2025-02-07.
> 44.0.0 was released on 2024-12-31.
>
> https://github.com/apache/datafusion
>
> Releases continue monthly and the project has been very active with many
> commits a day. It seems many new projects have been using DataFusion for
> query processing which brings more contributors but also means we are
> spending more time fielding questions and figuring out how many more
> features to accept.
>
> Bruce Ritchie recently authored a [blog] about some of the features and
> the outlook for the next 6 months.
>
> We have been focusing more recently on pre-release testing and making it
> easier for downstream consumers to use DataFusion, which is still a
> challenge given how fast the project is moving.
>
> [blog]: https://datafusion.apache.org/blog/2025/02/20/datafusion-45.0.0/
>
> ### Sub project: DataFusion Python
>
> https://github.com/apache/datafusion-python
>
> PYTHON-45.2.0 was released on 2025-02-23.
> PYTHON-44.0.0 was released on 2025-02-07.
> PYTHON-43.1.0 was released on 2024-12-12.
>
>
> ### Sub project: DataFusion Comet
>
> https://github.com/apache/datafusion-comet
>
>
> COMET-0.6.0 was released on 2025-02-17.
> COMET-0.5.0 was released on 2025-01-17.
>
>
> ### Sub project: DataFusion Ballista
>
> https://github.com/apache/datafusion-ballista
>
> ### (New!) Sub project: DataFusion Ray
>
> https://github.com/apache/datafusion-ray
>
> This is a new project aims to make it easier to run DataFusion in a
> distributed environment using the https://www.ray.io/ compute engine
>
> ### Sub project: Sqlparser
>
> We have made two releases since sqlparser became part of DataFusion.
>
> - SQLPARSER-0.54.0 was released on 2025-01-23.
> - SQLPARSER-0.53.0 was released on 2024-12-18.
>
> Ifeanyi / iffyio is doing a great job reviewing PRs to keep the code
> consistent and flowing.
>
> ## Community Health:
>
> While we as always struggle to get enough code review capacity, we have
> many
> active committers, and the community in general helps each other out with
> reviews. We continue to actively grow our committer and PMC ranks.
>
> We had several in person meetups in Chicago, Boston, and Amsterdam, though
> we don’t have any more
>
> ```
>

Reply via email to