Hello Arrow Community,

TL;DR: Please add any comments or board report content directly to [2], or reply to this email and I will incorporate your comments. The current draft is at the end of this email.
In an epic scheduling fail, I forgot to organize this report a few weeks ago, so now the deadline is tight.

One of the responsibilities of being part of the Apache Software Foundation (ASF) is to regularly summarize the state of the project in a quarterly update to the ASF board. I plan to submit the next report on July 14, 2023 (in two days' time; I am sorry for the late notice).

Historically [1], Arrow has crowdsourced the content, which has worked well. While this is partly an administrative reporting exercise, I think it is also valuable to reflect on the past and think about goals for the future. It would be especially interesting if anyone from the various language implementation communities could provide an update of a sentence or two.

Andrew

[1]: https://lists.apache.org/thread/xg7pgj4stt4l2sblyt81y9s6h0cl8hw5
[2]: https://docs.google.com/document/d/13FSDydEVXT2UUFdy4XKjVKNJW-WR8ylvG3aI6lD-dNI/edit#

## Description:
The mission of Apache Arrow is the creation and maintenance of software related to columnar in-memory processing and data interchange. More information can be found at https://arrow.apache.org/overview/

## Issues:

## Membership Data:
Apache Arrow was founded 2016-01-19 (7 years ago).
There are currently 97 committers and 50 PMC members in this project.
The Committer-to-PMC ratio is roughly 7:4.
Community changes, past quarter:
- Ben Baumgold was added to the PMC on 2023-06-19
- Jie Wen was added to the PMC on 2023-06-10
- Dewey Dunnington was added to the PMC on 2023-06-22
- Matthew Topol was added to the PMC on 2023-05-02
- Gang Wu was added as committer on 2023-05-15
- Kevin Gurney was added as committer on 2023-07-04
- Marco Neumann was added as committer on 2023-05-11
- Mehmet Ozan Kabak was added as committer on 2023-06-10
- Ruihang Xia was added as committer on 2023-04-15

## Project Activity:
There has been healthy debate about adding new formats, [StringArray] and [ListView], focused on increasing Arrow's appeal in high performance computation engines.

We have completed the transition from JIRA to GitHub issues for the monorepo, and that appears to be going well.

The DataFusion subproject is considering applying to become its own top level Apache project (see the DataFusion update below).

[StringArray]: https://lists.apache.org/thread/c6frlr9gcxy8qdhbmv8cn3rdjbrqxb1v
[ListView]: https://lists.apache.org/thread/r28rw5n39jwtvn08oljl09d4q2c1ysvb

## Community Health:
There have been 9 blog posts published to https://arrow.apache.org/blog/ in the last 3 months, including two from community members on their use of Arrow.

## Sub Project Updates
Arrow has several subprojects, as listed on https://arrow.apache.org/

### ADBC

### Arrow Flight

### Arrow Flight SQL

### DataFusion
DataFusion continues to grow and mature. The community added many new features, as described in the latest [blog] post, reached consensus on the [goals] of the project, and is discussing a [move to its own top level Apache project]. Current development focus is on performance and on better support for structured types such as Lists and Structs. We expect more work on improving documentation and communicating externally over the next quarter.
[blog]: https://arrow.apache.org/blog/2023/06/24/datafusion-25.0.0/
[goals]: https://github.com/apache/arrow-datafusion/discussions/6441
[move to its own top level Apache project]: https://github.com/apache/arrow-datafusion/discussions/6475

## Language Area Updates
Arrow has at least 12 different language implementations, as explained in https://arrow.apache.org/overview/

Arrow 12.0.0 was released from the monorepo: https://arrow.apache.org/blog/2023/05/02/12.0.0-release/

### C++

### C#

### Go

### Java

### JavaScript

### Julia

### nanoarrow

### Rust

### C (GLib)

### MATLAB

### Python

### R

### Ruby

### Swift

## Release activity (This is automatically generated):
RS-DATAFUSION-PYTHON-27.0.0 was released on 2023-07-08.
RS-43.0.0 was released on 2023-07-03.
RS-DATAFUSION-27.0.0 was released on 2023-06-30.
ADBC-0.5.1 was released on 2023-06-26.
NANOARROW-0.2.0 was released on 2023-06-22.
ADBC-0.5.0 was released on 2023-06-20.
RS-42.0.0 was released on 2023-06-20.
12.0.1 was released on 2023-06-13.
JULIA-2.6.2 was released on 2023-06-12.
JULIA-2.6.1 was released on 2023-06-08.
RS-DATAFUSION-26.0.0 was released on 2023-06-07.
RS-41.0.0 was released on 2023-06-06.
RS-OS-0.6.1 was released on 2023-06-06.
JULIA-2.6.0 was released on 2023-06-05.
RS-DATAFUSION-25.0.0 was released on 2023-05-23.
RS-40.0.0 was released on 2023-05-22.
RS-OS-0.6.0 was released on 2023-05-22.
ADBC-0.4.0 was released on 2023-05-12.
RS-39.0.0 was released on 2023-05-09.
RS-DATAFUSION-24.0.0 was released on 2023-05-09.
12.0.0 was released on 2023-05-01.
RS-DATAFUSION-PYTHON-23.0.0 was released on 2023-04-28.
RS-38.0.0 was released on 2023-04-25.
RS-DATAFUSION-23.0.0 was released on 2023-04-24.
JULIA-2.5.2 was released on 2023-04-19.
JULIA-2.5.1 was released on 2023-04-16.
RS-DATAFUSION-PYTHON-22.0.0 was released on 2023-04-14.