alamb commented on code in PR #91: URL: https://github.com/apache/datafusion-site/pull/91#discussion_r2216920283
########## content/blog/2025-07-25-datafusion-49.0.0.md: ########## @@ -0,0 +1,207 @@ + --- +layout: post +title: Apache DataFusion 49.0.0 Released +date: 2025-07-20 +author: pmc +categories: [release] +--- + +<!-- +{% comment %} +Licensed to the Apache Software Foundation (ASF) under one or more +contributor license agreements. See the NOTICE file distributed with +this work for additional information regarding copyright ownership. +The ASF licenses this file to you under the Apache License, Version 2.0 +(the "License"); you may not use this file except in compliance with +the License. You may obtain a copy of the License at + +http://www.apache.org/licenses/LICENSE-2.0 + +Unless required by applicable law or agreed to in writing, software +distributed under the License is distributed on an "AS IS" BASIS, +WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +See the License for the specific language governing permissions and +limitations under the License. +{% endcomment %} +--> + +<!-- see https://github.com/apache/datafusion/issues/16347 for details --> + +## Introduction + +We are very proud to announce [DataFusion 49.0.0]. This blog highlights some of the +many major improvements since we released [DataFusion 48.0.0]. This new version +represents a significant milestone for the project, packing in a wide range of +improvements and fixes. You can find the complete details in the full [changelog]. + +[DataFusion 49.0.0]: https://crates.io/crates/datafusion/49.0.0 +[DataFusion 48.0.0]: https://datafusion.apache.org/blog/2025/07/18/datafusion-48.0.0/ +[changelog]: https://github.com/apache/datafusion/blob/branch-49/dev/changelog/49.0.0.md + +[Apache DataFusion] is an extensible query engine, written in [Rust], that +uses [Apache Arrow] as its in-memory format. DataFusion is used by developers to +create new, fast data centric systems such as databases, dataframe libraries, +machine learning and streaming applications. While [DataFusion’s primary design +goal] is to accelerate the creation of other data centric systems, it has a +reasonable experience directly out of the box as a [dataframe library], +[python library] and [command line SQL tool]. + +[apache datafusion]: https://datafusion.apache.org/ +[rust]: https://www.rust-lang.org/ +[apache arrow]: https://arrow.apache.org +[DataFusion’s primary design goal]: https://datafusion.apache.org/user-guide/introduction.html#project-goals +[dataframe library]: https://datafusion.apache.org/user-guide/dataframe.html +[python library]: https://datafusion.apache.org/python/ +[command line SQL tool]: https://datafusion.apache.org/user-guide/cli/ + +DataFusion's core thesis is that as a community, together we can build much more +advanced technology than any of us as individuals or companies could do alone. +Without DataFusion, highly performant vectorized query engines would remain +the domain of a few large companies and world-class research institutions. +With DataFusion, we can all build on top of a shared foundation, and focus on +what makes our projects unique. + +## Community Growth 📈 + +In the last few months, between `46.0.0` and `49.0.0`, our community continues to grow in new and exciting ways. + +1. We added several PMC members and new committers: [berkay], [xudong963] and [timsaucer] joined the PMC, + [blaginin], [milenkovicm], [adriangb] and [kosiew] joined as committers. See the [mailing list] for more details. +2. In the [core DataFusion repo] alone we reviewed and accepted almost fixme PRs from fixme different + committers, created over fixme issues and closed fixme of them 🚀. All changes are listed in the detailed + [changelogs]. +3. DataFusion published a number of blog posts including [User defined Window Functions], Optimizing SQL (and DataFrames) + in DataFusion [part 1], [part 2] and [Using Rust async for Query Execution and Cancelling Long-Running Queries]. + +[core DataFusion repo]: https://github.com/apache/arrow-datafusion +[changelogs]: https://github.com/apache/datafusion/tree/main/dev/changelog +[mailing list]: https://lists.apache.org/list.html?d...@datafusion.apache.org +[berkay]: https://github.com/berkaysynnada +[xudong963]: https://github.com/xudong963 +[timsaucer]: https://github.com/timsaucer +[blaginin]: https://github.com/blaginin +[milenkovicm]: https://github.com/milenkovicm +[adriangb]: https://github.com/adriangb +[kosiew]: https://github.com/kosiew +[Use defined Window Functions]: https://datafusion.apache.org/blog/2025/04/19/user-defined-window-functions +[part 1]: https://datafusion.apache.org/blog/2025/06/15/optimizing-sql-dataframes-part-one +[part 2]: https://datafusion.apache.org/blog/2025/06/15/optimizing-sql-dataframes-part-two +[Using Rust async for Query Execution and Cancelling Long-Running Queries]: https://datafusion.apache.org/blog/2025/06/30/cancellation + +<!-- +$ git log --pretty=oneline 48.0.0..49.0.0 . | wc -l + 1532 (fixme) + +$ git shortlog -sn 48.0.0..49.0.0 . | wc -l + 206 (fixme) + +https://crates.io/crates/datafusion/49.0.0 +DataFusion 49 released July 25, 2025 + +https://crates.io/crates/datafusion/46.0.0 +DataFusion 46 released March 7, 2025 + +Issues created in this time: 271 open, 320 closed +https://github.com/apache/datafusion/issues?q=is%3Aissue+created%3A2025-03-07..2025-07-25 + +Issues closed: 440 +https://github.com/apache/arrow-datafusion/issues?q=is%3Aissue+closed%3A2025-03-07..2025-07-25 + +PRs merged in this time 751 +https://github.com/apache/arrow-datafusion/pulls?q=is%3Apr+merged%3A2025-03-07..2025-07-25 + +--> + +## Breaking Changes Review Comment: SOme feedback I got on other posts was that leading with breaking changes was kind of jarring -- I think the link to the upgrade guide at the end of the post is probably adequate for now -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org