This is an automated email from the ASF dual-hosted git repository.
github-bot pushed a commit to branch asf-staging
in repository https://gitbox.apache.org/repos/asf/datafusion-site.git
The following commit(s) were added to refs/heads/asf-staging by this push:
new 8656449 Commit build products
8656449 is described below
commit 86564490c7e5d55b22c3a6e86190414942e47325
Author: Build Pelican (action) <[email protected]>
AuthorDate: Tue Jan 20 17:25:31 2026 +0000
Commit build products
---
blog/2026/01/08/datafusion-52.0.0/index.html | 6 +++---
blog/feeds/all-en.atom.xml | 6 +++---
blog/feeds/blog.atom.xml | 6 +++---
blog/feeds/pmc.atom.xml | 6 +++---
4 files changed, 12 insertions(+), 12 deletions(-)
diff --git a/blog/2026/01/08/datafusion-52.0.0/index.html
b/blog/2026/01/08/datafusion-52.0.0/index.html
index 9d41431..94cb052 100644
--- a/blog/2026/01/08/datafusion-52.0.0/index.html
+++ b/blog/2026/01/08/datafusion-52.0.0/index.html
@@ -96,7 +96,7 @@ making this release possible.</p>
<p>TODO: confirm the release date for 52.0.0 and update the front matter if
needed.</p>
<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
<p>We continue to make significant performance improvements in DataFusion. This
-release includes faster <code>CASE</code> expressions (see below), a new
SortMergeJoin,
+release includes faster <code>CASE</code> expressions (see below),
SortMergeJoin buffering optimizations,
automatic caching of metadata, statistics, and listing results for
ListingTable,
improved hashing and grouping performance for string types, and string function
optimizations.</p>
@@ -105,7 +105,7 @@ optimizations.</p>
<p><img alt="Performance over time" class="img-responsive"
src="/blog/images/datafusion-52.0.0/performance_over_time_clickbench.png"
width="100%"/></p>
<p><strong>Figure 1</strong>: TODO: update caption for 52.0.0 benchmarking
results.</p>
<h3 id="faster-case-expression-evaluation">Faster <code>CASE</code> expression
evaluation<a class="headerlink" href="#faster-case-expression-evaluation"
title="Permanent link">¶</a></h3>
-<p>DataFusion 52 completes major work from the CASE performance epic (<a
href="https://github.com/apache/datafusion/issues/18075">#18075</a>).
+<p>DataFusion 52 completes major work from the <code>CASE</code> performance
epic (<a href="https://github.com/apache/datafusion/issues/18075">#18075</a>).
Lookup-table based evaluation avoids repeated expression evaluation and reduces
branching overhead, accelerating common ETL patterns.</p>
<p>Example:</p>
@@ -121,7 +121,7 @@ GROUP BY 1;
</code></pre>
<p>Related PRs: <a
href="https://github.com/apache/datafusion/pull/18183">#18183</a></p>
<h3 id="rewritten-merge-join">Rewritten merge join<a class="headerlink"
href="#rewritten-merge-join" title="Permanent link">¶</a></h3>
-<p>DataFusion 52 includes a rewrite of the sort-merge join output buffering to
+<p>DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output
buffering to
avoid excessive <code>concat_batches</code> work and to use
<code>BatchCoalescer</code> internally and
for final output. This change targets pathological slowdowns like the reported
LeftAnti join case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487</a>, which also
affected Comet workloads that rely on
diff --git a/blog/feeds/all-en.atom.xml b/blog/feeds/all-en.atom.xml
index e0ba1d4..8712152 100644
--- a/blog/feeds/all-en.atom.xml
+++ b/blog/feeds/all-en.atom.xml
@@ -332,7 +332,7 @@ making this release possible.</p>
<p>TODO: confirm the release date for 52.0.0 and update the front matter
if needed.</p>
<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
<p>We continue to make significant performance improvements in
DataFusion. This
-release includes faster <code>CASE</code> expressions (see below),
a new SortMergeJoin,
+release includes faster <code>CASE</code> expressions (see below),
SortMergeJoin buffering optimizations,
automatic caching of metadata, statistics, and listing results for
ListingTable,
improved hashing and grouping performance for string types, and string function
optimizations.</p>
@@ -341,7 +341,7 @@ optimizations.</p>
<p><img alt="Performance over time" class="img-responsive"
src="/blog/images/datafusion-52.0.0/performance_over_time_clickbench.png"
width="100%"/></p>
<p><strong>Figure 1</strong>: TODO: update caption for
52.0.0 benchmarking results.</p>
<h3 id="faster-case-expression-evaluation">Faster
<code>CASE</code> expression evaluation<a class="headerlink"
href="#faster-case-expression-evaluation" title="Permanent
link">¶</a></h3>
-<p>DataFusion 52 completes major work from the CASE performance epic
(<a
href="https://github.com/apache/datafusion/issues/18075">#18075</a>).
+<p>DataFusion 52 completes major work from the
<code>CASE</code> performance epic (<a
href="https://github.com/apache/datafusion/issues/18075">#18075</a>).
Lookup-table based evaluation avoids repeated expression evaluation and reduces
branching overhead, accelerating common ETL patterns.</p>
<p>Example:</p>
@@ -357,7 +357,7 @@ GROUP BY 1;
</code></pre>
<p>Related PRs: <a
href="https://github.com/apache/datafusion/pull/18183">#18183</a></p>
<h3 id="rewritten-merge-join">Rewritten merge join<a
class="headerlink" href="#rewritten-merge-join" title="Permanent
link">¶</a></h3>
-<p>DataFusion 52 includes a rewrite of the sort-merge join output
buffering to
+<p>DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output
buffering to
avoid excessive <code>concat_batches</code> work and to use
<code>BatchCoalescer</code> internally and
for final output. This change targets pathological slowdowns like the reported
LeftAnti join case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487</a>,
which also affected Comet workloads that rely on
diff --git a/blog/feeds/blog.atom.xml b/blog/feeds/blog.atom.xml
index 2bde99e..69d73ae 100644
--- a/blog/feeds/blog.atom.xml
+++ b/blog/feeds/blog.atom.xml
@@ -332,7 +332,7 @@ making this release possible.</p>
<p>TODO: confirm the release date for 52.0.0 and update the front matter
if needed.</p>
<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
<p>We continue to make significant performance improvements in
DataFusion. This
-release includes faster <code>CASE</code> expressions (see below),
a new SortMergeJoin,
+release includes faster <code>CASE</code> expressions (see below),
SortMergeJoin buffering optimizations,
automatic caching of metadata, statistics, and listing results for
ListingTable,
improved hashing and grouping performance for string types, and string function
optimizations.</p>
@@ -341,7 +341,7 @@ optimizations.</p>
<p><img alt="Performance over time" class="img-responsive"
src="/blog/images/datafusion-52.0.0/performance_over_time_clickbench.png"
width="100%"/></p>
<p><strong>Figure 1</strong>: TODO: update caption for
52.0.0 benchmarking results.</p>
<h3 id="faster-case-expression-evaluation">Faster
<code>CASE</code> expression evaluation<a class="headerlink"
href="#faster-case-expression-evaluation" title="Permanent
link">¶</a></h3>
-<p>DataFusion 52 completes major work from the CASE performance epic
(<a
href="https://github.com/apache/datafusion/issues/18075">#18075</a>).
+<p>DataFusion 52 completes major work from the
<code>CASE</code> performance epic (<a
href="https://github.com/apache/datafusion/issues/18075">#18075</a>).
Lookup-table based evaluation avoids repeated expression evaluation and reduces
branching overhead, accelerating common ETL patterns.</p>
<p>Example:</p>
@@ -357,7 +357,7 @@ GROUP BY 1;
</code></pre>
<p>Related PRs: <a
href="https://github.com/apache/datafusion/pull/18183">#18183</a></p>
<h3 id="rewritten-merge-join">Rewritten merge join<a
class="headerlink" href="#rewritten-merge-join" title="Permanent
link">¶</a></h3>
-<p>DataFusion 52 includes a rewrite of the sort-merge join output
buffering to
+<p>DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output
buffering to
avoid excessive <code>concat_batches</code> work and to use
<code>BatchCoalescer</code> internally and
for final output. This change targets pathological slowdowns like the reported
LeftAnti join case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487</a>,
which also affected Comet workloads that rely on
diff --git a/blog/feeds/pmc.atom.xml b/blog/feeds/pmc.atom.xml
index b889d20..1a99be2 100644
--- a/blog/feeds/pmc.atom.xml
+++ b/blog/feeds/pmc.atom.xml
@@ -48,7 +48,7 @@ making this release possible.</p>
<p>TODO: confirm the release date for 52.0.0 and update the front matter
if needed.</p>
<h2 id="performance-improvements">Performance Improvements 🚀<a
class="headerlink" href="#performance-improvements" title="Permanent
link">¶</a></h2>
<p>We continue to make significant performance improvements in
DataFusion. This
-release includes faster <code>CASE</code> expressions (see below),
a new SortMergeJoin,
+release includes faster <code>CASE</code> expressions (see below),
SortMergeJoin buffering optimizations,
automatic caching of metadata, statistics, and listing results for
ListingTable,
improved hashing and grouping performance for string types, and string function
optimizations.</p>
@@ -57,7 +57,7 @@ optimizations.</p>
<p><img alt="Performance over time" class="img-responsive"
src="/blog/images/datafusion-52.0.0/performance_over_time_clickbench.png"
width="100%"/></p>
<p><strong>Figure 1</strong>: TODO: update caption for
52.0.0 benchmarking results.</p>
<h3 id="faster-case-expression-evaluation">Faster
<code>CASE</code> expression evaluation<a class="headerlink"
href="#faster-case-expression-evaluation" title="Permanent
link">¶</a></h3>
-<p>DataFusion 52 completes major work from the CASE performance epic
(<a
href="https://github.com/apache/datafusion/issues/18075">#18075</a>).
+<p>DataFusion 52 completes major work from the
<code>CASE</code> performance epic (<a
href="https://github.com/apache/datafusion/issues/18075">#18075</a>).
Lookup-table based evaluation avoids repeated expression evaluation and reduces
branching overhead, accelerating common ETL patterns.</p>
<p>Example:</p>
@@ -73,7 +73,7 @@ GROUP BY 1;
</code></pre>
<p>Related PRs: <a
href="https://github.com/apache/datafusion/pull/18183">#18183</a></p>
<h3 id="rewritten-merge-join">Rewritten merge join<a
class="headerlink" href="#rewritten-merge-join" title="Permanent
link">¶</a></h3>
-<p>DataFusion 52 includes a rewrite of the sort-merge join output
buffering to
+<p>DataFusion 52 includes a rewrite of the sort-merge join (SMJ) output
buffering to
avoid excessive <code>concat_batches</code> work and to use
<code>BatchCoalescer</code> internally and
for final output. This change targets pathological slowdowns like the reported
LeftAnti join case in <a
href="https://github.com/apache/datafusion/issues/18487">#18487</a>,
which also affected Comet workloads that rely on
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]