This is an automated email from the ASF dual-hosted git repository.
github-bot pushed a commit to branch asf-staging
in repository https://gitbox.apache.org/repos/asf/datafusion-site.git
The following commit(s) were added to refs/heads/asf-staging by this push:
new 4c4b238 Commit build products
4c4b238 is described below
commit 4c4b2386f67fea74188a4b8da915d4ce24440eac
Author: Build Pelican (action) <[email protected]>
AuthorDate: Sat Jan 24 12:06:50 2026 +0000
Commit build products
---
blog/2026/01/08/datafusion-52.0.0/index.html | 8 +++++---
blog/feeds/all-en.atom.xml | 8 +++++---
blog/feeds/blog.atom.xml | 8 +++++---
blog/feeds/pmc.atom.xml | 8 +++++---
4 files changed, 20 insertions(+), 12 deletions(-)
diff --git a/blog/2026/01/08/datafusion-52.0.0/index.html
b/blog/2026/01/08/datafusion-52.0.0/index.html
index 8c9a858..0752c1e 100644
--- a/blog/2026/01/08/datafusion-52.0.0/index.html
+++ b/blog/2026/01/08/datafusion-52.0.0/index.html
@@ -166,9 +166,11 @@ dynamically to scans, as explained in the <a
href="https://datafusion.apache.org
technique referred to as <a
href="https://dl.acm.org/doi/10.1109/ICDE.2008.4497486">Sideways Information
Passing</a> in Database research
literature. The initial implementation passed min/max values for the join keys.
DataFusion 52 extends the optimization (<a
href="https://github.com/apache/datafusion/issues/17171">#17171</a> / <a
href="https://github.com/apache/datafusion/pull/18393">#18393</a>) to use an
<code>IN</code> list when the
-build size is small such as when the join is very selective. The
<code>IN</code> list is
-pushed down to the probe side scan and is used to prune files, row groups, and
-individual rows. Thanks to <a href="https://github.com/adriangb">adriangb</a>
for implementing this feature, with
+build size is small such as when the join is very selective or a reference to
the build side hash map when the build side is larger.
+These new expressions are pushed down to the probe side scan and is used to
prune files, row groups, and
+individual rows.
+When the build side is small enough (<=20 rows but configurable) the pushed
down filters can even participate in statistics pruning to avoid even reading
the join keys from row groups that will not match.</p>
+<p>Thanks to <a href="https://github.com/adriangb">adriangb</a> for
implementing this feature, with
reviews from <a href="https://github.com/LiaCastaneda">LiaCastaneda</a>, <a
href="https://github.com/asolimando">asolimando</a>, <a
href="https://github.com/comphead">comphead</a>, and <a
href="https://github.com/mbutrovich">mbutrovich</a>.</p>
<h2 id="major-features">Major Features ✨<a class="headerlink"
href="#major-features" title="Permanent link">¶</a></h2>
<h3 id="arrow-ipc-stream-file-support">Arrow IPC Stream file support<a
class="headerlink" href="#arrow-ipc-stream-file-support" title="Permanent
link">¶</a></h3>
diff --git a/blog/feeds/all-en.atom.xml b/blog/feeds/all-en.atom.xml
index 5b43a53..1aa4cf9 100644
--- a/blog/feeds/all-en.atom.xml
+++ b/blog/feeds/all-en.atom.xml
@@ -404,9 +404,11 @@ dynamically to scans, as explained in the <a
href="https://datafusion.apache.
technique referred to as <a
href="https://dl.acm.org/doi/10.1109/ICDE.2008.4497486">Sideways Information
Passing</a> in Database research
literature. The initial implementation passed min/max values for the join keys.
DataFusion 52 extends the optimization (<a
href="https://github.com/apache/datafusion/issues/17171">#17171</a> /
<a
href="https://github.com/apache/datafusion/pull/18393">#18393</a>) to
use an <code>IN</code> list when the
-build size is small such as when the join is very selective. The
<code>IN</code> list is
-pushed down to the probe side scan and is used to prune files, row groups, and
-individual rows. Thanks to <a
href="https://github.com/adriangb">adriangb</a> for implementing this
feature, with
+build size is small such as when the join is very selective or a reference to
the build side hash map when the build side is larger.
+These new expressions are pushed down to the probe side scan and is used to
prune files, row groups, and
+individual rows.
+When the build side is small enough (&lt;=20 rows but configurable) the
pushed down filters can even participate in statistics pruning to avoid even
reading the join keys from row groups that will not match.</p>
+<p>Thanks to <a
href="https://github.com/adriangb">adriangb</a> for implementing this
feature, with
reviews from <a
href="https://github.com/LiaCastaneda">LiaCastaneda</a>, <a
href="https://github.com/asolimando">asolimando</a>, <a
href="https://github.com/comphead">comphead</a>, and <a
href="https://github.com/mbutrovich">mbutrovich</a>.</p>
<h2 id="major-features">Major Features ✨<a class="headerlink"
href="#major-features" title="Permanent link">¶</a></h2>
<h3 id="arrow-ipc-stream-file-support">Arrow IPC Stream file
support<a class="headerlink" href="#arrow-ipc-stream-file-support"
title="Permanent link">¶</a></h3>
diff --git a/blog/feeds/blog.atom.xml b/blog/feeds/blog.atom.xml
index 7b94de3..896d617 100644
--- a/blog/feeds/blog.atom.xml
+++ b/blog/feeds/blog.atom.xml
@@ -404,9 +404,11 @@ dynamically to scans, as explained in the <a
href="https://datafusion.apache.
technique referred to as <a
href="https://dl.acm.org/doi/10.1109/ICDE.2008.4497486">Sideways Information
Passing</a> in Database research
literature. The initial implementation passed min/max values for the join keys.
DataFusion 52 extends the optimization (<a
href="https://github.com/apache/datafusion/issues/17171">#17171</a> /
<a
href="https://github.com/apache/datafusion/pull/18393">#18393</a>) to
use an <code>IN</code> list when the
-build size is small such as when the join is very selective. The
<code>IN</code> list is
-pushed down to the probe side scan and is used to prune files, row groups, and
-individual rows. Thanks to <a
href="https://github.com/adriangb">adriangb</a> for implementing this
feature, with
+build size is small such as when the join is very selective or a reference to
the build side hash map when the build side is larger.
+These new expressions are pushed down to the probe side scan and is used to
prune files, row groups, and
+individual rows.
+When the build side is small enough (&lt;=20 rows but configurable) the
pushed down filters can even participate in statistics pruning to avoid even
reading the join keys from row groups that will not match.</p>
+<p>Thanks to <a
href="https://github.com/adriangb">adriangb</a> for implementing this
feature, with
reviews from <a
href="https://github.com/LiaCastaneda">LiaCastaneda</a>, <a
href="https://github.com/asolimando">asolimando</a>, <a
href="https://github.com/comphead">comphead</a>, and <a
href="https://github.com/mbutrovich">mbutrovich</a>.</p>
<h2 id="major-features">Major Features ✨<a class="headerlink"
href="#major-features" title="Permanent link">¶</a></h2>
<h3 id="arrow-ipc-stream-file-support">Arrow IPC Stream file
support<a class="headerlink" href="#arrow-ipc-stream-file-support"
title="Permanent link">¶</a></h3>
diff --git a/blog/feeds/pmc.atom.xml b/blog/feeds/pmc.atom.xml
index 1a0ec8d..c0c97d7 100644
--- a/blog/feeds/pmc.atom.xml
+++ b/blog/feeds/pmc.atom.xml
@@ -120,9 +120,11 @@ dynamically to scans, as explained in the <a
href="https://datafusion.apache.
technique referred to as <a
href="https://dl.acm.org/doi/10.1109/ICDE.2008.4497486">Sideways Information
Passing</a> in Database research
literature. The initial implementation passed min/max values for the join keys.
DataFusion 52 extends the optimization (<a
href="https://github.com/apache/datafusion/issues/17171">#17171</a> /
<a
href="https://github.com/apache/datafusion/pull/18393">#18393</a>) to
use an <code>IN</code> list when the
-build size is small such as when the join is very selective. The
<code>IN</code> list is
-pushed down to the probe side scan and is used to prune files, row groups, and
-individual rows. Thanks to <a
href="https://github.com/adriangb">adriangb</a> for implementing this
feature, with
+build size is small such as when the join is very selective or a reference to
the build side hash map when the build side is larger.
+These new expressions are pushed down to the probe side scan and is used to
prune files, row groups, and
+individual rows.
+When the build side is small enough (&lt;=20 rows but configurable) the
pushed down filters can even participate in statistics pruning to avoid even
reading the join keys from row groups that will not match.</p>
+<p>Thanks to <a
href="https://github.com/adriangb">adriangb</a> for implementing this
feature, with
reviews from <a
href="https://github.com/LiaCastaneda">LiaCastaneda</a>, <a
href="https://github.com/asolimando">asolimando</a>, <a
href="https://github.com/comphead">comphead</a>, and <a
href="https://github.com/mbutrovich">mbutrovich</a>.</p>
<h2 id="major-features">Major Features ✨<a class="headerlink"
href="#major-features" title="Permanent link">¶</a></h2>
<h3 id="arrow-ipc-stream-file-support">Arrow IPC Stream file
support<a class="headerlink" href="#arrow-ipc-stream-file-support"
title="Permanent link">¶</a></h3>
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]