(datafusion-site) branch asf-staging updated: Commit build products

github-bot Sat, 24 Jan 2026 04:07:19 -0800

This is an automated email from the ASF dual-hosted git repository.

github-bot pushed a commit to branch asf-staging
in repository https://gitbox.apache.org/repos/asf/datafusion-site.git



The following commit(s) were added to refs/heads/asf-staging by this push:
     new 4c4b238  Commit build products
4c4b238 is described below

commit 4c4b2386f67fea74188a4b8da915d4ce24440eac
Author: Build Pelican (action) <[email protected]>
AuthorDate: Sat Jan 24 12:06:50 2026 +0000

    Commit build products
---
 blog/2026/01/08/datafusion-52.0.0/index.html | 8 +++++---
 blog/feeds/all-en.atom.xml                   | 8 +++++---
 blog/feeds/blog.atom.xml                     | 8 +++++---
 blog/feeds/pmc.atom.xml                      | 8 +++++---
 4 files changed, 20 insertions(+), 12 deletions(-)

diff --git a/blog/2026/01/08/datafusion-52.0.0/index.html 
b/blog/2026/01/08/datafusion-52.0.0/index.html
index 8c9a858..0752c1e 100644
--- a/blog/2026/01/08/datafusion-52.0.0/index.html
+++ b/blog/2026/01/08/datafusion-52.0.0/index.html
@@ -166,9 +166,11 @@ dynamically to scans, as explained in the <a 
href="https://datafusion.apache.org
 technique referred to as <a 
href="https://dl.acm.org/doi/10.1109/ICDE.2008.4497486";>Sideways Information 
Passing</a> in Database research
 literature. The initial implementation passed min/max values for the join keys.
 DataFusion 52 extends the optimization (<a 
href="https://github.com/apache/datafusion/issues/17171";>#17171</a> / <a 
href="https://github.com/apache/datafusion/pull/18393";>#18393</a>) to use an 
<code>IN</code> list when the
-build size is small such as when the join is very selective. The 
<code>IN</code> list is
-pushed down to the probe side scan and is used to prune files, row groups, and
-individual rows.  Thanks to <a href="https://github.com/adriangb";>adriangb</a> 
for implementing this feature, with
+build size is small such as when the join is very selective or a reference to 
the build side hash map when the build side is larger.
+These new expressions are pushed down to the probe side scan and is used to 
prune files, row groups, and
+individual rows.
+When the build side is small enough (&lt;=20 rows but configurable) the pushed 
down filters can even participate in statistics pruning to avoid even reading 
the join keys from row groups that will not match.</p>
+<p>Thanks to <a href="https://github.com/adriangb";>adriangb</a> for 
implementing this feature, with
 reviews from <a href="https://github.com/LiaCastaneda";>LiaCastaneda</a>, <a 
href="https://github.com/asolimando";>asolimando</a>, <a 
href="https://github.com/comphead";>comphead</a>, and <a 
href="https://github.com/mbutrovich";>mbutrovich</a>.</p>
 <h2 id="major-features">Major Features ✨<a class="headerlink" 
href="#major-features" title="Permanent link">¶</a></h2>
 <h3 id="arrow-ipc-stream-file-support">Arrow IPC Stream file support<a 
class="headerlink" href="#arrow-ipc-stream-file-support" title="Permanent 
link">¶</a></h3>
diff --git a/blog/feeds/all-en.atom.xml b/blog/feeds/all-en.atom.xml
index 5b43a53..1aa4cf9 100644
--- a/blog/feeds/all-en.atom.xml
+++ b/blog/feeds/all-en.atom.xml
@@ -404,9 +404,11 @@ dynamically to scans, as explained in the &lt;a 
href="https://datafusion.apache.
 technique referred to as &lt;a 
href="https://dl.acm.org/doi/10.1109/ICDE.2008.4497486"&gt;Sideways Information 
Passing&lt;/a&gt; in Database research
 literature. The initial implementation passed min/max values for the join keys.
 DataFusion 52 extends the optimization (&lt;a 
href="https://github.com/apache/datafusion/issues/17171"&gt;#17171&lt;/a&gt; / 
&lt;a 
href="https://github.com/apache/datafusion/pull/18393"&gt;#18393&lt;/a&gt;) to 
use an &lt;code&gt;IN&lt;/code&gt; list when the
-build size is small such as when the join is very selective. The 
&lt;code&gt;IN&lt;/code&gt; list is
-pushed down to the probe side scan and is used to prune files, row groups, and
-individual rows.  Thanks to &lt;a 
href="https://github.com/adriangb"&gt;adriangb&lt;/a&gt; for implementing this 
feature, with
+build size is small such as when the join is very selective or a reference to 
the build side hash map when the build side is larger.
+These new expressions are pushed down to the probe side scan and is used to 
prune files, row groups, and
+individual rows.
+When the build side is small enough (&amp;lt;=20 rows but configurable) the 
pushed down filters can even participate in statistics pruning to avoid even 
reading the join keys from row groups that will not match.&lt;/p&gt;
+&lt;p&gt;Thanks to &lt;a 
href="https://github.com/adriangb"&gt;adriangb&lt;/a&gt; for implementing this 
feature, with
 reviews from &lt;a 
href="https://github.com/LiaCastaneda"&gt;LiaCastaneda&lt;/a&gt;, &lt;a 
href="https://github.com/asolimando"&gt;asolimando&lt;/a&gt;, &lt;a 
href="https://github.com/comphead"&gt;comphead&lt;/a&gt;, and &lt;a 
href="https://github.com/mbutrovich"&gt;mbutrovich&lt;/a&gt;.&lt;/p&gt;
 &lt;h2 id="major-features"&gt;Major Features ✨&lt;a class="headerlink" 
href="#major-features" title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h2&gt;
 &lt;h3 id="arrow-ipc-stream-file-support"&gt;Arrow IPC Stream file 
support&lt;a class="headerlink" href="#arrow-ipc-stream-file-support" 
title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h3&gt;
diff --git a/blog/feeds/blog.atom.xml b/blog/feeds/blog.atom.xml
index 7b94de3..896d617 100644
--- a/blog/feeds/blog.atom.xml
+++ b/blog/feeds/blog.atom.xml
@@ -404,9 +404,11 @@ dynamically to scans, as explained in the &lt;a 
href="https://datafusion.apache.
 technique referred to as &lt;a 
href="https://dl.acm.org/doi/10.1109/ICDE.2008.4497486"&gt;Sideways Information 
Passing&lt;/a&gt; in Database research
 literature. The initial implementation passed min/max values for the join keys.
 DataFusion 52 extends the optimization (&lt;a 
href="https://github.com/apache/datafusion/issues/17171"&gt;#17171&lt;/a&gt; / 
&lt;a 
href="https://github.com/apache/datafusion/pull/18393"&gt;#18393&lt;/a&gt;) to 
use an &lt;code&gt;IN&lt;/code&gt; list when the
-build size is small such as when the join is very selective. The 
&lt;code&gt;IN&lt;/code&gt; list is
-pushed down to the probe side scan and is used to prune files, row groups, and
-individual rows.  Thanks to &lt;a 
href="https://github.com/adriangb"&gt;adriangb&lt;/a&gt; for implementing this 
feature, with
+build size is small such as when the join is very selective or a reference to 
the build side hash map when the build side is larger.
+These new expressions are pushed down to the probe side scan and is used to 
prune files, row groups, and
+individual rows.
+When the build side is small enough (&amp;lt;=20 rows but configurable) the 
pushed down filters can even participate in statistics pruning to avoid even 
reading the join keys from row groups that will not match.&lt;/p&gt;
+&lt;p&gt;Thanks to &lt;a 
href="https://github.com/adriangb"&gt;adriangb&lt;/a&gt; for implementing this 
feature, with
 reviews from &lt;a 
href="https://github.com/LiaCastaneda"&gt;LiaCastaneda&lt;/a&gt;, &lt;a 
href="https://github.com/asolimando"&gt;asolimando&lt;/a&gt;, &lt;a 
href="https://github.com/comphead"&gt;comphead&lt;/a&gt;, and &lt;a 
href="https://github.com/mbutrovich"&gt;mbutrovich&lt;/a&gt;.&lt;/p&gt;
 &lt;h2 id="major-features"&gt;Major Features ✨&lt;a class="headerlink" 
href="#major-features" title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h2&gt;
 &lt;h3 id="arrow-ipc-stream-file-support"&gt;Arrow IPC Stream file 
support&lt;a class="headerlink" href="#arrow-ipc-stream-file-support" 
title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h3&gt;
diff --git a/blog/feeds/pmc.atom.xml b/blog/feeds/pmc.atom.xml
index 1a0ec8d..c0c97d7 100644
--- a/blog/feeds/pmc.atom.xml
+++ b/blog/feeds/pmc.atom.xml
@@ -120,9 +120,11 @@ dynamically to scans, as explained in the &lt;a 
href="https://datafusion.apache.
 technique referred to as &lt;a 
href="https://dl.acm.org/doi/10.1109/ICDE.2008.4497486"&gt;Sideways Information 
Passing&lt;/a&gt; in Database research
 literature. The initial implementation passed min/max values for the join keys.
 DataFusion 52 extends the optimization (&lt;a 
href="https://github.com/apache/datafusion/issues/17171"&gt;#17171&lt;/a&gt; / 
&lt;a 
href="https://github.com/apache/datafusion/pull/18393"&gt;#18393&lt;/a&gt;) to 
use an &lt;code&gt;IN&lt;/code&gt; list when the
-build size is small such as when the join is very selective. The 
&lt;code&gt;IN&lt;/code&gt; list is
-pushed down to the probe side scan and is used to prune files, row groups, and
-individual rows.  Thanks to &lt;a 
href="https://github.com/adriangb"&gt;adriangb&lt;/a&gt; for implementing this 
feature, with
+build size is small such as when the join is very selective or a reference to 
the build side hash map when the build side is larger.
+These new expressions are pushed down to the probe side scan and is used to 
prune files, row groups, and
+individual rows.
+When the build side is small enough (&amp;lt;=20 rows but configurable) the 
pushed down filters can even participate in statistics pruning to avoid even 
reading the join keys from row groups that will not match.&lt;/p&gt;
+&lt;p&gt;Thanks to &lt;a 
href="https://github.com/adriangb"&gt;adriangb&lt;/a&gt; for implementing this 
feature, with
 reviews from &lt;a 
href="https://github.com/LiaCastaneda"&gt;LiaCastaneda&lt;/a&gt;, &lt;a 
href="https://github.com/asolimando"&gt;asolimando&lt;/a&gt;, &lt;a 
href="https://github.com/comphead"&gt;comphead&lt;/a&gt;, and &lt;a 
href="https://github.com/mbutrovich"&gt;mbutrovich&lt;/a&gt;.&lt;/p&gt;
 &lt;h2 id="major-features"&gt;Major Features ✨&lt;a class="headerlink" 
href="#major-features" title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h2&gt;
 &lt;h3 id="arrow-ipc-stream-file-support"&gt;Arrow IPC Stream file 
support&lt;a class="headerlink" href="#arrow-ipc-stream-file-support" 
title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h3&gt;


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

(datafusion-site) branch asf-staging updated: Commit build products

Reply via email to