Andy Grove created ARROW-10782:
----------------------------------

             Summary: [Rust] [DataFusion] Optimize hash join to use smaller 
relation as build side
                 Key: ARROW-10782
                 URL: https://issues.apache.org/jira/browse/ARROW-10782
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Rust - DataFusion
            Reporter: Andy Grove


When performing an inner join using the hash join algorithm, it is more 
efficient to load the smaller table into memory and then stream the larger 
table.

We should the statistics made available in 
https://issues.apache.org/jira/browse/ARROW-10781 to build an optimizer rule to 
determine the smaller side of a join and use that as the build/hash side.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to