[GitHub] [arrow-datafusion] Dandandan commented on pull request #3862: put subquery's equal clause into join on clauses instead of filter cl…

GitBox Mon, 17 Oct 2022 14:05:55 -0700


Dandandan commented on PR #3862:
URL: 
https://github.com/apache/arrow-datafusion/pull/3862#issuecomment-1281493933


   Looking at the q2 join reveals that there is not much benefit from the 
optimization for this query: the output size is already pretty small for this 
join (there is no join that "blows up"):
   
   ```
             HashJoinExec: mode=Partitioned, join_type=Inner, on=[(Column { name: "p_partkey", index: 0 }, Column { name: "ps_partkey", index: 0 })], metrics=[output_rows=6351, input_batches=16, output_batches=16, input_rows=1183098, join_time=10.810036ms]
   ```
   
   I think, however, that adding it to the join for correlated subqueries is 
still the "safer choice".
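
   To illustrate why the join condition can be the safer place for the equality, here is a toy Python sketch. It is not DataFusion code: the function names, the synthetic rows, and the assumption that the join has no other key (so the filter variant degenerates to a cross join) are all made up for illustration. It only counts how many candidate row pairs each strategy has to look at.

```python
from collections import defaultdict


def filter_after_cross_join(left, right):
    """Pair every left row with every right row, then keep equal keys.

    Hypothetical stand-in for an equality that is only applied as a filter.
    """
    candidates = 0
    out = []
    for l in left:
        for r in right:
            candidates += 1
            if l["p_partkey"] == r["ps_partkey"]:
                out.append((l["p_partkey"], r["ps_partkey"]))
    return out, candidates


def hash_join_on_key(left, right):
    """Build a hash table on the left key and probe it with the right rows.

    Hypothetical stand-in for an equality that is part of the join's ON clause.
    """
    table = defaultdict(list)
    for l in left:
        table[l["p_partkey"]].append(l)

    candidates = 0
    out = []
    for r in right:
        # .get() avoids inserting empty buckets for keys with no match.
        for l in table.get(r["ps_partkey"], ()):
            candidates += 1
            out.append((l["p_partkey"], r["ps_partkey"]))
    return out, candidates


# Synthetic data, invented for this sketch.
left = [{"p_partkey": k} for k in range(100)]
right = [{"ps_partkey": k % 150} for k in range(400)]

rows_filter, cand_filter = filter_after_cross_join(left, right)
rows_join, cand_join = hash_join_on_key(left, right)

print(len(rows_filter), cand_filter)  # 300 40000
print(len(rows_join), cand_join)      # 300 300
```

   Both variants return the same 300 rows, but the filter variant inspects 40000 pairs while the keyed join inspects only the 300 that match. When the other join keys already keep the output small, as in q2, the difference mostly disappears.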


