Dandandan commented on PR #3862:
URL:
https://github.com/apache/arrow-datafusion/pull/3862#issuecomment-1281493933
Looking at the q2 join shows that the optimization brings little benefit for
this query: the output size of this join is already quite small (it is not a
join that "blows up"):
```
HashJoinExec: mode=Partitioned, join_type=Inner,
  on=[(Column { name: "p_partkey", index: 0 }, Column { name: "ps_partkey", index: 0 })],
  metrics=[output_rows=6351, input_batches=16, output_batches=16,
           input_rows=1183098, join_time=10.810036ms]
```
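As a rough sanity check on these numbers, the sketch below computes the ratio
of output rows to input rows from the metrics quoted above. It is written in
Python purely for illustration; `join_selectivity` is a hypothetical helper,
not a DataFusion API, and it assumes `input_rows` counts the rows fed into the
join.

```python
# Metrics copied from the HashJoinExec line quoted above.
output_rows = 6351
input_rows = 1_183_098


def join_selectivity(output_rows: int, input_rows: int) -> float:
    """Output rows per input row (hypothetical helper, not part of DataFusion)."""
    if input_rows == 0:
        # Avoid division by zero for an empty input.
        return 0.0
    return output_rows / input_rows


ratio = join_selectivity(output_rows, input_rows)
# A ratio well below 1 means the join shrinks its input rather than expanding it.
print(f"output/input = {ratio:.4%}")
```

Under that assumption the join emits only about 0.5% as many rows as it reads,
which matches the observation that it does not "blow up".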
However, I think adding it to the join for correlated subqueries is still the
"safer choice".