mbutrovich commented on PR #22828: URL: https://github.com/apache/datafusion/pull/22828#issuecomment-4652833054
> ## Rationale for this change > Right semi and right anti joins only need to know whether each probe-side row has at least one build-side match. The existing hash join probe path can materialize every build-side match before the semi/anti output projection, which is unnecessarily expensive for high-fanout build keys. Seems similar to what I did for SMJ in https://github.com/apache/datafusion/pull/20806. There was a ton of wasted materialization work before. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
