neilconway opened a new issue, #22673:
URL: https://github.com/apache/datafusion/issues/22673

   ### Describe the bug
   
   `estimate_join_cardinality` for semi-joins checks whether ANY pair of 
columns from the two join inputs is disjoint (pairing columns by position); 
if so, it claims the join will not return any rows. This is wrong, for two reasons:
   
   1. If two columns don't participate in the join key, they have no impact on 
the cardinality of the join result
   2. Comparing arbitrary columns positionally is not a sensible thing to do in 
the first place
   
   A similar issue exists for anti-joins, except we assume the anti-join will 
return the entire join input in this case.
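
A minimal sketch of the semi-join case, assuming columns are summarized by min/max ranges. The `ColRange` type and both estimator functions are hypothetical illustrations, not DataFusion's actual statistics types or the body of `estimate_join_cardinality`. The first function pairs columns by position, as described above; the second consults only the column pairs named in the join condition.

```rust
/// Hypothetical min/max summary of one column (not DataFusion's real type).
#[derive(Clone, Copy, Debug)]
struct ColRange {
    min: i64,
    max: i64,
}

impl ColRange {
    /// True when the two ranges cannot share any value.
    fn is_disjoint(&self, other: &ColRange) -> bool {
        self.max < other.min || other.max < self.min
    }
}

/// Flawed estimate: pairs columns by position, whether or not they are join keys.
fn semi_join_rows_positional(left_rows: usize, left: &[ColRange], right: &[ColRange]) -> usize {
    let any_disjoint = left.iter().zip(right.iter()).any(|(l, r)| l.is_disjoint(r));
    if any_disjoint { 0 } else { left_rows }
}

/// Key-aware estimate: only the (left index, right index) pairs in `on` matter.
/// When the keys overlap, the left row count is returned as an upper bound.
fn semi_join_rows_by_key(
    left_rows: usize,
    left: &[ColRange],
    right: &[ColRange],
    on: &[(usize, usize)],
) -> usize {
    let keys_disjoint = on.iter().any(|&(l, r)| left[l].is_disjoint(&right[r]));
    if keys_disjoint { 0 } else { left_rows }
}

/// Made-up inputs: left(id, year) and right(price, id), joined on left.id = right.id.
fn example_inputs() -> (Vec<ColRange>, Vec<ColRange>) {
    let left = vec![
        ColRange { min: 1, max: 100 },      // id
        ColRange { min: 2000, max: 2010 },  // year
    ];
    let right = vec![
        ColRange { min: 5000, max: 9000 },  // price
        ColRange { min: 1, max: 100 },      // id
    ];
    (left, right)
}
```

With these inputs and the join condition `[(0, 1)]`, the positional version compares `left.id` against `right.price`, finds them disjoint, and reports zero rows, even though the actual join keys overlap completely. The key-aware version is not misled by the unrelated `price` column.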
   
   ### To Reproduce
   
   _No response_
   
   ### Expected behavior
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


