neilconway opened a new issue, #22673: URL: https://github.com/apache/datafusion/issues/22673
### Describe the bug `estimate_join_cardinality` for semi-joins checks if ANY of the columns in the two join inputs are disjoint (comparing columns positionally); if so, it claims the join will not return any rows. This is wrong, for two reasons: 1. If two columns don't participate in the join key, they have no impact on the cardinality of the join result 2. Comparing arbitrary columns positionally is not a sensible thing to do in the first place A similar issue exists for anti-joins, except we assume the anti-join will return the entire join input in this case. ### To Reproduce _No response_ ### Expected behavior _No response_ ### Additional context _No response_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
