alamb commented on issue #15628: URL: https://github.com/apache/datafusion/issues/15628#issuecomment-2816665497
TLDR: while this is a straightforward bug report, I think fixing it is not something we are going to make a patch for -- it will require a more serious implementation effort for joins in DataFusion.

Join implementations in general are a complex topic. Maybe it is time to mount and organize a project to improve the situation in DataFusion.

As @Dandandan alluded to, I don't think we need to reinvent this feature -- there is lots of prior academic work on this topic.

What I suggest is that someone updates our documentation with the current state of joins in DataFusion (namely, which operators are implemented, what types of joins they are used for, and any limitations).

Then we can figure out which of the many, many exotic join algorithms / implementations that exist would be good to move forward with.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at: us...@infra.apache.org