alamb commented on PR #5322: URL: https://github.com/apache/arrow-datafusion/pull/5322#issuecomment-1441840741
> In the past, we had implemented something called Range Join, which internally builds some range indexes. I am not familiar with Symmetric Hash Join; I will take a closer look at this PR.

I believe SHJ handles a larger class of queries than a typical Range Join, though I suspect that, all else being equal, one could probably make range joins go faster, as there are typically no hash tables to maintain.

The excellent blog post from this PR has great background on symmetric hash join: https://www.synnada.ai/blog/general-purpose-stream-joins-via-pruning-symmetric-hash-joins

More reading on range joins: https://www.vertica.com/blog/what-is-a-range-join-and-why-is-it-so-fastba-p223413/
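
For readers who, like the quoted commenter, have not met symmetric hash join before, here is a minimal sketch of the textbook algorithm for an equi-join over two interleaved streams. It is only an illustration, not the implementation in this PR: the function name `symmetric_hash_join` and the `(side, key, value)` event format are hypothetical, and the sketch leaves out the pruning that the linked blog post's title refers to. It does show the two hash tables that have to be maintained, one per input.

```python
from collections import defaultdict


def symmetric_hash_join(events):
    """Join two interleaved streams on equal keys.

    `events` is an iterable of (side, key, value) tuples, where side is
    "L" or "R" (a hypothetical format chosen for this sketch).
    Yields (key, left_value, right_value) as soon as a match is known.
    """
    # One hash table per input; both grow as rows arrive (no pruning here).
    tables = {"L": defaultdict(list), "R": defaultdict(list)}
    for side, key, value in events:
        other = "R" if side == "L" else "L"
        # 1. Insert the new row into its own side's table.
        tables[side][key].append(value)
        # 2. Probe the opposite side's table for rows seen so far.
        for match in tables[other].get(key, []):
            if side == "L":
                yield (key, value, match)
            else:
                yield (key, match, value)


if __name__ == "__main__":
    stream = [("L", 1, "a"), ("R", 1, "x"), ("R", 2, "y"), ("L", 2, "b")]
    for row in symmetric_hash_join(stream):
        print(row)
```

Because each arriving row is inserted and probed immediately, results are emitted incrementally from either side, which is what makes the approach suitable for streaming inputs.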