alamb commented on PR #5322:
URL: 
https://github.com/apache/arrow-datafusion/pull/5322#issuecomment-1441840741

   >  In the past, we had implement something called Range Join which 
internally build some range indexes, I am not familiar with Symmetric Hash 
Join, I will take a closer look at this PR.
   
   I believe SHJ handles a larger class of queries as a typical Range Join 
query, though I suspect that given all else being equal one could probably make 
range joins go faster as there are typically no hash tables to maintain
   
   The excellent blog post from this PR has great background on symmetric hash 
join 
https://www.synnada.ai/blog/general-purpose-stream-joins-via-pruning-symmetric-hash-joins
   
   More reading on range joins 
https://www.vertica.com/blog/what-is-a-range-join-and-why-is-it-so-fastba-p223413/
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to