Dandandan commented on pull request #844: URL: https://github.com/apache/arrow-datafusion/pull/844#issuecomment-895739487
I agree vectorizing that part can be hard I think it means somehow delaying the collision handling and doing it for the full batch instead. That might require implementing a different hash table data structure or ignoring the collisions in the first place. This is a good improvement over what we have. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
