alamb commented on issue #790: URL: https://github.com/apache/arrow-datafusion/issues/790#issuecomment-888534791
> From the hashing side, an unknown to me atm is how to efficiently hash values+validity. I.e. given V = ["a", "", "c"] and N = [true, false, true], @jorgecarleitao -- excellent point. I have some idea of how to potentially benchmark these / keep the code separate to allow switching in different implementations. Thank you for the options (I was planning on doing the first thing you suggested, but it is good to think about the others. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
