rluvaton commented on PR #19346: URL: https://github.com/apache/datafusion/pull/19346#issuecomment-3663676787
I wanted to see, for the group by on a wide u64 column and a string column, how much we would save by skipping the string hashing, since it is irrelevant there: in the wide u64 case all values are unique, so you don't need to hash the string. So yes, I was aware of that.

What surprises me is that the wide group by was the one that took a long time, which, as I explained, it shouldn't, since it is the same as hashing a single column.

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
