js8544 commented on PR #38394: URL: https://github.com/apache/arrow/pull/38394#issuecomment-1774562777
It was mentioned in https://github.com/apache/arrow/pull/9683#issuecomment-800442398 that we can compute the result of each chunk and then merge them with `hash_sum`. However, since the hash aggregate functions have been moved to Acero, it is less than ideal to have compute kernels depend on Acero, because it sits a level higher in the dependency tree. This PR saves calls to the dictionary unifier. We can optimize further by speeding up the unification process itself, which will be done once we have a faster hash table: #38372.
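For readers unfamiliar with the merge-by-sum idea referenced in the linked comment, here is a minimal sketch in plain Python, independent of Arrow. It is only an illustration and not the implementation in this PR: the helper names `count_per_chunk` and `merge_counts` are hypothetical, and `collections.Counter` stands in both for the per-chunk kernel result and for the grouped sum that `hash_sum` would perform.

```python
from collections import Counter


def count_per_chunk(chunks):
    """Count values in each chunk independently (hypothetical helper)."""
    return [Counter(chunk) for chunk in chunks]


def merge_counts(partials):
    """Merge per-chunk counts by summing the counts that share a key.

    This plays the role that a grouped sum such as hash_sum would play.
    """
    merged = Counter()
    for partial in partials:
        merged.update(partial)  # Counter.update adds counts, it does not overwrite
    return merged


# Three chunks of a chunked column (made-up illustrative data).
chunks = [["a", "b", "a"], ["b", "c"], ["a"]]
result = merge_counts(count_per_chunk(chunks))
print(dict(result))  # {'a': 3, 'b': 2, 'c': 1}
```

The merge step is where the dependency question arises: in Arrow the grouped sum lives with the hash aggregate functions, which is why reusing it from a compute kernel would pull in a higher layer.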
