alamb commented on issue #18411: URL: https://github.com/apache/datafusion/issues/18411#issuecomment-3677762179
> > For small number of groups we can also use a faster hash map that is optimized for small number of keys > > Perhaps column statistics could be used to show the range of values is small, and switch to direct indexing instead (i.e. even faster than a hash table). If we know the range is small we could do that. I just don't think we have any statistics that can tell for sure that the range will be small. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
