gianm opened a new pull request #11172: URL: https://github.com/apache/druid/pull/11172
The idea is that certain operations (like count distinct on strings) will be faster if they are able to run directly on UTF-8 bytes instead of on Java Strings decoded by "lookupName". I'm looking into modifying the cardinality and datasketch hll build aggregators to use lookupNameUtf8 instead of lookupName in a follow-on patch, and it is able to speed them up quite a bit. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
