[GitHub] [druid] gianm opened a new pull request #11172: Add a way to retrieve UTF-8 bytes directly via DimensionDictionarySelector.

GitBox Tue, 27 Apr 2021 12:32:06 -0700


gianm opened a new pull request #11172:
URL: https://github.com/apache/druid/pull/11172



   The idea is that certain operations (like count distinct on strings) will
   be faster if they are able to run directly on UTF-8 bytes instead of on
   Java Strings decoded by "lookupName".
   
   I'm looking into modifying the cardinality and datasketch hll build
   aggregators to use lookupNameUtf8 instead of lookupName in a
   follow-on patch, and it is able to speed them up quite a bit.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [druid] gianm opened a new pull request #11172: Add a way to retrieve UTF-8 bytes directly via DimensionDictionarySelector.

Reply via email to