pdeva opened a new issue #7269: Replace Datasketch HLL with newer Datasketch CPC sketch URL: https://github.com/apache/incubator-druid/issues/7269 ### Motivation the datasketches library has a new Unique Counting Sketch called CPC sketch that has better accuracy per size than HLL. https://github.com/DataSketches/sketches-core/releases ### Proposed changes replace Datasketch HLL sketch with CPC or offer it alongside as a higher accuracy sketch ### Rationale Better accuracy ### Operational impact The docs of datasketches don't describe the whether CPC algorithm is more CPU intensive or not. This will determine whether we want to completely deprecate DataSketches HLL sketch and replace it with CPC or keep CPC as an additional option.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
