pdeva opened a new issue #7269: Replace Datasketch HLL with newer Datasketch 
CPC sketch
URL: https://github.com/apache/incubator-druid/issues/7269
 
 
   ### Motivation
   
   the datasketches library has a new  Unique Counting Sketch called CPC sketch 
that has  better accuracy per size than HLL.
   
   https://github.com/DataSketches/sketches-core/releases
   
   ### Proposed changes
   
   replace  Datasketch HLL sketch with CPC or offer it alongside as a higher 
accuracy sketch
   
   ### Rationale
   
   Better accuracy
   
   ### Operational impact
   
   The docs of datasketches don't describe the whether CPC algorithm is more 
CPU intensive or not. This will determine whether we want to completely 
deprecate DataSketches HLL sketch and replace it with CPC or keep CPC as an 
additional option.
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to