leerho commented on issue #6814: [Discuss] Replacing hyperUnique as 'default' 
distinct count sketch
URL: 
https://github.com/apache/incubator-druid/issues/6814#issuecomment-454891552
 
 
   > HLL based counting functionality is a pretty standard thing in databases 
these days.
   
   True.  And this is why a number of database companies have adopted the 
DataSketches library into their core code for performing internal query 
optimization and other internal data analysis tasks.  Not only to use HLL or 
Theta for unique counting, but also to use the Quantiles sketches for 
understanding data distributions, and the Frequency sketches for capturing “Top 
N” data items that may require special handling because of their large number 
of occurrences.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to