leerho commented on issue #6814: [Discuss] Replacing hyperUnique as 'default' 
distinct count sketch
URL: 
https://github.com/apache/incubator-druid/issues/6814#issuecomment-454880798
 
 
   > The reason we haven't switched over to Theta Sketches internally is that 
for this specific case the devil you know can be better than the devil you 
don't. We have a lot of experience with the segment size impact and 
computational impact of the Druid HLL sketch (capacity and resource planning).
   
   Clarification:  Switching from the Druid-HLL sketch to the 
DataSketches-Theta sketch family makes no sense.  The `Theta` family of 
sketches are much larger sketches and provide set intersection functionality 
that the HLL sketches do not.  What I have been proposing is to switch over to 
the DataSketches-HLL sketch; which has comparable space utilization but with 
proven accuracy and speed advantages over the Druid-HLL sketch.  I think that 
this is what you meant to say.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to