xuanzih opened a new issue #5153: different result on fasthll and distinctcounthll URL: https://github.com/apache/incubator-pinot/issues/5153 hi guys, we are trying to switch from fasthll to distinctcounthll. `com.clearspring.analytics.stream.cardinality.HyperLogLog;` is used in code and `org.apache.pinot.core.startree.hll.HllUtil` to serialize the hll to a string. with the same condition we have 1000x difference. Example: ``` SELECT fasthll(my_hll), distinctcounthll(my_hll) FROM counts_table WHERE timestamp >= 1500768000 ``` I get results: ``` "aggregationResults": [ { "function": "fastHLL_my_hll", "value": "68685244" }, { "function": "distinctCountHLL_my_hll", "value": "50535" }] ``` Could anyone suggest what's the big difference between them?
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
