xuanzih opened a new issue #5153: different result on fasthll and 
distinctcounthll
URL: https://github.com/apache/incubator-pinot/issues/5153
 
 
   hi guys, we are trying to switch from fasthll to distinctcounthll.
   `com.clearspring.analytics.stream.cardinality.HyperLogLog;` is used in code 
and `org.apache.pinot.core.startree.hll.HllUtil` to serialize the hll to a 
string.
   with the same condition we have 1000x difference.
   Example:
   ```
   SELECT fasthll(my_hll), distinctcounthll(my_hll)
   FROM counts_table WHERE timestamp >= 1500768000
   ```
   I get results:
   ```
   "aggregationResults": [
       {
           "function": "fastHLL_my_hll",
           "value": "68685244"
       }, {
           "function": "distinctCountHLL_my_hll",
           "value": "50535"
       }]
   ```
   Could anyone suggest what's the big difference between them?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to