[ 
https://issues.apache.org/jira/browse/SPARK-18559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhenhua Wang updated SPARK-18559:
---------------------------------
    Description: In HyperLogLogPlusPlus, if the relative error is so small that 
p >= 19, it will cause ArrayIndexOutOfBoundsException in THRESHOLDS(p-4) . We 
should check p and when p >= 19, regress to the original HLL result and use the 
small range correction they use.  (was: In HyperLogLogPlusPlus, THRESHOLDS, 
RAW_ESTIMATE_DATA and BIAS_DATA all have the same length 15, and we probe these 
arrays by p-4, so we need to guarantee 0 <= p - 4 <= 14. Otherwise it will 
cause ArrayIndexOutOfBoundsException.)

> Restrict the lower bound of relativeSD in HLL++
> -----------------------------------------------
>
>                 Key: SPARK-18559
>                 URL: https://issues.apache.org/jira/browse/SPARK-18559
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>            Reporter: Zhenhua Wang
>
> In HyperLogLogPlusPlus, if the relative error is so small that p >= 19, it 
> will cause ArrayIndexOutOfBoundsException in THRESHOLDS(p-4) . We should 
> check p and when p >= 19, regress to the original HLL result and use the 
> small range correction they use.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to