[
https://issues.apache.org/jira/browse/SPARK-18559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zhenhua Wang updated SPARK-18559:
---------------------------------
Description: In HyperLogLogPlusPlus, if the requested relative error is small
enough that p >= 19, THRESHOLDS(p-4) throws ArrayIndexOutOfBoundsException. We
should check p and, when p >= 19, fall back to the original HLL estimate and
its small range correction. (was: In HyperLogLogPlusPlus, THRESHOLDS,
RAW_ESTIMATE_DATA and BIAS_DATA all have the same length 15, and we index these
arrays with p-4, so we need to guarantee 0 <= p - 4 <= 14. Otherwise it will
cause ArrayIndexOutOfBoundsException.)
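
The index constraint can be illustrated with a small Python sketch. It is not
Spark's Scala code. The formula that derives p from relativeSD is an assumption
about how HyperLogLogPlusPlus computes it, and the names precision_for and
table_index are hypothetical helpers introduced only for this example. The
table length of 15 and the p-4 index come from the description.

```python
import math

TABLE_LENGTH = 15                        # length of THRESHOLDS, RAW_ESTIMATE_DATA, BIAS_DATA
MIN_P = 4                                # tables are indexed with p - 4
MAX_TABLE_P = MIN_P + TABLE_LENGTH - 1   # 18, the largest p the tables cover


def precision_for(relative_sd: float) -> int:
    """Assumed mapping from relativeSD to p (illustrative, not authoritative)."""
    return math.ceil(2.0 * math.log(1.106 / relative_sd) / math.log(2.0))


def table_index(p: int) -> int:
    """Return p - 4, refusing any p that would fall outside the tables."""
    if not (MIN_P <= p <= MAX_TABLE_P):
        raise IndexError(f"p={p} is outside [{MIN_P}, {MAX_TABLE_P}]")
    return p - MIN_P


if __name__ == "__main__":
    for sd in (0.05, 0.003, 0.002):
        p = precision_for(sd)
        print(sd, p, "in range" if p <= MAX_TABLE_P else "out of range")
```

Under the assumed formula, a relativeSD of 0.003 still maps to p = 18, while
0.002 maps to p = 19, which is the first value that overruns the tables.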
> Restrict the lower bound of relativeSD in HLL++
> -----------------------------------------------
>
> Key: SPARK-18559
> URL: https://issues.apache.org/jira/browse/SPARK-18559
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Reporter: Zhenhua Wang
>
> In HyperLogLogPlusPlus, if the requested relative error is small enough that
> p >= 19, THRESHOLDS(p-4) throws ArrayIndexOutOfBoundsException. We should
> check p and, when p >= 19, fall back to the original HLL estimate and its
> small range correction.
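
As a rough illustration of the proposed fallback, the following Python sketch
computes the classic HyperLogLog estimate with the small range correction
(linear counting) from the original HLL paper. It is a sketch under
assumptions, not the fix that was committed to Spark: the function names are
hypothetical, the registers are passed as a plain list, and the large range
correction of the original algorithm is left out.

```python
import math


def alpha(m: int) -> float:
    """Bias-correction constant from the original HyperLogLog algorithm."""
    if m == 16:
        return 0.673
    if m == 32:
        return 0.697
    if m == 64:
        return 0.709
    return 0.7213 / (1.0 + 1.079 / m)


def classic_hll_estimate(registers: list) -> float:
    """Raw HLL estimate, with linear counting applied in the small range."""
    m = len(registers)
    z_inverse = sum(2.0 ** -r for r in registers)   # sum of 2^(-M[j])
    raw = alpha(m) * m * m / z_inverse
    zeros = registers.count(0)                      # number of empty registers
    if raw <= 2.5 * m and zeros > 0:
        # Small range correction: fall back to linear counting.
        return m * math.log(m / zeros)
    return raw


if __name__ == "__main__":
    print(classic_hll_estimate([0] * 8 + [1] * 8))
```

This path needs no lookup tables, which is why it remains usable for values of
p that THRESHOLDS, RAW_ESTIMATE_DATA and BIAS_DATA do not cover.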