[
https://issues.apache.org/jira/browse/HIVEMALL-18?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Makoto Yui updated HIVEMALL-18:
-------------------------------
Fix Version/s: 0.5.0
> Support approx_count UDAF using HyperLogLog
> -------------------------------------------
>
> Key: HIVEMALL-18
> URL: https://issues.apache.org/jira/browse/HIVEMALL-18
> Project: Hivemall
> Issue Type: Sub-task
> Reporter: Makoto Yui
> Assignee: Makoto Yui
> Priority: Minor
> Fix For: 0.5.0
>
>
> https://github.com/addthis/stream-lib could be used for underlying library.
> http://www.slideshare.net/bzamecnik/hyperloglog-in-hive-how-to-count-sheep-efficiently
> https://databricks.com/blog/2016/05/19/approximate-algorithms-in-apache-spark-hyperloglog-and-quantiles.html
> There exist several HLL implementations as Hive UDAF.
> https://github.com/MLnick/hive-udf/wiki
> https://github.com/klout/brickhouse
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)