[ 
https://issues.apache.org/jira/browse/METRON-627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15826479#comment-15826479
 ] 

ASF GitHub Bot commented on METRON-627:
---------------------------------------

Github user dlyle65535 commented on the issue:

    https://github.com/apache/incubator-metron/pull/397
  
    I was able to duplicate the failures in my environment by doing the 
following (based on what Travis does).
    
    ```
    mkdir mike_test
    cd mike_test
    git clone --depth=50 https://github.com/apache/incubator-metron.git 
apache/incubator-metron
    cd apache/incubator-metron/
    git fetch origin +refs/pull/397/merge
    git checkout -qf FETCH_HEAD
    mvn -q integration-test install && build_utils/verify_licenses.sh
    ```
    
    It bombs out with: 
    ```
    
cardinality_gives_distinct_value_estimate_with_precisions_set(org.apache.metron.statistics.approximation.HyperLogLogPlusFunctionsIntegrationTest)
  Time elapsed: 0.004 sec  <<< ERROR!
    org.apache.metron.common.dsl.ParseException: Unable to parse 
HLLP_CARDINALITY(
      HLLP_ADD(
        HLLP_ADD(
          HLLP_INIT(5, 6),
          val1
        ),
        val2
      )
    )
    : Syntax error @ 8:3 no viable alternative at input 
'HLLP_CARDINALITY(HLLP_ADD(HLLP_ADD(HLLP_INIT(5,6),val1),val2)\n'
    ```


> Add HyperLogLogPlus implementation to Stellar
> ---------------------------------------------
>
>                 Key: METRON-627
>                 URL: https://issues.apache.org/jira/browse/METRON-627
>             Project: Metron
>          Issue Type: Improvement
>            Reporter: Michael Miklavcic
>            Assignee: Michael Miklavcic
>
> Calculating set cardinality can be a useful tool for a security analyst. For 
> instance, a large volume of non-unique src ip addresses hitting your network 
> may be an indication that you are currently under attack. There have been 
> many advancements in distinct value (DV) estimation over the years. We have 
> seen implementations evolve from K-Minimum-Values (KMV), to LogLog, to 
> HyperLogLog, and now to Google's much-improved HyperLogLogPlu algorithm. The 
> key improvements in this latest manifestation of the algorithm are:
> moves to a 64-bit hash
> handles sparse sets
> is more accurate with small cardinality
> This Jira tracks the effort to add a HyperLogLogPlus implementation to Metron.
> References:
> https://research.neustar.biz/2013/01/24/hyperloglog-googles-take-on-engineering-hll/
> http://static.googleusercontent.com/media/research.google.com/en//pubs/archive/40671.pdf



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to