[
https://issues.apache.org/jira/browse/METRON-637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15768509#comment-15768509
]
ASF GitHub Bot commented on METRON-637:
---------------------------------------
Github user cestella commented on the issue:
https://github.com/apache/incubator-metron/pull/401
Thanks for the feedback, @mattf-horton! I went ahead and incorporated your
changes. For posterity and so this gets replicated to the JIRA, I added a
`BIN` function that just takes a list of bounds, not a list of percentiles to
compute. `STATS_BIN` could be encoded with `BIN` if we had a `MAP` function
(i.e. `STATS_BIN` == `BIN( value, MAP( &STATS_GET_PERCENTILE(stats, _ ), [
25.0, 50.0, 75.0])` where `MAP` takes a function pointer and applies it to a
collection).
We do not have this capability yet in Stellar, but it may be worth
considering to enable these kinds of use-cases. Just a thought.
> Add a STATS_BIN function to Stellar.
> ------------------------------------
>
> Key: METRON-637
> URL: https://issues.apache.org/jira/browse/METRON-637
> Project: Metron
> Issue Type: Improvement
> Reporter: Casey Stella
> Original Estimate: 48h
> Remaining Estimate: 48h
>
> When passing parameters to models, it's often useful to pass the binned
> representation of a variable based on an empirical statistical distribution,
> rather than the actual variable. This function should accept a set of
> percentile bins and a statistical sketch and a value. It should return the
> index where the percentile of the value falls.
> For instance, consider the value 17 who is percentile 27. If we use 25, 75,
> 95 to define our bins, this function would return 1, because its percentile,
> 27, is between 25 and 75.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)