[
https://issues.apache.org/jira/browse/METRON-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16289951#comment-16289951
]
ASF GitHub Bot commented on METRON-1350:
----------------------------------------
Github user cestella commented on a diff in the pull request:
https://github.com/apache/metron/pull/867#discussion_r156792227
--- Diff: metron-analytics/metron-statistics/README.md ---
@@ -53,6 +53,32 @@ functions can be used from everywhere where Stellar is
used.
* bounds - A list of value bounds (excluding min and max) in sorted
order.
* Returns: Which bin N the value falls in such that bound(N-1) < value <=
bound(N). No min and max bounds are provided, so values smaller than the 0'th
bound go in the 0'th bin, and values greater than the last bound go in the M'th
bin.
+### Sampling Functions
+
+#### `SAMPLE_ADD`
+* Description: Add a value or collection of values to a sampler.
+* Input:
--- End diff --
Sorry, `uniform` here is intended to mean that there's each element has
equal probability of being in the sample (e.g. the probability is pulled from a
[uniform probability
distribution](https://en.wikipedia.org/wiki/Uniform_distribution_(continuous))).
I can probably do a better job documenting.
> Add reservoir sampling functions to Stellar
> -------------------------------------------
>
> Key: METRON-1350
> URL: https://issues.apache.org/jira/browse/METRON-1350
> Project: Metron
> Issue Type: Improvement
> Reporter: Casey Stella
>
> Sampling capabilities would fit very well with the profiler and enable
> algorithms that do not necessarily support our existing probabilistic
> sketches. We should add a reservoir sampler and utilities to merge and
> resample.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)