[
https://issues.apache.org/jira/browse/METRON-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16289926#comment-16289926
]
ASF GitHub Bot commented on METRON-1350:
----------------------------------------
Github user simonellistonball commented on the issue:
https://github.com/apache/metron/pull/867
Should the size limit on the sample really be a hard cutoff? In a likely usage
scenario, a user would sample over a window in a profile. Cutting off at the
limit is likely to skew the sample toward the beginning of the window rather
than being genuinely uniform. Would a random replacement strategy make more
sense once the sample is over the limit? This could be a lot heavier in terms
of performance, but may be more mathematically sound.
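
To illustrate the difference between the two strategies, here is a minimal
Python sketch of the classic reservoir sampling scheme (often called
Algorithm R) next to a hard cutoff. This is not the code in the pull request,
and the names ReservoirSampler and truncating_sample are hypothetical. Once
the reservoir is full, each new value costs one random draw and at most one
replacement, so the overhead may turn out to be modest.

```python
import random


class ReservoirSampler:
    """Hypothetical illustration only; not the Stellar/Metron implementation."""

    def __init__(self, capacity, rng=None):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self.seen = 0          # total number of values offered so far
        self.sample = []       # current reservoir contents
        self.rng = rng or random.Random()

    def add(self, value):
        self.seen += 1
        if len(self.sample) < self.capacity:
            # Still under the limit: keep everything.
            self.sample.append(value)
            return
        # Over the limit: draw an index in [0, seen). The new value replaces
        # a stored one only if the index falls inside the reservoir, which
        # happens with probability capacity / seen.
        j = self.rng.randrange(self.seen)
        if j < self.capacity:
            self.sample[j] = value


def truncating_sample(values, capacity):
    """Hard cutoff: keep only the first `capacity` values."""
    return list(values)[:capacity]


# Values stand in for timestamps within a profile window.
sampler = ReservoirSampler(capacity=10, rng=random.Random(42))
for t in range(1000):
    sampler.add(t)

cutoff = truncating_sample(range(1000), 10)
```

With the hard cutoff, only the first ten values of the window are ever
retained. With random replacement, every value in the window has the same
chance of ending up in the final sample.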
> Add reservoir sampling functions to Stellar
> -------------------------------------------
>
> Key: METRON-1350
> URL: https://issues.apache.org/jira/browse/METRON-1350
> Project: Metron
> Issue Type: Improvement
> Reporter: Casey Stella
>
> Sampling capabilities would fit very well with the profiler and enable
> algorithms that do not necessarily support our existing probabilistic
> sketches. We should add a reservoir sampler and utilities to merge and
> resample.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)