[ https://issues.apache.org/jira/browse/FLINK-1284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17321778#comment-17321778 ]
Flink Jira Bot commented on FLINK-1284:
---------------------------------------
This issue and all of its sub-tasks have not been updated for 180 days, so it
has been labeled "stale-minor". If you are still affected by this bug or are
still interested in this issue, please post an update and remove the label. The
issue will be closed automatically in 7 days.
> Uniform random sampling operator over windows
> ---------------------------------------------
>
> Key: FLINK-1284
> URL: https://issues.apache.org/jira/browse/FLINK-1284
> Project: Flink
> Issue Type: New Feature
> Components: API / DataStream
> Reporter: Paris Carbone
> Assignee: Austin Ouyang
> Priority: Minor
> Labels: stale-minor
>
> It would be useful for several use cases to have a built-in uniform random
> sampling operator in the streaming API that can operate on windows. This can
> be used for example for online machine learning operations, evaluating
> heuristics or continuous visualisation of representative values.
> The operator could be given a field and the number of random samples needed,
> and would be applied after a window statement, for example:
> mystream.window(..).sample(fieldID,#samples)
> Given that pre-aggregation is enabled, this could perhaps be implemented as a
> binary reduce operator or a combinable groupreduce that pre-aggregates the
> empiricals of that field.
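
One possible reading of the proposal is reservoir sampling per window, plus a merge step so that partial samples can be combined by a binary reduce. The sketch below assumes that reading; it is plain Java with no Flink dependency, and `WindowSample`, `add`, and `merge` are hypothetical names that are not part of the Flink API. The issue itself does not prescribe this algorithm.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

/** Hypothetical helper (not a Flink class): a fixed-size uniform sample that can be merged pairwise. */
final class WindowSample<T> {
    private final int capacity;               // number of samples requested
    private final Random rng;
    private final List<T> items = new ArrayList<>();
    private long seen = 0;                    // elements observed so far

    WindowSample(int capacity, Random rng) {
        if (capacity <= 0) throw new IllegalArgumentException("capacity must be positive");
        this.capacity = capacity;
        this.rng = rng;
    }

    /** Reservoir sampling (Algorithm R): after n adds, each element is kept with probability capacity/n. */
    void add(T value) {
        seen++;
        if (items.size() < capacity) {
            items.add(value);
        } else {
            long j = (long) (rng.nextDouble() * seen);   // uniform index in [0, seen)
            if (j < capacity) items.set((int) j, value);
        }
    }

    /**
     * Combines two samples taken over disjoint parts of a window.
     * Each output slot is drawn from a side in proportion to how many
     * elements that side still represents, without replacement.
     */
    static <T> WindowSample<T> merge(WindowSample<T> a, WindowSample<T> b, Random rng) {
        if (a.capacity != b.capacity) throw new IllegalArgumentException("capacities must match");
        WindowSample<T> out = new WindowSample<>(a.capacity, rng);
        List<T> fromA = new ArrayList<>(a.items);
        List<T> fromB = new ArrayList<>(b.items);
        long remA = a.seen;
        long remB = b.seen;
        while (out.items.size() < out.capacity && remA + remB > 0) {
            boolean takeA = rng.nextDouble() * (remA + remB) < remA;
            List<T> src = takeA ? fromA : fromB;
            out.items.add(src.remove(rng.nextInt(src.size())));
            if (takeA) remA--; else remB--;
        }
        out.seen = a.seen + b.seen;
        return out;
    }

    List<T> items() { return new ArrayList<>(items); }

    long seen() { return seen; }
}
```

In a windowed job, each pre-aggregator would call `add` once per element and the reduce step would call `merge` on pairs of partial samples. Keeping the `seen` count alongside the sample is what lets the merge stay uniform when the two sides cover different numbers of elements.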