[ https://issues.apache.org/jira/browse/FLINK-1284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Flink Jira Bot updated FLINK-1284:
----------------------------------
    Labels: stale-assigned stale-minor  (was: stale-minor)

> Uniform random sampling operator over windows
> ---------------------------------------------
>
>                 Key: FLINK-1284
>                 URL: https://issues.apache.org/jira/browse/FLINK-1284
>             Project: Flink
>          Issue Type: New Feature
>          Components: API / DataStream
>            Reporter: Paris Carbone
>            Assignee: Austin Ouyang
>            Priority: Minor
>              Labels: stale-assigned, stale-minor
>
> It would be useful for several use cases to have a built-in uniform random
> sampling operator in the streaming API that can operate on windows. This can
> be used for example for online machine learning operations, evaluating
> heuristics or continuous visualisation of representative values.
> The operator could be given a field and a number of random samples needed,
> following a window statement as such:
> mystream.window(..).sample(fieldID,#samples)
> Given that pre-aggregation is enabled, this could perhaps be implemented as a
> binary reduce operator or a combinable groupreduce that pre-aggregates the
> empiricals of that field.
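
For illustration only, the idea in the description could be sketched in plain Python as follows. This is not Flink code and not the operator proposed in the issue: the names PartialSample, add, merge and sample_window are hypothetical, and reservoir sampling (Algorithm R) is an assumed choice of technique that the issue itself does not name. The sketch shows one way a fixed-size uniform sample of a field could be kept per window, and how two partial samples could be combined by a binary reduce when pre-aggregation is used.

```python
import random
from dataclasses import dataclass, field
from typing import Any, List


@dataclass
class PartialSample:
    """Uniform sample of at most k values, plus the count of values seen."""
    k: int
    seen: int = 0
    values: List[Any] = field(default_factory=list)


def add(sample, value, rng):
    """Algorithm R step: after n values, each one is kept with probability k/n."""
    sample.seen += 1
    if len(sample.values) < sample.k:
        sample.values.append(value)
    else:
        j = rng.randrange(sample.seen)
        if j < sample.k:
            sample.values[j] = value
    return sample


def merge(a, b, rng):
    """Binary reduce of two partial samples (assumes a.k == b.k).

    Each output slot is drawn from one side with probability proportional
    to how many unseen-so-far values that side still represents.
    """
    left, right = list(a.values), list(b.values)
    n_left, n_right = a.seen, b.seen
    merged = []
    while len(merged) < a.k and n_left + n_right > 0:
        if rng.randrange(n_left + n_right) < n_left:
            merged.append(left.pop(rng.randrange(len(left))))
            n_left -= 1
        else:
            merged.append(right.pop(rng.randrange(len(right))))
            n_right -= 1
    return PartialSample(k=a.k, seen=a.seen + b.seen, values=merged)


def sample_window(window, field_id, num_samples, rng=None):
    """Stand-in for mystream.window(..).sample(fieldID, #samples) on one window."""
    if rng is None:
        rng = random.Random()
    partial = PartialSample(k=num_samples)
    for record in window:
        add(partial, record[field_id], rng)
    return partial
```

Under these assumptions, sample_window(records, "temp", 5) would return a PartialSample whose values hold up to five values of the hypothetical "temp" field, and merge(a, b, rng) would play the role of the binary reduce over pre-aggregated partial samples.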