[ 
https://issues.apache.org/jira/browse/FLINK-2283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14743686#comment-14743686
 ] 

Stephan Ewen commented on FLINK-2283:
-------------------------------------

Aggregating is a group operation, group operations need windows. That is one of 
the fundamental concepts in streaming, no?
Programs run forever (or at least years, which is infinitely for all practical 
matters). Any form of gathering state needs a clear specification when state is 
discarded, otherwise it is not practically usable. This "expiry" is exactly the 
window.

We have a user on the mailing list right now that is running a Flink 
installation and has a setup where continuously new keys arrive. The state 
grows infinitely, the program is unusable.

This particular case would need something like a "trigger each time, evict 
every hour" window or so. Why not treat it like that and tool around it to make 
it very easily usable (i.e., with pre-defined policies)?

> Make grouped reduce/fold/aggregations stateful using Partitioned state
> ----------------------------------------------------------------------
>
>                 Key: FLINK-2283
>                 URL: https://issues.apache.org/jira/browse/FLINK-2283
>             Project: Flink
>          Issue Type: Improvement
>          Components: Streaming
>    Affects Versions: 0.10
>            Reporter: Gyula Fora
>            Assignee: Márton Balassi
>            Priority: Minor
>
> Currently the inner state of the grouped aggregations are not persisted as an 
> operator state. 
> These operators should be reimplemented to use the newly introduced 
> partitioned state abstractions which will make them fault tolerant and 
> scalable for the future.
> A suggested implementation would be to use a stateful mapper to implement the 
> desired behaviour.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to