[ 
https://issues.apache.org/jira/browse/BEAM-6077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16695713#comment-16695713
 ] 

Jozef Vilcek commented on BEAM-6077:
------------------------------------

Ehm, sorry for mixed up names :)

I have a git of a headache to wrap my head around the behavior in 
UndoundedSourceWrapper. Important methods seems to be initializeState(), 
snapshotState() and open(). Am I correct assuming that snapshotState() should 
add to the ListState partitions that current running instance governs and 
during the initalizeState() and open(), UnboundedSourceWrapper have access to 
all ListState elements from all parallel unbounded sources, so it can pick 
other partitions that they were not assigned before. If this is true, it is a 
feature of a ListState, because of some global Key? 

Anyway, if yes then doing splitting according to MAX_PARALLELISM sounds like a 
great idea. Did not notice at fist List `localSplitSources`. I will try it out.

> Make UnboundedSource state rescale friendly
> -------------------------------------------
>
>                 Key: BEAM-6077
>                 URL: https://issues.apache.org/jira/browse/BEAM-6077
>             Project: Beam
>          Issue Type: Improvement
>          Components: runner-flink
>    Affects Versions: 2.8.0
>            Reporter: Jozef Vilcek
>            Assignee: Jozef Vilcek
>            Priority: Major
>
> FlinkRunner's UnboundedSourceWrapper currently does not rescale well it's 
> state when job changes parallelism.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to