[ 
https://issues.apache.org/jira/browse/FLINK-19358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17450518#comment-17450518
 ] 

Till Rohrmann commented on FLINK-19358:
---------------------------------------

Ah ok, I think I now understand the idea. Under the assumption that there is 
only a single job, we don't have to match on the {{JobID}} but can simply check 
whether there is a job that is recovered. In this case, the solution probably 
works.

What I am wondering is how this will fit together with 
[FLIP-194|https://cwiki.apache.org/confluence/x/Y4yqCw] that will introduce a 
{{JobResultStore}}. With this component, it could become possible to support HA 
for multi-job applications. With the {{JobResultStore}}, and assuming 
determinism of the {{main()}} function and deterministic {{JobIDs}}, I think it 
should be possible to support HA.
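
To illustrate what "deterministic {{JobIDs}}" could mean here, below is a minimal sketch. It is not Flink's implementation: the class name {{DeterministicJobIds}}, the method {{jobIdFor}}, and the idea of hashing an application id together with the position of the job in {{main()}} are all assumptions made for illustration. The point is only that a re-executed, deterministic {{main()}} would arrive at the same id for its n-th job, while different jobs get different ids.

```java
import java.nio.charset.StandardCharsets;
import java.util.UUID;

// Hypothetical helper, not part of Flink.
public class DeterministicJobIds {

    // Derives a 32-character hex id from the application id and the
    // index of the job within main(). Same inputs always give the same id.
    static String jobIdFor(String applicationId, int jobIndex) {
        String name = applicationId + "#" + jobIndex;
        // Name-based (type 3) UUID: a pure function of the input bytes.
        UUID uuid = UUID.nameUUIDFromBytes(name.getBytes(StandardCharsets.UTF_8));
        return String.format(
                "%016x%016x",
                uuid.getMostSignificantBits(),
                uuid.getLeastSignificantBits());
    }

    public static void main(String[] args) {
        // "my-application" stands in for whatever identifies the cluster or application.
        System.out.println(jobIdFor("my-application", 0));
        System.out.println(jobIdFor("my-application", 1));
    }
}
```

With such a scheme, a recovered application could look up each derived id in a result store and skip jobs that have already finished, which is the part that a fixed zero {{JobID}} or a random {{JobID}} cannot offer for more than one job.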

If we now introduce a special case, then this might entail a breaking behaviour 
change (zero {{JobID}} -> random {{JobID}} -> deterministic {{JobID}}).

A related question is how we explain the limitation of a single job per 
application to users who want to use HA in the long term.

> when submit job on application mode with HA,the jobid will be 0000000000
> ------------------------------------------------------------------------
>
>                 Key: FLINK-19358
>                 URL: https://issues.apache.org/jira/browse/FLINK-19358
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Coordination
>    Affects Versions: 1.11.0
>            Reporter: Jun Zhang
>            Priority: Minor
>              Labels: auto-deprioritized-major, usability
>
> When submitting a Flink job in application mode with HA, the Flink job id will be 
> 00000000000000000000000000000000. When I have many jobs, they all have the same 
> job id, which leads to a checkpoint error.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
