[ 
https://issues.apache.org/jira/browse/FLINK-15297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998777#comment-16998777
 ] 

Yang Wang commented on FLINK-15297:
-----------------------------------

[~tison] I'm not sure how this could happen. Does the Flink application on YARN 
finish so quickly that when `YarnClusterDescriptor` first gets the 
application report, the state is already "FINISHED"? Also, I don't think the 
application state could change from "ACCEPTED" to "FINISHED" within 250ms.

> Do not throw exception if YARN Application switched to FINISHED immediately 
> after deployed in YarnClusterDescriptor#startAppMaster
> ----------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-15297
>                 URL: https://issues.apache.org/jira/browse/FLINK-15297
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / YARN
>            Reporter: Zili Chen
>            Assignee: Zili Chen
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.11.0
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Currently we throw an exception in {{YarnClusterDescriptor#startAppMaster}} 
> if we first detect {{FINISHED}} before {{RUNNING}}. However, this can be a 
> legal state: the application may have finished normally right away.
> Right now we always try to connect to the Dispatcher, so it may be fine to 
> throw the exception a bit earlier (otherwise, connecting to a closed cluster 
> also throws an exception), but it is semantically wrong. Internally we have a 
> code path that only needs to report the ApplicationReport, and this behavior 
> causes trouble there.
> cc [~trohrmann] what do you think?
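
The behavior proposed in the description can be sketched roughly as below. This is a minimal, self-contained illustration and not the actual Flink code: `DeploymentStatePoll`, `AppState`, and `awaitDeployment` are hypothetical names, the enum only stands in for YARN's application states, and the iterator replaces the repeated application-report lookups (the real loop would sleep between polls).

```java
import java.util.Arrays;
import java.util.Iterator;

public class DeploymentStatePoll {

    // Hypothetical stand-in for YARN's application states (assumption, not the real enum).
    public enum AppState { ACCEPTED, RUNNING, FINISHED, FAILED, KILLED }

    /**
     * Consumes successive state reports until one settles the deployment.
     * RUNNING and FINISHED are both treated as a successful deployment;
     * FAILED and KILLED are treated as errors.
     */
    public static AppState awaitDeployment(Iterator<AppState> reports) {
        while (reports.hasNext()) {
            AppState state = reports.next();
            switch (state) {
                case FAILED:
                case KILLED:
                    throw new IllegalStateException("Deployment ended in state " + state);
                case RUNNING:
                case FINISHED:
                    // FINISHED before RUNNING is accepted: the job may simply have completed already.
                    return state;
                default:
                    // Still deploying (e.g. ACCEPTED); keep polling.
                    break;
            }
        }
        throw new IllegalStateException("No RUNNING or terminal state was reported");
    }

    public static void main(String[] args) {
        Iterator<AppState> reports =
                Arrays.asList(AppState.ACCEPTED, AppState.FINISHED).iterator();
        System.out.println(awaitDeployment(reports)); // prints "FINISHED"
    }
}
```

With this shape, a caller that only needs the final report can proceed on `FINISHED`, while a caller that wants to connect to the Dispatcher can still fail later when the connection attempt hits a closed cluster.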



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
