[ 
https://issues.apache.org/jira/browse/FLINK-22453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331846#comment-17331846
 ] 

Yun Tang commented on FLINK-22453:
----------------------------------

We should tell Flink which JM to connect to stop the job, which should be 
expected behavior. What was the old version you used and had you ever enabled 
high-availability for previous jobs?

> Can not stop job when  do not use "-m" option arg
> -------------------------------------------------
>
>                 Key: FLINK-22453
>                 URL: https://issues.apache.org/jira/browse/FLINK-22453
>             Project: Flink
>          Issue Type: Bug
>          Components: API / Core
>    Affects Versions: 1.12.2
>            Reporter: Liu
>            Priority: Minor
>
> flink version: 1.12.2
> yarn version : 3.1.1 (hdp 3.1.5)
> h3. Starting a Flink Session on YARN
> when i use ' flink stop xxxxxxxxxxx',   comond line output:
>  
> =========================================================
>  
> {noformat}
> Setting HBASE_CONF_DIR=/etc/hbase/conf because no HBASE_CONF_DIR was set.
> SLF4J: Class path contains multiple SLF4J bindings.
> SLF4J: Found binding in 
> [jar:file:/data/flink-1.12.2/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in 
> [jar:file:/usr/hdp/3.1.5.0-152/hadoop/lib/slf4j-log4j12-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
> explanation.
> SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
> 2021-04-25 16:10:43,369 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli   
>              [] - Found Yarn properties file under 
> /tmp/.yarn-properties-flink.
> 2021-04-25 16:10:43,369 INFO  org.apache.flink.yarn.cli.FlinkYarnSessionCli   
>              [] - Found Yarn properties file under 
> /tmp/.yarn-properties-flink.
> Suspending job "a81c0fe295871ef278a119cd44206216" with a savepoint.
> 2021-04-25 16:10:45,126 INFO  org.apache.hadoop.yarn.client.AHSProxy          
>              [] - Connecting to Application History server at 
> adt-bd-c1-nn03.internal/172.20.33.149:10200
> 2021-04-25 16:10:45,174 INFO  org.apache.flink.yarn.YarnClusterDescriptor     
>              [] - No path for the flink jar passed. Using the location of 
> class org.apache.flink.yarn.YarnClusterDescriptor to locate the jar
> 2021-04-25 16:10:45,520 INFO  org.apache.flink.yarn.YarnClusterDescriptor     
>              [] - Found Web Interface adt-bd-c1-flink06.internal:43379 of 
> application 
> 'application_1618023905026_0005'.------------------------------------------------------------
> The program finished with the following 
> exception:org.apache.flink.util.FlinkException: Could not stop with a 
> savepoint job "a81c0fe295871ef278a119cd44206216".
>     at 
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581)
>     at 
> org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
>     at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569)
>     at 
> org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069)
>     at 
> org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:422)
>     at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
>     at 
> org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
>     at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)
> Caused by: java.util.concurrent.ExecutionException: 
> java.util.concurrent.TimeoutException
>     at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
>     at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915)
>     at 
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579)
>     ... 9 more
> Caused by: java.util.concurrent.TimeoutException
>     at 
> org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
>     at 
> org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
>     at 
> org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
>     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
>     at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
>     at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>     at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>     at java.lang.Thread.run(Thread.java:748){noformat}
>  
> =========================================================
>  flink can't stop the job,  
> but when i user  'flink stop -m jobmanager.server.host:port    xxxxx'  ,  it 
> work well.
> '-m' is an option args,  The old version does not have this problem
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to