[ https://issues.apache.org/jira/browse/FLINK-22453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331846#comment-17331846 ]
Yun Tang commented on FLINK-22453:
----------------------------------

We need to tell Flink which JobManager (JM) to connect to in order to stop the job, so this should be expected behavior. Which older version did you use, and had you ever enabled high availability for previous jobs?

> Can not stop job when do not use "-m" option arg
> -------------------------------------------------
>
>                 Key: FLINK-22453
>                 URL: https://issues.apache.org/jira/browse/FLINK-22453
>             Project: Flink
>          Issue Type: Bug
>          Components: API / Core
>    Affects Versions: 1.12.2
>            Reporter: Liu
>            Priority: Minor
>
> flink version: 1.12.2
> yarn version : 3.1.1 (hdp 3.1.5)
> h3. Starting a Flink Session on YARN
> when I use 'flink stop xxxxxxxxxxx', the command line output is:
>
> =========================================================
>
> {noformat}
> Setting HBASE_CONF_DIR=/etc/hbase/conf because no HBASE_CONF_DIR was set.
> SLF4J: Class path contains multiple SLF4J bindings.
> SLF4J: Found binding in [jar:file:/data/flink-1.12.2/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in [jar:file:/usr/hdp/3.1.5.0-152/hadoop/lib/slf4j-log4j12-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
> SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
> 2021-04-25 16:10:43,369 INFO org.apache.flink.yarn.cli.FlinkYarnSessionCli [] - Found Yarn properties file under /tmp/.yarn-properties-flink.
> 2021-04-25 16:10:43,369 INFO org.apache.flink.yarn.cli.FlinkYarnSessionCli [] - Found Yarn properties file under /tmp/.yarn-properties-flink.
> Suspending job "a81c0fe295871ef278a119cd44206216" with a savepoint.
> 2021-04-25 16:10:45,126 INFO org.apache.hadoop.yarn.client.AHSProxy [] - Connecting to Application History server at adt-bd-c1-nn03.internal/172.20.33.149:10200
> 2021-04-25 16:10:45,174 INFO org.apache.flink.yarn.YarnClusterDescriptor [] - No path for the flink jar passed. Using the location of class org.apache.flink.yarn.YarnClusterDescriptor to locate the jar
> 2021-04-25 16:10:45,520 INFO org.apache.flink.yarn.YarnClusterDescriptor [] - Found Web Interface adt-bd-c1-flink06.internal:43379 of application 'application_1618023905026_0005'.
> ------------------------------------------------------------
>  The program finished with the following exception:
>
> org.apache.flink.util.FlinkException: Could not stop with a savepoint job "a81c0fe295871ef278a119cd44206216".
>     at org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581)
>     at org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
>     at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569)
>     at org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069)
>     at org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:422)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
>     at org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
>     at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)
> Caused by: java.util.concurrent.ExecutionException: java.util.concurrent.TimeoutException
>     at java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
>     at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915)
>     at org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579)
>     ... 9 more
> Caused by: java.util.concurrent.TimeoutException
>     at org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
>     at org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
>     at org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
>     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
>     at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>     at java.lang.Thread.run(Thread.java:748)
> {noformat}
>
> =========================================================
> flink can't stop the job, but when I use 'flink stop -m jobmanager.server.host:port xxxxx', it works well.
> '-m' is an optional argument. The old version does not have this problem.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)