[
https://issues.apache.org/jira/browse/FLINK-22453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17331846#comment-17331846
]
Yun Tang commented on FLINK-22453:
----------------------------------
We need to tell Flink which JobManager (JM) to connect to in order to stop the
job, so this should be expected behavior. Which older version did you use, and
did you ever enable high availability for previous jobs?
> Can not stop job when do not use "-m" option arg
> -------------------------------------------------
>
> Key: FLINK-22453
> URL: https://issues.apache.org/jira/browse/FLINK-22453
> Project: Flink
> Issue Type: Bug
> Components: API / Core
> Affects Versions: 1.12.2
> Reporter: Liu
> Priority: Minor
>
> flink version: 1.12.2
> yarn version : 3.1.1 (hdp 3.1.5)
> h3. Starting a Flink Session on YARN
> When I use 'flink stop xxxxxxxxxxx', the command line output is:
>
> =========================================================
>
> {noformat}
> Setting HBASE_CONF_DIR=/etc/hbase/conf because no HBASE_CONF_DIR was set.
> SLF4J: Class path contains multiple SLF4J bindings.
> SLF4J: Found binding in
> [jar:file:/data/flink-1.12.2/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in
> [jar:file:/usr/hdp/3.1.5.0-152/hadoop/lib/slf4j-log4j12-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an
> explanation.
> SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
> 2021-04-25 16:10:43,369 INFO org.apache.flink.yarn.cli.FlinkYarnSessionCli
> [] - Found Yarn properties file under
> /tmp/.yarn-properties-flink.
> 2021-04-25 16:10:43,369 INFO org.apache.flink.yarn.cli.FlinkYarnSessionCli
> [] - Found Yarn properties file under
> /tmp/.yarn-properties-flink.
> Suspending job "a81c0fe295871ef278a119cd44206216" with a savepoint.
> 2021-04-25 16:10:45,126 INFO org.apache.hadoop.yarn.client.AHSProxy
> [] - Connecting to Application History server at
> adt-bd-c1-nn03.internal/172.20.33.149:10200
> 2021-04-25 16:10:45,174 INFO org.apache.flink.yarn.YarnClusterDescriptor
> [] - No path for the flink jar passed. Using the location of
> class org.apache.flink.yarn.YarnClusterDescriptor to locate the jar
> 2021-04-25 16:10:45,520 INFO org.apache.flink.yarn.YarnClusterDescriptor
> [] - Found Web Interface adt-bd-c1-flink06.internal:43379 of
> application
> 'application_1618023905026_0005'.
> ------------------------------------------------------------
> The program finished with the following exception:
> org.apache.flink.util.FlinkException: Could not stop with a
> savepoint job "a81c0fe295871ef278a119cd44206216".
> at
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581)
> at
> org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
> at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569)
> at
> org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069)
> at
> org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:422)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
> at
> org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
> at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)
> Caused by: java.util.concurrent.ExecutionException:
> java.util.concurrent.TimeoutException
> at
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915)
> at
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579)
> ... 9 more
> Caused by: java.util.concurrent.TimeoutException
> at
> org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
> at
> org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
> at
> org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
> at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> {noformat}
>
> =========================================================
> Flink cannot stop the job,
> but when I use 'flink stop -m jobmanager.server.host:port xxxxx', it
> works well.
> '-m' is an optional argument. The old version does not have this problem.
>