[
https://issues.apache.org/jira/browse/FLINK-22453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Liu updated FLINK-22453:
------------------------
Description:
flink version: 1.12.2
yarn version : 3.1.1 (hdp 3.1.5)
h3. Starting a Flink Session on YARN
when i use ' flink stop xxxxxxxxxxx', comond line output:
{panel:title=我的标题}
Setting HBASE_CONF_DIR=/etc/hbase/conf because no HBASE_CONF_DIR was set.SLF4J:
Class path contains multiple SLF4J bindings.SLF4J: Found binding in
[jar:file:/data/flink-1.12.2/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]SLF4J:
Found binding in
[jar:file:/usr/hdp/3.1.5.0-152/hadoop/lib/slf4j-log4j12-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]SLF4J:
See http://www.slf4j.org/codes.html#multiple_bindings for an
explanation.SLF4J: Actual binding is of type
[org.apache.logging.slf4j.Log4jLoggerFactory]2021-04-25 16:10:43,369 INFO
org.apache.flink.yarn.cli.FlinkYarnSessionCli [] - Found Yarn
properties file under /tmp/.yarn-properties-flink.2021-04-25 16:10:43,369 INFO
org.apache.flink.yarn.cli.FlinkYarnSessionCli [] - Found Yarn
properties file under /tmp/.yarn-properties-flink.Suspending job
"a81c0fe295871ef278a119cd44206216" with a savepoint.2021-04-25 16:10:45,126
INFO org.apache.hadoop.yarn.client.AHSProxy [] -
Connecting to Application History server at
adt-bd-c1-nn03.internal/172.20.33.149:102002021-04-25 16:10:45,174 INFO
org.apache.flink.yarn.YarnClusterDescriptor [] - No path for
the flink jar passed. Using the location of class
org.apache.flink.yarn.YarnClusterDescriptor to locate the jar2021-04-25
16:10:45,520 INFO org.apache.flink.yarn.YarnClusterDescriptor
[] - Found Web Interface adt-bd-c1-flink06.internal:43379 of application
'application_1618023905026_0005'.
------------------------------------------------------------The program
finished with the following exception:
org.apache.flink.util.FlinkException: Could not stop with a savepoint job
"a81c0fe295871ef278a119cd44206216". at
org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581) at
org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569) at
org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069) at
org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
at java.security.AccessController.doPrivileged(Native Method) at
javax.security.auth.Subject.doAs(Subject.java:422) at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
at
org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)Caused
by: java.util.concurrent.ExecutionException:
java.util.concurrent.TimeoutException at
java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357) at
java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915) at
org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579) ...
9 moreCaused by: java.util.concurrent.TimeoutException at
org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
at
org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
at
org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at
java.util.concurrent.FutureTask.run(FutureTask.java:266) at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
{panel}
flink can't stop the job,
but when i user 'flink stop -m jobmanager.server.host:port xxxxx' , it
work well.
'-m' is an option args, The old version does not have this problem
was:
flink version: 1.12.2
yarn version : 3.1.1 (hdp 3.1.5)
h3. Starting a Flink Session on YARN
when i use ' flink stop xxxxxxxxxxx', comond line output:
{code:java}
//代码占位符
Setting HBASE_CONF_DIR=/etc/hbase/conf because no HBASE_CONF_DIR was set.SLF4J:
Class path contains multiple SLF4J bindings.SLF4J: Found binding in
[jar:file:/data/flink-1.12.2/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]SLF4J:
Found binding in
[jar:file:/usr/hdp/3.1.5.0-152/hadoop/lib/slf4j-log4j12-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]SLF4J:
See http://www.slf4j.org/codes.html#multiple_bindings for an
explanation.SLF4J: Actual binding is of type
[org.apache.logging.slf4j.Log4jLoggerFactory]2021-04-25 16:10:43,369 INFO
org.apache.flink.yarn.cli.FlinkYarnSessionCli [] - Found Yarn
properties file under /tmp/.yarn-properties-flink.2021-04-25 16:10:43,369 INFO
org.apache.flink.yarn.cli.FlinkYarnSessionCli [] - Found Yarn
properties file under /tmp/.yarn-properties-flink.Suspending job
"a81c0fe295871ef278a119cd44206216" with a savepoint.2021-04-25 16:10:45,126
INFO org.apache.hadoop.yarn.client.AHSProxy [] -
Connecting to Application History server at
adt-bd-c1-nn03.internal/172.20.33.149:102002021-04-25 16:10:45,174 INFO
org.apache.flink.yarn.YarnClusterDescriptor [] - No path for
the flink jar passed. Using the location of class
org.apache.flink.yarn.YarnClusterDescriptor to locate the jar2021-04-25
16:10:45,520 INFO org.apache.flink.yarn.YarnClusterDescriptor
[] - Found Web Interface adt-bd-c1-flink06.internal:43379 of application
'application_1618023905026_0005'.
------------------------------------------------------------ The program
finished with the following exception:
org.apache.flink.util.FlinkException: Could not stop with a savepoint job
"a81c0fe295871ef278a119cd44206216". at
org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581) at
org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569) at
org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069) at
org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
at java.security.AccessController.doPrivileged(Native Method) at
javax.security.auth.Subject.doAs(Subject.java:422) at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
at
org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)Caused
by: java.util.concurrent.ExecutionException:
java.util.concurrent.TimeoutException at
java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357) at
java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915) at
org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579) ...
9 moreCaused by: java.util.concurrent.TimeoutException at
org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
at
org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
at
org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at
java.util.concurrent.FutureTask.run(FutureTask.java:266) at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
{code}
flink can't stop the job,
but when i user 'flink stop -m jobmanager.server.host:port xxxxx' , it
work well.
'-m' is an option args, The old version does not have this problem
> Can not stop job when not ust "-m" option
> -----------------------------------------
>
> Key: FLINK-22453
> URL: https://issues.apache.org/jira/browse/FLINK-22453
> Project: Flink
> Issue Type: Bug
> Components: API / Core
> Affects Versions: 1.12.2
> Reporter: Liu
> Priority: Minor
>
> flink version: 1.12.2
> yarn version : 3.1.1 (hdp 3.1.5)
> h3. Starting a Flink Session on YARN
> when i use ' flink stop xxxxxxxxxxx', comond line output:
> {panel:title=我的标题}
>
> Setting HBASE_CONF_DIR=/etc/hbase/conf because no HBASE_CONF_DIR was
> set.SLF4J: Class path contains multiple SLF4J bindings.SLF4J: Found binding
> in
> [jar:file:/data/flink-1.12.2/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]SLF4J:
> Found binding in
> [jar:file:/usr/hdp/3.1.5.0-152/hadoop/lib/slf4j-log4j12-1.7.25.jar!/org/slf4j/impl/StaticLoggerBinder.class]SLF4J:
> See http://www.slf4j.org/codes.html#multiple_bindings for an
> explanation.SLF4J: Actual binding is of type
> [org.apache.logging.slf4j.Log4jLoggerFactory]2021-04-25 16:10:43,369 INFO
> org.apache.flink.yarn.cli.FlinkYarnSessionCli [] - Found Yarn
> properties file under /tmp/.yarn-properties-flink.2021-04-25 16:10:43,369
> INFO org.apache.flink.yarn.cli.FlinkYarnSessionCli [] - Found
> Yarn properties file under /tmp/.yarn-properties-flink.Suspending job
> "a81c0fe295871ef278a119cd44206216" with a savepoint.2021-04-25 16:10:45,126
> INFO org.apache.hadoop.yarn.client.AHSProxy [] -
> Connecting to Application History server at
> adt-bd-c1-nn03.internal/172.20.33.149:102002021-04-25 16:10:45,174 INFO
> org.apache.flink.yarn.YarnClusterDescriptor [] - No path for
> the flink jar passed. Using the location of class
> org.apache.flink.yarn.YarnClusterDescriptor to locate the jar2021-04-25
> 16:10:45,520 INFO org.apache.flink.yarn.YarnClusterDescriptor
> [] - Found Web Interface adt-bd-c1-flink06.internal:43379 of application
> 'application_1618023905026_0005'.
> ------------------------------------------------------------The program
> finished with the following exception:
> org.apache.flink.util.FlinkException: Could not stop with a savepoint job
> "a81c0fe295871ef278a119cd44206216". at
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581)
> at
> org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
> at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569) at
> org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069) at
> org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
> at java.security.AccessController.doPrivileged(Native Method) at
> javax.security.auth.Subject.doAs(Subject.java:422) at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
> at
> org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
> at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)Caused
> by: java.util.concurrent.ExecutionException:
> java.util.concurrent.TimeoutException at
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915) at
> org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579)
> ... 9 moreCaused by: java.util.concurrent.TimeoutException at
> org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1220)
> at
> org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
> at
> org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266) at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
> at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> {panel}
>
> flink can't stop the job,
> but when i user 'flink stop -m jobmanager.server.host:port xxxxx' , it
> work well.
> '-m' is an option args, The old version does not have this problem
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)