[
https://issues.apache.org/jira/browse/FLINK-21647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17413242#comment-17413242
]
Kevin commented on FLINK-21647:
-------------------------------
Hello [~wangyang0918], thanks for you reply!
My goal is to run Kafka 2.8.0 and Flink 1.13.2 on Amazone Kubernetes Service
1.20.7 (AKS). AKS seems to run fine and Kafka (which I deployed via Strimzi)
can produce and consume message via console. However, I now struggle with
staring a Flink session with following command:
{{}}
{code:java}
./bin/kubernetes-session.sh -Dkubernetes.cluster-id=my-first-flink-cluster{code}
{{Do the logging logs provide meangingful insights to you?}}
{{BR,}}
{{Kevin}}
----
{{Checking status of pods:}}
{code:java}
kevin@road-condition-vm-flink-client:~$ kubectl get pods
NAME READY STATUS RESTARTS AGE
my-first-flink-clustercd-6d59756c7c-9fb7s 0/1 CrashLoopBackOff 167 14h
road-condition-kafka-kafka-0 1/1 Running 0 11d
road-condition-kafka-zookeeper-0 1/1 Running 0 11d
strimzi-cluster-operator-687fdd6f77-24ccn 1/1 Running 7 11d
{code}
Checking status of services (flink monitoring service is not accessible even
both is provided, internal & external IP) – I have masked the external IP in
the code below:
{code:java}
kevin@road-condition-vm-flink-client:~$ kubectl get services
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
kubernetes ClusterIP 10.0.0.1 <none> 443/TCP 31d
my-first-flink-clustercd ClusterIP None <none> 6123/TCP,6124/TCP 14h
my-first-flink-clustercd-rest LoadBalancer 10.0.138.102 20.XX.XX.0
8081:32603/TCP 14h
road-condition-kafka-kafka-bootstrap ClusterIP 10.0.99.147 <none>
9091/TCP,9092/TCP 11d
road-condition-kafka-kafka-brokers ClusterIP None <none>
9090/TCP,9091/TCP,9092/TCP 11d
road-condition-kafka-zookeeper-client ClusterIP 10.0.59.247 <none> 2181/TCP 11d
road-condition-kafka-zookeeper-nodes ClusterIP None <none>
2181/TCP,2888/TCP,3888/TCP 11d{code}
Checking logs of flink session pod:
{code:java}
kevin@road-condition-vm-flink-client:~$ kubectl logs
my-first-flink-clustercd-6d59756c7c-9fb7s
sed: couldn't open temporary file /opt/flink/conf/sed0NT8SI: Read-only file
system
sed: couldn't open temporary file /opt/flink/conf/sed5rHenH: Read-only file
system
/docker-entrypoint.sh: line 73: /opt/flink/conf/flink-conf.yaml: Read-only file
system
sed: couldn't open temporary file /opt/flink/conf/sedi357bL: Read-only file
system
/docker-entrypoint.sh: line 88: /opt/flink/conf/flink-conf.yaml.tmp: Read-only
file system
Starting kubernetes-session as a console application on host
my-first-flink-clustercd-6d59756c7c-9fb7s.
2021-09-10 15:14:04,524 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
--------------------------------------------------------------------------------
2021-09-10 15:14:04,527 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Preconfiguration:
2021-09-10 15:14:04,527 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
RESOURCE_PARAMS extraction logs:
jvm_params: -Xmx1073741824 -Xms1073741824 -XX:MaxMetaspaceSize=268435456
dynamic_configs: -D jobmanager.memory.off-heap.size=134217728b -D
jobmanager.memory.jvm-overhead.min=201326592b -D
jobmanager.memory.jvm-metaspace.size=268435456b -D
jobmanager.memory.heap.size=1073741824b -D
jobmanager.memory.jvm-overhead.max=201326592b
logs: INFO [] - Loading configuration property: blob.server.port, 6124
INFO [] - Loading configuration property: taskmanager.memory.process.size, 1728m
INFO [] - Loading configuration property:
kubernetes.internal.jobmanager.entrypoint.class,
org.apache.flink.kubernetes.entrypoint.KubernetesSessionClusterEntrypoint
INFO [] - Loading configuration property:
jobmanager.execution.failover-strategy, region
INFO [] - Loading configuration property: jobmanager.rpc.address,
my-first-flink-clustercd.default
INFO [] - Loading configuration property: execution.target, kubernetes-session
INFO [] - Loading configuration property: jobmanager.memory.process.size, 1600m
INFO [] - Loading configuration property: jobmanager.rpc.port, 6123
INFO [] - Loading configuration property: kubernetes.cluster-id,
my-first-flink-clustercd
INFO [] - Loading configuration property: taskmanager.rpc.port, 6122
INFO [] - Loading configuration property: internal.cluster.execution-mode,
NORMAL
INFO [] - Loading configuration property: parallelism.default, 1
INFO [] - Loading configuration property: taskmanager.numberOfTaskSlots, 1
INFO [] - The derived from fraction jvm overhead memory (160.000mb (167772162
bytes)) is less than its min value 192.000mb (201326592 bytes), min value will
be used instead
INFO [] - Final Master Memory configuration:
INFO [] - Total Process Memory: 1.563gb (1677721600 bytes)
INFO [] - Total Flink Memory: 1.125gb (1207959552 bytes)
INFO [] - JVM Heap: 1024.000mb (1073741824 bytes)
INFO [] - Off-heap: 128.000mb (134217728 bytes)
INFO [] - JVM Metaspace: 256.000mb (268435456 bytes)
INFO [] - JVM Overhead: 192.000mb (201326592 bytes)
2021-09-10 15:14:04,528 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
--------------------------------------------------------------------------------
2021-09-10 15:14:04,528 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Starting
KubernetesSessionClusterEntrypoint (Version: 1.13.2, Scala: 2.11, Rev:5f007ff,
Date:2021-07-23T04:35:55+02:00)
2021-09-10 15:14:04,529 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - OS current user:
flink
2021-09-10 15:14:04,529 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Current
Hadoop/Kerberos user: <no hadoop dependency found>
2021-09-10 15:14:04,529 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - JVM: OpenJDK 64-Bit
Server VM - Oracle Corporation - 1.8/25.302-b08
2021-09-10 15:14:04,529 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Maximum heap size:
989 MiBytes
2021-09-10 15:14:04,529 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - JAVA_HOME:
/usr/local/openjdk-8
2021-09-10 15:14:04,530 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - No Hadoop Dependency
available
2021-09-10 15:14:04,530 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - JVM Options:
2021-09-10 15:14:04,530 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - -Xmx1073741824
2021-09-10 15:14:04,530 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - -Xms1073741824
2021-09-10 15:14:04,530 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
-XX:MaxMetaspaceSize=268435456
2021-09-10 15:14:04,530 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
-Dlog.file=/opt/flink/log/flink--kubernetes-session-0-my-first-flink-clustercd-6d59756c7c-9fb7s.log
2021-09-10 15:14:04,530 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
-Dlog4j.configuration=file:/opt/flink/conf/log4j-console.properties
2021-09-10 15:14:04,531 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
-Dlog4j.configurationFile=file:/opt/flink/conf/log4j-console.properties
2021-09-10 15:14:04,531 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
-Dlogback.configurationFile=file:/opt/flink/conf/logback-console.xml
2021-09-10 15:14:04,531 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Program Arguments:
2021-09-10 15:14:04,532 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - -D
2021-09-10 15:14:04,532 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
jobmanager.memory.off-heap.size=134217728b
2021-09-10 15:14:04,532 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - -D
2021-09-10 15:14:04,532 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
jobmanager.memory.jvm-overhead.min=201326592b
2021-09-10 15:14:04,533 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - -D
2021-09-10 15:14:04,533 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
jobmanager.memory.jvm-metaspace.size=268435456b
2021-09-10 15:14:04,533 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - -D
2021-09-10 15:14:04,533 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
jobmanager.memory.heap.size=1073741824b
2021-09-10 15:14:04,533 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - -D
2021-09-10 15:14:04,533 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
jobmanager.memory.jvm-overhead.max=201326592b
2021-09-10 15:14:04,533 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Classpath:
/opt/flink/lib/flink-csv-1.13.2.jar:/opt/flink/lib/flink-json-1.13.2.jar:/opt/flink/lib/flink-shaded-zookeeper-3.4.14.jar:/opt/flink/lib/flink-table-blink_2.11-1.13.2.jar:/opt/flink/lib/flink-table_2.11-1.13.2.jar:/opt/flink/lib/log4j-1.2-api-2.12.1.jar:/opt/flink/lib/log4j-api-2.12.1.jar:/opt/flink/lib/log4j-core-2.12.1.jar:/opt/flink/lib/log4j-slf4j-impl-2.12.1.jar:/opt/flink/lib/flink-dist_2.11-1.13.2.jar:::
2021-09-10 15:14:04,534 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] -
--------------------------------------------------------------------------------
2021-09-10 15:14:04,535 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Registered UNIX
signal handlers for [TERM, HUP, INT]
2021-09-10 15:14:04,547 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: blob.server.port, 6124
2021-09-10 15:14:04,547 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: taskmanager.memory.process.size, 1728m
2021-09-10 15:14:04,547 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property:
kubernetes.internal.jobmanager.entrypoint.class,
org.apache.flink.kubernetes.entrypoint.KubernetesSessionClusterEntrypoint
2021-09-10 15:14:04,548 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: jobmanager.execution.failover-strategy,
region
2021-09-10 15:14:04,548 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: jobmanager.rpc.address,
my-first-flink-clustercd.default
2021-09-10 15:14:04,548 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: execution.target, kubernetes-session
2021-09-10 15:14:04,548 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: jobmanager.memory.process.size, 1600m
2021-09-10 15:14:04,548 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: jobmanager.rpc.port, 6123
2021-09-10 15:14:04,548 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: kubernetes.cluster-id,
my-first-flink-clustercd
2021-09-10 15:14:04,549 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: taskmanager.rpc.port, 6122
2021-09-10 15:14:04,549 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: internal.cluster.execution-mode, NORMAL
2021-09-10 15:14:04,549 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: parallelism.default, 1
2021-09-10 15:14:04,549 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: taskmanager.numberOfTaskSlots, 1
2021-09-10 15:14:04,619 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Starting
KubernetesSessionClusterEntrypoint.
2021-09-10 15:14:04,659 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Install default
filesystem.
2021-09-10 15:14:04,709 INFO org.apache.flink.core.fs.FileSystem [] - Hadoop is
not in the classpath/dependencies. The extended set of supported File Systems
via Hadoop is not available.
2021-09-10 15:14:04,743 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Install security
context.
2021-09-10 15:14:04,754 INFO
org.apache.flink.runtime.security.modules.HadoopModuleFactory [] - Cannot
create Hadoop Security Module because Hadoop cannot be found in the Classpath.
2021-09-10 15:14:04,758 INFO
org.apache.flink.runtime.security.modules.JaasModule [] - Jaas file will be
created as /tmp/jaas-5459458947931074872.conf.
2021-09-10 15:14:04,810 INFO
org.apache.flink.runtime.security.contexts.HadoopSecurityContextFactory [] -
Cannot install HadoopSecurityContext because Hadoop cannot be found in the
Classpath.
2021-09-10 15:14:04,811 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Initializing cluster
services.
2021-09-10 15:14:04,833 INFO
org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils [] - Trying to start
actor system, external address my-first-flink-clustercd.default:6123, bind
address 0.0.0.0:6123.
2021-09-10 15:14:05,635 INFO akka.event.slf4j.Slf4jLogger [] - Slf4jLogger
started
2021-09-10 15:14:05,708 INFO akka.remote.Remoting [] - Starting remoting
2021-09-10 15:14:05,921 INFO akka.remote.Remoting [] - Remoting started;
listening on addresses :[akka.tcp://[email protected]:6123]
2021-09-10 15:14:06,036 INFO
org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils [] - Actor system started
at akka.tcp://[email protected]:6123
2021-09-10 15:14:06,116 INFO org.apache.flink.configuration.Configuration [] -
Config uses fallback configuration key 'jobmanager.rpc.address' instead of key
'rest.address'
2021-09-10 15:14:06,124 INFO org.apache.flink.runtime.blob.BlobServer [] -
Created BLOB server storage directory
/tmp/blobStore-951adff6-057f-4c4e-a15e-150904156cc0
2021-09-10 15:14:06,131 INFO org.apache.flink.runtime.blob.BlobServer [] -
Started BLOB server at 0.0.0.0:6124 - max concurrent requests: 50 - max
backlog: 1000
2021-09-10 15:14:06,162 INFO
org.apache.flink.runtime.metrics.MetricRegistryImpl [] - No metrics reporter
configured, no metrics will be exposed/reported.
2021-09-10 15:14:06,207 INFO
org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils [] - Trying to start
actor system, external address my-first-flink-clustercd.default:0, bind address
0.0.0.0:0.
2021-09-10 15:14:06,244 INFO akka.event.slf4j.Slf4jLogger [] - Slf4jLogger
started
2021-09-10 15:14:06,251 INFO akka.remote.Remoting [] - Starting remoting
2021-09-10 15:14:06,313 INFO akka.remote.Remoting [] - Remoting started;
listening on addresses
:[akka.tcp://[email protected]:36611]
2021-09-10 15:14:06,324 INFO
org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils [] - Actor system started
at akka.tcp://[email protected]:36611
2021-09-10 15:14:06,344 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService
[] - Starting RPC endpoint for
org.apache.flink.runtime.metrics.dump.MetricQueryService at
akka://flink-metrics/user/rpc/MetricQueryService .
2021-09-10 15:14:06,430 INFO
org.apache.flink.runtime.dispatcher.FileExecutionGraphInfoStore [] -
Initializing FileExecutionGraphInfoStore: Storage directory
/tmp/executionGraphStore-388e1bbd-4263-4a3b-a921-4afd2164ab27, expiration time
3600000, maximum cache size 52428800 bytes.
2021-09-10 15:14:06,534 INFO org.apache.flink.configuration.Configuration [] -
Config uses fallback configuration key 'jobmanager.rpc.address' instead of key
'rest.address'
2021-09-10 15:14:06,535 INFO
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] - Upload
directory /tmp/flink-web-820a0acb-c0ad-4bc0-871b-0fb546f21993/flink-web-upload
does not exist.
2021-09-10 15:14:06,536 INFO
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] - Created
directory /tmp/flink-web-820a0acb-c0ad-4bc0-871b-0fb546f21993/flink-web-upload
for file uploads.
2021-09-10 15:14:06,537 INFO
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] - Starting rest
endpoint.
2021-09-10 15:14:07,046 INFO
org.apache.flink.runtime.webmonitor.WebMonitorUtils [] - Determined location of
main cluster component log file:
/opt/flink/log/flink--kubernetes-session-0-my-first-flink-clustercd-6d59756c7c-9fb7s.log
2021-09-10 15:14:07,046 INFO
org.apache.flink.runtime.webmonitor.WebMonitorUtils [] - Determined location of
main cluster component stdout file:
/opt/flink/log/flink--kubernetes-session-0-my-first-flink-clustercd-6d59756c7c-9fb7s.out
2021-09-10 15:14:07,278 INFO
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] - Rest endpoint
listening at my-first-flink-clustercd.default:8081
2021-09-10 15:14:07,279 INFO
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] -
http://my-first-flink-clustercd.default:8081 was granted leadership with
leaderSessionID=00000000-0000-0000-0000-000000000000
2021-09-10 15:14:07,306 INFO
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] - Web frontend
listening at http://my-first-flink-clustercd.default:8081.
2021-09-10 15:14:07,330 INFO
org.apache.flink.runtime.util.config.memory.ProcessMemoryUtils [] - The derived
from fraction jvm overhead memory (172.800mb (181193935 bytes)) is less than
its min value 192.000mb (201326592 bytes), min value will be used instead
2021-09-10 15:14:08,109 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: blob.server.port, 6124
2021-09-10 15:14:08,110 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: taskmanager.memory.process.size, 1728m
2021-09-10 15:14:08,110 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property:
kubernetes.internal.jobmanager.entrypoint.class,
org.apache.flink.kubernetes.entrypoint.KubernetesSessionClusterEntrypoint
2021-09-10 15:14:08,110 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: jobmanager.execution.failover-strategy,
region
2021-09-10 15:14:08,111 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: jobmanager.rpc.address,
my-first-flink-clustercd.default
2021-09-10 15:14:08,112 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: execution.target, kubernetes-session
2021-09-10 15:14:08,113 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: jobmanager.memory.process.size, 1600m
2021-09-10 15:14:08,113 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: jobmanager.rpc.port, 6123
2021-09-10 15:14:08,114 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: kubernetes.cluster-id,
my-first-flink-clustercd
2021-09-10 15:14:08,115 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: taskmanager.rpc.port, 6122
2021-09-10 15:14:08,116 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: internal.cluster.execution-mode, NORMAL
2021-09-10 15:14:08,117 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: parallelism.default, 1
2021-09-10 15:14:08,117 INFO org.apache.flink.configuration.GlobalConfiguration
[] - Loading configuration property: taskmanager.numberOfTaskSlots, 1
2021-09-10 15:14:08,122 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService
[] - Starting RPC endpoint for
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager at
akka://flink/user/rpc/resourcemanager_0 .
2021-09-10 15:14:08,143 INFO
org.apache.flink.runtime.dispatcher.runner.SessionDispatcherLeaderProcess [] -
Start SessionDispatcherLeaderProcess.
2021-09-10 15:14:08,148 INFO
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] -
Starting the resource manager.
2021-09-10 15:14:08,208 INFO
org.apache.flink.runtime.dispatcher.runner.SessionDispatcherLeaderProcess [] -
Recover all persisted job graphs.
2021-09-10 15:14:08,209 INFO
org.apache.flink.runtime.dispatcher.runner.SessionDispatcherLeaderProcess [] -
Successfully recovered 0 persisted job graphs.
2021-09-10 15:14:08,245 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService
[] - Starting RPC endpoint for
org.apache.flink.runtime.dispatcher.StandaloneDispatcher at
akka://flink/user/rpc/dispatcher_1 .
2021-09-10 15:14:09,146 WARN
io.fabric8.kubernetes.client.dsl.internal.WatchConnectionManager [] - Exec
Failure: HTTP 403, Status: 403 - pods is forbidden: User
"system:serviceaccount:default:default" cannot watch resource "pods" in API
group "" in the namespace "default"
java.net.ProtocolException: Expected HTTP 101 response but was '403 Forbidden'
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.ws.RealWebSocket.checkResponse(RealWebSocket.java:229)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.ws.RealWebSocket$2.onResponse(RealWebSocket.java:196)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.RealCall$AsyncCall.execute(RealCall.java:206)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.NamedRunnable.run(NamedRunnable.java:32)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
[?:1.8.0_302]
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
[?:1.8.0_302]
at java.lang.Thread.run(Thread.java:748) [?:1.8.0_302]
2021-09-10 15:14:09,207 INFO
org.apache.flink.kubernetes.kubeclient.resources.KubernetesPodsWatcher [] - The
watcher is closing.
2021-09-10 15:14:09,208 INFO
org.apache.flink.runtime.resourcemanager.slotmanager.DeclarativeSlotManager []
- Closing the slot manager.
2021-09-10 15:14:09,209 ERROR
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] -
Fatal error occurred in ResourceManager.
org.apache.flink.runtime.resourcemanager.exceptions.ResourceManagerException:
Could not start the ResourceManager
akka.tcp://[email protected]:6123/user/rpc/resourcemanager_0
at
org.apache.flink.runtime.resourcemanager.ResourceManager.onStart(ResourceManager.java:239)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.RpcEndpoint.internalCallOnStart(RpcEndpoint.java:181)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StoppedState.start(AkkaRpcActor.java:605)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleControlMessage(AkkaRpcActor.java:180)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.Actor$class.aroundReceive(Actor.scala:517)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.ActorCell.invoke(ActorCell.scala:561)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.run(Mailbox.scala:225)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.exec(Mailbox.scala:235)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
[flink-dist_2.11-1.13.2.jar:1.13.2]
Caused by:
org.apache.flink.runtime.resourcemanager.exceptions.ResourceManagerException:
Cannot initialize resource provider.
at
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager.initialize(ActiveResourceManager.java:156)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.ResourceManager.startResourceManagerServices(ResourceManager.java:251)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.ResourceManager.onStart(ResourceManager.java:235)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
... 20 more
Caused by: io.fabric8.kubernetes.client.KubernetesClientException: pods is
forbidden: User "system:serviceaccount:default:default" cannot watch resource
"pods" in API group "" in the namespace "default"
at
io.fabric8.kubernetes.client.dsl.internal.WatchConnectionManager$1.onFailure(WatchConnectionManager.java:203)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.ws.RealWebSocket.failWebSocket(RealWebSocket.java:571)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.ws.RealWebSocket$2.onResponse(RealWebSocket.java:198)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.RealCall$AsyncCall.execute(RealCall.java:206)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.NamedRunnable.run(NamedRunnable.java:32)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
~[?:1.8.0_302]
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
~[?:1.8.0_302]
at java.lang.Thread.run(Thread.java:748) ~[?:1.8.0_302]
Suppressed: java.lang.Throwable: waiting here
at io.fabric8.kubernetes.client.utils.Utils.waitUntilReady(Utils.java:144)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
io.fabric8.kubernetes.client.dsl.internal.WatchConnectionManager.waitUntilReady(WatchConnectionManager.java:341)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
io.fabric8.kubernetes.client.dsl.base.BaseOperation.watch(BaseOperation.java:755)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
io.fabric8.kubernetes.client.dsl.base.BaseOperation.watch(BaseOperation.java:739)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
io.fabric8.kubernetes.client.dsl.base.BaseOperation.watch(BaseOperation.java:70)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.kubeclient.Fabric8FlinkKubeClient.watchPodsAndDoCallback(Fabric8FlinkKubeClient.java:227)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.KubernetesResourceManagerDriver.watchTaskManagerPods(KubernetesResourceManagerDriver.java:331)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.KubernetesResourceManagerDriver.initializeInternal(KubernetesResourceManagerDriver.java:103)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.active.AbstractResourceManagerDriver.initialize(AbstractResourceManagerDriver.java:81)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager.initialize(ActiveResourceManager.java:154)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.ResourceManager.startResourceManagerServices(ResourceManager.java:251)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.ResourceManager.onStart(ResourceManager.java:235)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.RpcEndpoint.internalCallOnStart(RpcEndpoint.java:181)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StoppedState.start(AkkaRpcActor.java:605)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleControlMessage(AkkaRpcActor.java:180)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.Actor$class.aroundReceive(Actor.scala:517)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.ActorCell.invoke(ActorCell.scala:561)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.run(Mailbox.scala:225)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.exec(Mailbox.scala:235)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
[flink-dist_2.11-1.13.2.jar:1.13.2]
2021-09-10 15:14:09,214 ERROR
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Fatal error occurred
in the cluster entrypoint.
org.apache.flink.runtime.resourcemanager.exceptions.ResourceManagerException:
Could not start the ResourceManager
akka.tcp://[email protected]:6123/user/rpc/resourcemanager_0
at
org.apache.flink.runtime.resourcemanager.ResourceManager.onStart(ResourceManager.java:239)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.RpcEndpoint.internalCallOnStart(RpcEndpoint.java:181)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StoppedState.start(AkkaRpcActor.java:605)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleControlMessage(AkkaRpcActor.java:180)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.Actor$class.aroundReceive(Actor.scala:517)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.ActorCell.invoke(ActorCell.scala:561)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.run(Mailbox.scala:225)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.exec(Mailbox.scala:235)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
[flink-dist_2.11-1.13.2.jar:1.13.2]
Caused by:
org.apache.flink.runtime.resourcemanager.exceptions.ResourceManagerException:
Cannot initialize resource provider.
at
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager.initialize(ActiveResourceManager.java:156)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.ResourceManager.startResourceManagerServices(ResourceManager.java:251)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.ResourceManager.onStart(ResourceManager.java:235)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
... 20 more
Caused by: io.fabric8.kubernetes.client.KubernetesClientException: pods is
forbidden: User "system:serviceaccount:default:default" cannot watch resource
"pods" in API group "" in the namespace "default"
at
io.fabric8.kubernetes.client.dsl.internal.WatchConnectionManager$1.onFailure(WatchConnectionManager.java:203)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.ws.RealWebSocket.failWebSocket(RealWebSocket.java:571)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.ws.RealWebSocket$2.onResponse(RealWebSocket.java:198)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.RealCall$AsyncCall.execute(RealCall.java:206)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.NamedRunnable.run(NamedRunnable.java:32)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
~[?:1.8.0_302]
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
~[?:1.8.0_302]
at java.lang.Thread.run(Thread.java:748) ~[?:1.8.0_302]
Suppressed: java.lang.Throwable: waiting here
at io.fabric8.kubernetes.client.utils.Utils.waitUntilReady(Utils.java:144)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
io.fabric8.kubernetes.client.dsl.internal.WatchConnectionManager.waitUntilReady(WatchConnectionManager.java:341)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
io.fabric8.kubernetes.client.dsl.base.BaseOperation.watch(BaseOperation.java:755)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
io.fabric8.kubernetes.client.dsl.base.BaseOperation.watch(BaseOperation.java:739)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
io.fabric8.kubernetes.client.dsl.base.BaseOperation.watch(BaseOperation.java:70)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.kubeclient.Fabric8FlinkKubeClient.watchPodsAndDoCallback(Fabric8FlinkKubeClient.java:227)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.KubernetesResourceManagerDriver.watchTaskManagerPods(KubernetesResourceManagerDriver.java:331)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.kubernetes.KubernetesResourceManagerDriver.initializeInternal(KubernetesResourceManagerDriver.java:103)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.active.AbstractResourceManagerDriver.initialize(AbstractResourceManagerDriver.java:81)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager.initialize(ActiveResourceManager.java:154)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.ResourceManager.startResourceManagerServices(ResourceManager.java:251)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.resourcemanager.ResourceManager.onStart(ResourceManager.java:235)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.RpcEndpoint.internalCallOnStart(RpcEndpoint.java:181)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor$StoppedState.start(AkkaRpcActor.java:605)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at
org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleControlMessage(AkkaRpcActor.java:180)
~[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.Actor$class.aroundReceive(Actor.scala:517)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.actor.ActorCell.invoke(ActorCell.scala:561)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.run(Mailbox.scala:225)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.Mailbox.exec(Mailbox.scala:235)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
[flink-dist_2.11-1.13.2.jar:1.13.2]
at
akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
[flink-dist_2.11-1.13.2.jar:1.13.2]
Exception in thread "OkHttp Dispatcher"
java.util.concurrent.RejectedExecutionException: Task
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask@393ed1
rejected from
java.util.concurrent.ScheduledThreadPoolExecutor@1fd81363[Terminated, pool size
= 0, active threads = 0, queued tasks = 0, completed tasks = 0]
at
java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:2063)
at java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:830)
at
java.util.concurrent.ScheduledThreadPoolExecutor.delayedExecute(ScheduledThreadPoolExecutor.java:326)
at
java.util.concurrent.ScheduledThreadPoolExecutor.schedule(ScheduledThreadPoolExecutor.java:533)
at
java.util.concurrent.ScheduledThreadPoolExecutor.submit(ScheduledThreadPoolExecutor.java:632)
at
java.util.concurrent.Executors$DelegatedExecutorService.submit(Executors.java:678)
at
io.fabric8.kubernetes.client.dsl.internal.WatchConnectionManager.scheduleReconnect(WatchConnectionManager.java:305)
at
io.fabric8.kubernetes.client.dsl.internal.WatchConnectionManager.access$800(WatchConnectionManager.java:50)
at
io.fabric8.kubernetes.client.dsl.internal.WatchConnectionManager$1.onFailure(WatchConnectionManager.java:218)
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.ws.RealWebSocket.failWebSocket(RealWebSocket.java:571)
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.ws.RealWebSocket$2.onResponse(RealWebSocket.java:198)
at
org.apache.flink.kubernetes.shaded.okhttp3.RealCall$AsyncCall.execute(RealCall.java:206)
at
org.apache.flink.kubernetes.shaded.okhttp3.internal.NamedRunnable.run(NamedRunnable.java:32)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
2021-09-10 15:14:09,227 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint [] - Shutting
KubernetesSessionClusterEntrypoint down with application status UNKNOWN.
Diagnostics Cluster entrypoint has been closed externally..
2021-09-10 15:14:09,229 INFO
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] - Shutting down
rest endpoint.
2021-09-10 15:14:09,229 INFO org.apache.flink.runtime.blob.BlobServer [] -
Stopped BLOB server at 0.0.0.0:6124
{code}
{code:java}
kevin@road-condition-vm-flink-client:~$ kubectl get po
my-first-flink-clustercd-6d59756c7c-9fb7s -o yaml
apiVersion: v1
kind: Pod
metadata:
creationTimestamp: "2021-09-10T00:45:02Z"
generateName: my-first-flink-clustercd-6d59756c7c-
labels:
app: my-first-flink-clustercd
component: jobmanager
pod-template-hash: 6d59756c7c
type: flink-native-kubernetes
name: my-first-flink-clustercd-6d59756c7c-9fb7s
namespace: default
ownerReferences:
- apiVersion: apps/v1
blockOwnerDeletion: true
controller: true
kind: ReplicaSet
name: my-first-flink-clustercd-6d59756c7c
uid: d500b8e0-df0d-4544-b219-b36c83a3c1a5
resourceVersion: "3569128"
uid: c4fb95a0-b4c8-4dd5-828b-7f0f9077c09f
spec:
containers:
- args:
- bash
- -c
- kubernetes-jobmanager.sh kubernetes-session
command:
- /docker-entrypoint.sh
env:
- name: _POD_IP_ADDRESS
valueFrom:
fieldRef:
apiVersion: v1
fieldPath: status.podIP
image: apache/flink:1.13.2-scala_2.11
imagePullPolicy: IfNotPresent
name: flink-main-container
ports:
- containerPort: 8081
name: rest
protocol: TCP
- containerPort: 6123
name: jobmanager-rpc
protocol: TCP
- containerPort: 6124
name: blobserver
protocol: TCP
resources:
limits:
cpu: "1"
memory: 1600Mi
requests:
cpu: "1"
memory: 1600Mi
terminationMessagePath: /dev/termination-log
terminationMessagePolicy: File
volumeMounts:
- mountPath: /opt/flink/conf
name: flink-config-volume
- mountPath: /var/run/secrets/kubernetes.io/serviceaccount
name: default-token-5crsm
readOnly: true
dnsPolicy: ClusterFirst
enableServiceLinks: true
nodeName: aks-agentpool-43117142-vmss000002
preemptionPolicy: PreemptLowerPriority
priority: 0
restartPolicy: Always
schedulerName: default-scheduler
securityContext: {}
serviceAccount: default
serviceAccountName: default
terminationGracePeriodSeconds: 30
tolerations:
- effect: NoExecute
key: node.kubernetes.io/not-ready
operator: Exists
tolerationSeconds: 300
- effect: NoExecute
key: node.kubernetes.io/unreachable
operator: Exists
tolerationSeconds: 300
- effect: NoSchedule
key: node.kubernetes.io/memory-pressure
operator: Exists
volumes:
- configMap:
defaultMode: 420
items:
- key: logback-console.xml
path: logback-console.xml
- key: log4j-console.properties
path: log4j-console.properties
- key: flink-conf.yaml
path: flink-conf.yaml
name: flink-config-my-first-flink-clustercd
name: flink-config-volume
- name: default-token-5crsm
secret:
defaultMode: 420
secretName: default-token-5crsm
status:
conditions:
- lastProbeTime: null
lastTransitionTime: "2021-09-10T00:47:00Z"
status: "True"
type: Initialized
- lastProbeTime: null
lastTransitionTime: "2021-09-10T15:19:31Z"
message: 'containers with unready status: [flink-main-container]'
reason: ContainersNotReady
status: "False"
type: Ready
- lastProbeTime: null
lastTransitionTime: "2021-09-10T15:19:31Z"
message: 'containers with unready status: [flink-main-container]'
reason: ContainersNotReady
status: "False"
type: ContainersReady
- lastProbeTime: null
lastTransitionTime: "2021-09-10T00:47:00Z"
status: "True"
type: PodScheduled
containerStatuses:
- containerID:
containerd://005e86da1e020dec3313103c1231bb916921545fd7b091007e7ef412d88917cd
image: docker.io/apache/flink:1.13.2-scala_2.11
imageID:
docker.io/apache/flink@sha256:2da83bf5f7437769ba1f04caed217d20bb02494a018162bfc6b2467e1914ad77
lastState:
terminated:
containerID:
containerd://005e86da1e020dec3313103c1231bb916921545fd7b091007e7ef412d88917cd
exitCode: 239
finishedAt: "2021-09-10T15:19:30Z"
reason: Error
startedAt: "2021-09-10T15:19:17Z"
name: flink-main-container
ready: false
restartCount: 168
started: false
state:
waiting:
message: back-off 5m0s restarting failed container=flink-main-container
pod=my-first-flink-clustercd-6d59756c7c-9fb7s_default(c4fb95a0-b4c8-4dd5-828b-7f0f9077c09f)
reason: CrashLoopBackOff
hostIP: 10.240.0.5
phase: Running
podIP: 10.244.2.3
podIPs:
- ip: 10.244.2.3
qosClass: Guaranteed
startTime: "2021-09-10T00:47:00Z"{code}
> 'Run kubernetes session test (default input)' failed on Azure
> -------------------------------------------------------------
>
> Key: FLINK-21647
> URL: https://issues.apache.org/jira/browse/FLINK-21647
> Project: Flink
> Issue Type: Bug
> Components: Deployment / Kubernetes
> Affects Versions: 1.13.0
> Reporter: Jark Wu
> Priority: Blocker
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=14236&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=2247
--
This message was sent by Atlassian Jira
(v8.3.4#803005)