zhu created FLINK-32943:
---------------------------
Summary: sometime run batch tasks concurrently, the tasks still in
the initialization status
Key: FLINK-32943
URL: https://issues.apache.org/jira/browse/FLINK-32943
Project: Flink
Issue Type: Bug
Environment: flink 1.15.2
|*lob.server.port*|6124|
|*classloader.resolve-order*|parent-first|
|*jobmanager.execution.failover-strategy*|region|
|*jobmanager.memory.heap.size*|2228014280b|
|*jobmanager.memory.jvm-metaspace.size*|536870912b|
|*jobmanager.memory.jvm-overhead.max*|322122552b|
|*jobmanager.memory.jvm-overhead.min*|322122552b|
|*jobmanager.memory.off-heap.size*|134217728b|
|*jobmanager.memory.process.size*|3gb|
|*jobmanager.rpc.address*|naf-flink-ms-flink-manager-1-4gcwz|
|*jobmanager.rpc.port*|6123|
|*parallelism.default*|1|
|*query.server.port*|6125|
|*rest.address*|0.0.0.0|
|*rest.bind-address*|0.0.0.0|
|*rest.connection-timeout*|60000|
|*rest.server.numThreads*|8|
|*slot.request.timeout*|3000000|
|*state.backend.rocksdb.localdir*|/home/nafplat/data/flinkStateStore|
|*state.backend.type*|rocksdb|
|*taskmanager.bind-host*|0.0.0.0|
|*taskmanager.host*|0.0.0.0|
|*taskmanager.memory.framework.off-heap.batch-shuffle.size*|256mb|
|*taskmanager.memory.framework.off-heap.size*|512mb|
|*taskmanager.memory.managed.fraction*|0.4|
|*taskmanager.memory.network.fraction*|0.2|
|*taskmanager.memory.process.size*|16gb|
|*taskmanager.memory.task.off-heap.size*|268435456bytes|
|*taskmanager.numberOfTaskSlots*|6|
|*taskmanager.runtime.large-record-handler*|true|
|*web.submit.enable*|true|
|*web.tmpdir*|/tmp/flink-web-4be192ba-870a-4f88-8185-d07fa6303cca|
|*web.upload.dir*|/opt/flink/nafJar|
Reporter: zhu
run 1.15.2 flink session on k8s,In most cases, there is no problem. Sometimes,
tasks are initialized continuously, and subsequent tasks are also initialized
continuously,
I run batch job with 6 concurrent,jobmanage with 2cpu and 3g Memory
This problem always occurs, it seems that there is a deadlock during
initialization, but my job does not have any deadlock issues
--
This message was sent by Atlassian Jira
(v8.20.10#820010)