[
https://issues.apache.org/jira/browse/FLINK-33014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matthias Pohl updated FLINK-33014:
----------------------------------
Description:
The Flink cluster was deployed using the Docker image of Flink 1.17.1 java8.
After deployment, on k8s, in standalone form, jobmanager printed this error at
intervals, and taskmanager did not print any errors,
There are currently no jobs running
{code:java}
2023-09-01 11:34:14,293 WARN
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] - Unhandled
exception
java.io.IOException: Connection reset by peer
at sun.nio.ch.FileDispatcherImpl.read0(Native Method) ~[?:1.8.0_372]
at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39) ~[?:1.8.0_372]
at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:223) ~[?:1.8.0_372]
at sun.nio.ch.IOUtil.read(IOUtil.java:192) ~[?:1.8.0_372]
at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:379)
~[?:1.8.0_372]
at
org.apache.flink.shaded.netty4.io.netty.buffer.PooledByteBuf.setBytes(PooledByteBuf.java:258)
~[flink-dist-1.17.1.jar:1.17.1]
at
org.apache.flink.shaded.netty4.io.netty.buffer.AbstractByteBuf.writeBytes(AbstractByteBuf.java:1132)
~[flink-dist-1.17.1.jar:1.17.1]
at
org.apache.flink.shaded.netty4.io.netty.channel.socket.nio.NioSocketChannel.doReadBytes(NioSocketChannel.java:357)
~[flink-dist-1.17.1.jar:1.17.1]
at
org.apache.flink.shaded.netty4.io.netty.channel.nio.AbstractNioByteChannel$NioByteUnsafe.read(AbstractNioByteChannel.java:151)
[flink-dist-1.17.1.jar:1.17.1]
at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKey(NioEventLoop.java:788)
[flink-dist-1.17.1.jar:1.17.1]
at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKeysOptimized(NioEventLoop.java:724)
[flink-dist-1.17.1.jar:1.17.1]
at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKeys(NioEventLoop.java:650)
[flink-dist-1.17.1.jar:1.17.1]
at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:562)
[flink-dist-1.17.1.jar:1.17.1]
at
org.apache.flink.shaded.netty4.io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:997)
[flink-dist-1.17.1.jar:1.17.1]
at
org.apache.flink.shaded.netty4.io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
[flink-dist-1.17.1.jar:1.17.1]
at java.lang.Thread.run(Thread.java:750) [?:1.8.0_372]
{code}
was:
The Flink cluster was deployed using the Docker image of Flink 1.17.1 java8.
After deployment, on k8s, in standalone form, jobmanager printed this error at
intervals, and taskmanager did not print any errors,
There are currently no jobs running
{code:java}
2023-09-01 11:34:14,293 WARN
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] - Unhandled
exceptionjava.io.IOException: Connection reset by peer at
sun.nio.ch.FileDispatcherImpl.read0(Native Method) ~[?:1.8.0_372] at
sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39) ~[?:1.8.0_372] at
sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:223) ~[?:1.8.0_372] at
sun.nio.ch.IOUtil.read(IOUtil.java:192) ~[?:1.8.0_372] at
sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:379) ~[?:1.8.0_372]
at
org.apache.flink.shaded.netty4.io.netty.buffer.PooledByteBuf.setBytes(PooledByteBuf.java:258)
~[flink-dist-1.17.1.jar:1.17.1] at
org.apache.flink.shaded.netty4.io.netty.buffer.AbstractByteBuf.writeBytes(AbstractByteBuf.java:1132)
~[flink-dist-1.17.1.jar:1.17.1] at
org.apache.flink.shaded.netty4.io.netty.channel.socket.nio.NioSocketChannel.doReadBytes(NioSocketChannel.java:357)
~[flink-dist-1.17.1.jar:1.17.1] at
org.apache.flink.shaded.netty4.io.netty.channel.nio.AbstractNioByteChannel$NioByteUnsafe.read(AbstractNioByteChannel.java:151)
[flink-dist-1.17.1.jar:1.17.1] at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKey(NioEventLoop.java:788)
[flink-dist-1.17.1.jar:1.17.1] at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKeysOptimized(NioEventLoop.java:724)
[flink-dist-1.17.1.jar:1.17.1] at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKeys(NioEventLoop.java:650)
[flink-dist-1.17.1.jar:1.17.1] at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:562)
[flink-dist-1.17.1.jar:1.17.1] at
org.apache.flink.shaded.netty4.io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:997)
[flink-dist-1.17.1.jar:1.17.1] at
org.apache.flink.shaded.netty4.io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
[flink-dist-1.17.1.jar:1.17.1] at java.lang.Thread.run(Thread.java:750)
[?:1.8.0_372] {code}
> flink jobmanager raise java.io.IOException: Connection reset by peer
> ---------------------------------------------------------------------
>
> Key: FLINK-33014
> URL: https://issues.apache.org/jira/browse/FLINK-33014
> Project: Flink
> Issue Type: Bug
> Affects Versions: 1.17.1
> Environment: |*blob.server.port*|6124|
> |*classloader.resolve-order*|parent-first|
> |*jobmanager.execution.failover-strategy*|region|
> |*jobmanager.memory.heap.size*|2228014280b|
> |*jobmanager.memory.jvm-metaspace.size*|536870912b|
> |*jobmanager.memory.jvm-overhead.max*|322122552b|
> |*jobmanager.memory.jvm-overhead.min*|322122552b|
> |*jobmanager.memory.off-heap.size*|134217728b|
> |*jobmanager.memory.process.size*|3gb|
> |*jobmanager.rpc.address*|naf-flink-ms-flink-manager-1-59m7w|
> |*jobmanager.rpc.port*|6123|
> |*parallelism.default*|1|
> |*query.server.port*|6125|
> |*rest.address*|0.0.0.0|
> |*rest.bind-address*|0.0.0.0|
> |*rest.connection-timeout*|60000|
> |*rest.server.numThreads*|8|
> |*slot.request.timeout*|3000000|
> |*state.backend.rocksdb.localdir*|/home/nafplat/data/flinkStateStore|
> |*state.backend.type*|rocksdb|
> |*taskmanager.bind-host*|0.0.0.0|
> |*taskmanager.host*|0.0.0.0|
> |*taskmanager.memory.framework.off-heap.batch-shuffle.size*|256mb|
> |*taskmanager.memory.framework.off-heap.size*|512mb|
> |*taskmanager.memory.managed.fraction*|0.4|
> |*taskmanager.memory.network.fraction*|0.2|
> |*taskmanager.memory.process.size*|5gb|
> |*taskmanager.memory.task.off-heap.size*|268435456bytes|
> |*taskmanager.numberOfTaskSlots*|2|
> |*taskmanager.runtime.large-record-handler*|true|
> |*web.submit.enable*|true|
> |*web.tmpdir*|/tmp/flink-web-c1b57e2b-5426-4fb8-a9ce-5acd1cceefc9|
> |*web.upload.dir*|/opt/flink/nafJar|
> Reporter: zhu
> Priority: Major
>
>
> The Flink cluster was deployed using the Docker image of Flink 1.17.1 java8.
> After deployment, on k8s, in standalone form, jobmanager printed this error
> at intervals, and taskmanager did not print any errors,
> There are currently no jobs running
> {code:java}
> 2023-09-01 11:34:14,293 WARN
> org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] - Unhandled
> exception
> java.io.IOException: Connection reset by peer
> at sun.nio.ch.FileDispatcherImpl.read0(Native Method) ~[?:1.8.0_372]
> at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:39)
> ~[?:1.8.0_372]
> at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:223) ~[?:1.8.0_372]
> at sun.nio.ch.IOUtil.read(IOUtil.java:192) ~[?:1.8.0_372]
> at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:379)
> ~[?:1.8.0_372]
> at
> org.apache.flink.shaded.netty4.io.netty.buffer.PooledByteBuf.setBytes(PooledByteBuf.java:258)
> ~[flink-dist-1.17.1.jar:1.17.1]
> at
> org.apache.flink.shaded.netty4.io.netty.buffer.AbstractByteBuf.writeBytes(AbstractByteBuf.java:1132)
> ~[flink-dist-1.17.1.jar:1.17.1]
> at
> org.apache.flink.shaded.netty4.io.netty.channel.socket.nio.NioSocketChannel.doReadBytes(NioSocketChannel.java:357)
> ~[flink-dist-1.17.1.jar:1.17.1]
> at
> org.apache.flink.shaded.netty4.io.netty.channel.nio.AbstractNioByteChannel$NioByteUnsafe.read(AbstractNioByteChannel.java:151)
> [flink-dist-1.17.1.jar:1.17.1]
> at
> org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKey(NioEventLoop.java:788)
> [flink-dist-1.17.1.jar:1.17.1]
> at
> org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKeysOptimized(NioEventLoop.java:724)
> [flink-dist-1.17.1.jar:1.17.1]
> at
> org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKeys(NioEventLoop.java:650)
> [flink-dist-1.17.1.jar:1.17.1]
> at
> org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:562)
> [flink-dist-1.17.1.jar:1.17.1]
> at
> org.apache.flink.shaded.netty4.io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:997)
> [flink-dist-1.17.1.jar:1.17.1]
> at
> org.apache.flink.shaded.netty4.io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
> [flink-dist-1.17.1.jar:1.17.1]
> at java.lang.Thread.run(Thread.java:750) [?:1.8.0_372]
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)