Attila Doroszlai created HDDS-9814:
--------------------------------------
Summary: NettyClientStreamRpc trying to establish connection during shutdown
Key: HDDS-9814
URL: https://issues.apache.org/jira/browse/HDDS-9814
Project: Apache Ozone
Issue Type: Bug
Reporter: Attila Doroszlai
{{TestOzoneFileSystemWithStreaming}} logs lots of {{"connection refused"}} and
{{"schedule reconnecting"}} messages during shutdown.
{code:title=sample}
2023-11-30 18:39:00,039 [main] INFO ozone.MiniOzoneClusterImpl (MiniOzoneClusterImpl.java:shutdown(449)) - Shutting down the Mini Ozone Cluster
...
2023-11-30 18:39:01,126 [Thread-1140] DEBUG client.NettyClientStreamRpc (NettyClientStreamRpc.java:close(452)) - flush? true
2023-11-30 18:39:01,122 [Thread-1133] WARN util.PeerProxyMap (PeerProxyMap.java:closeProxy(170)) - 2ad30381-d2ef-4801-b4d0-3965e2d4accc-NettyServerStreamRpc: Failed to close proxy for peer 3c309c61-fe70-460a-bb5b-539134755074|127.0.0.1:15014, proxy class: class org.apache.ratis.client.impl.DataStreamClientImpl
org.apache.ratis.thirdparty.io.netty.channel.AbstractChannel$AnnotatedConnectException: finishConnect(..) failed: Connection refused: /127.0.0.1:15015
Caused by: java.net.ConnectException: finishConnect(..) failed: Connection refused
    at org.apache.ratis.thirdparty.io.netty.channel.unix.Errors.newConnectException0(Errors.java:166)
    at org.apache.ratis.thirdparty.io.netty.channel.unix.Errors.handleConnectErrno(Errors.java:131)
    at org.apache.ratis.thirdparty.io.netty.channel.unix.Socket.finishConnect(Socket.java:359)
    at org.apache.ratis.thirdparty.io.netty.channel.epoll.AbstractEpollChannel$AbstractEpollUnsafe.doFinishConnect(AbstractEpollChannel.java:710)
    at org.apache.ratis.thirdparty.io.netty.channel.epoll.AbstractEpollChannel$AbstractEpollUnsafe.finishConnect(AbstractEpollChannel.java:687)
    at org.apache.ratis.thirdparty.io.netty.channel.epoll.AbstractEpollChannel$AbstractEpollUnsafe.epollOutReady(AbstractEpollChannel.java:567)
    at org.apache.ratis.thirdparty.io.netty.channel.epoll.EpollEventLoop.processReady(EpollEventLoop.java:499)
    at org.apache.ratis.thirdparty.io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:407)
    at org.apache.ratis.thirdparty.io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:997)
    at org.apache.ratis.thirdparty.io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
    at java.lang.Thread.run(Thread.java:750)
{code}
This is during a normal, successful test run.
On a related note, the test run sometimes "hangs" (fork timeout or, more recently,
job runner timeout). Unfortunately, we don't have logs for the latter.