We have a similar issue that only seems to occur during long-running jobs (over 5 hours). The traceback is different, but we also see the EOF exception. My client is Ubuntu Linux.
I tried to trace the issue by running tcpdump at both ends of the ssh session from the master to the slave, but I saw nothing amiss. I suspect that there is a connection inside the ssh session, but that will be hard to catch using tcpdump. I wonder if I need to enable some kind of keep-alives.

On Tuesday, September 29, 2020 at 4:47:22 AM UTC-4 [email protected] wrote:

> Hi Team,
>
> We are using JNLP to connect a Mac agent to a Linux master node.
>
> The Jenkins agent keeps disconnecting frequently, and we are getting the
> logs below on the master.
>
> Can you please suggest how to resolve this? What are the steps to triage
> it further?
>
> Some of the questions we are trying to answer are:
>
> - What is EOFException?
> - Why does the agent try to connect to the master when it is already connected?
> - Why does the ping / connection eventually fail?
>
> We see this pattern in the logs far too often. Any help would be
> appreciated.
>
> The results are the same whichever of the options below we try:
>
> - Connected using *Launch agent from Browser*
> - Connected by *starting automator* on the Mac, which runs shell/zsh to run agent.jar
> - Connected by running a *plist* on the Mac
>
> *Jenkins environment:*
>
> - Jenkins: 2.249.1
> - Master Node: Linux RHEL 8.1
> - Master Java Version: 1.8.0_242
> - Slave System: macOS Catalina, Version 10.15.6
> - Slave Java Version: 1.8.0_261
>
> Connection #xxx failed: java.io.EOFException
> Sep 29, 2020 2:45:21 AM INFO hudson.TcpSlaveAgentListener$ConnectionHandler run
> Accepted JNLP4-connect connection #xxx from x.x.x.x/x.x.x.x:57215
> Sep 29, 2020 2:45:21 AM INFO org.jenkinsci.remoting.protocol.impl.ConnectionHeadersFilterLayer onRecv
> [JNLP4-connect connection from x.x.x.x/x.x.x.x:57215] Refusing headers from remote: <agent_name> is already connected to this master. Rejecting this connection.
> Sep 29, 2020 2:45:31 AM INFO hudson.TcpSlaveAgentListener$ConnectionHandler run
> Connection #xxx failed: java.io.EOFException
> Sep 29, 2020 2:45:31 AM INFO hudson.TcpSlaveAgentListener$ConnectionHandler run
> Accepted JNLP4-connect connection #xxx from x.x.x.x/x.x.x.x:57218
> Sep 29, 2020 2:45:32 AM INFO org.jenkinsci.remoting.protocol.impl.ConnectionHeadersFilterLayer onRecv
> [JNLP4-connect connection from x.x.x.x/x.x.x.x] Refusing headers from remote: <agent_name> is already connected to this master. Rejecting this connection.
> Sep 29, 2020 2:45:32 AM INFO hudson.slaves.ChannelPinger$1 onDead
> Ping failed. Terminating the channel JNLP4-connect connection from x.x.x.x/x.x.x.x:57015.
> java.util.concurrent.TimeoutException: Ping started at 1601318492966 hasn't completed by 1601318732966
>     at hudson.remoting.PingThread.ping(PingThread.java:134)
>     at hudson.remoting.PingThread.run(PingThread.java:90)
>
> TIA
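
For anyone reading the TimeoutException in the quoted log: the two numbers appear to be epoch timestamps in milliseconds, so the length of the ping window can be read straight off the message. A minimal Python sketch of that arithmetic follows; the constant and function names are illustrative only and are not part of Jenkins or its remoting library.

```python
from datetime import datetime, timezone

# Values copied from the TimeoutException in the quoted log.
# Assumption: both are epoch timestamps in milliseconds.
PING_STARTED_MS = 1601318492966
PING_DEADLINE_MS = 1601318732966


def ping_window_seconds(started_ms: int, deadline_ms: int) -> float:
    """Return how long the ping was allowed to run, in seconds."""
    return (deadline_ms - started_ms) / 1000.0


def to_utc(ms: int) -> datetime:
    """Convert epoch milliseconds to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(ms / 1000.0, tz=timezone.utc)


if __name__ == "__main__":
    # Length of the window between "started" and "hasn't completed by".
    print(ping_window_seconds(PING_STARTED_MS, PING_DEADLINE_MS))  # 240.0
    # When the ping started, in UTC, for lining up with a packet capture.
    print(to_utc(PING_STARTED_MS).isoformat())
```

The difference works out to 240 seconds, so the master waited four minutes for the ping to come back before terminating the channel. Converting the start time to UTC may help when matching the event against tcpdump captures taken on machines in different time zones.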
