We have a similar issue that only seems to occur during long running jobs 
(over 5 hours). The traceback is different but we also see the EOF 
exception. My client is Ubuntu linux 

I tried to trace the issue by running tcpdumps at both ends on the ssh 
session from the master to the slave but I saw nothing amiss. I suspect 
that there is a connection inside the ssh session but that will be hard to 
catch using tcpdump. 

I wonder if I need to enable some kind of keep alives.....


On Tuesday, September 29, 2020 at 4:47:22 AM UTC-4 [email protected] 
wrote:

> Hi Team, We are using JNLP to connect Mac agent to Linux master node.
>
> Jenkins agent keeps disconnecting frequently, and we are getting below 
> logs in master.
>
> Can you please suggest how to resolve this? What are the steps to further 
> triage the same.
>
> Some of the questions we are trying to answer is:
>
>    - What is EOFException?
>    - Why does agent tries to connect to master when its already connected?
>    - Why does eventually the ping / connection fails? 
>
> We keep seeing this pattern in logs too often and too frequently. Any help 
> would be appreciated.
>
> Results are same even if we try any of the below options:
>
>    - Connected using *Launch agent from Browser*
>    - Connected by *starting automator* in Mac which runs shell/zsh to run 
>    agent.jar
>    - Connected by running *plist* in Mac
>
> * Jenkins environment:*
>
>    - Jenkins: 2.249.1
>    - Master Node: Linux RHEL 8.1
>    - Master Java Version: 1.8.0_242
>    - Slave System: macOS Catalina, Version 10.15.6
>    - Slave Java Version: 1.8.0_261
>
> *Connection #xxx failed: java.io.EOFException Sep 29, 2020 2:45:21 AM  
> INFO hudson.TcpSlaveAgentListener$ConnectionHandler run Accepted 
> JNLP4-connect connection #xxx from x.x.x.x/x.x.x.x:57215 Sep 29, 2020 
> 2:45:21 AM  
> INFO org.jenkinsci.remoting.protocol.impl.ConnectionHeadersFilterLayer on 
> Recv[JNLP4-connect connection from x.x.x.x/x.x.x.x:57215] Refusing headers 
> from remote: <agent_name> is already connected to this master. Rejecting 
> this connection.Sep 29, 2020 2:45:31 AM  
> INFO hudson.TcpSlaveAgentListener$ConnectionHandler runConnection #xxx 
> failed: java.io.EOFException Sep 29, 2020 2:45:31 
> AM INFO hudson.TcpSlaveAgentListener$ConnectionHandler runAccepted 
> JNLP4-connect connection #xxx from x.x.x.x/x.x.x.x:57218 Sep 29, 2020 
> 2:45:32 AM  
> INFO org.jenkinsci.remoting.protocol.impl.ConnectionHeadersFilterLayer 
> onRecv[JNLP4-connect connection from x.x.x.x/x.x.x.x] Refusing headers from 
> remote: <agent_name> is already connected to this master. Rejecting this 
> connection.Sep 29, 2020 2:45:32 AM INFO hudson.slaves.ChannelPinger$1 
> onDeadPing failed. Terminating the channel JNLP4-connect connection from 
> x.x.x.x/x.x.x.x:57015. java.util.concurrent.TimeoutException: Ping started 
> at 1601318492966 hasn't completed by 1601318732966        at 
> hudson.remoting.PingThread.ping(PingThread.java:134)        at 
> hudson.remoting.PingThread.run(PingThread.java:90) *
>
>  
>
> TIA
>
>  
>

-- 
You received this message because you are subscribed to the Google Groups 
"Jenkins Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/jenkinsci-users/2eeea79b-b0bb-40cb-9181-94ca2361182bn%40googlegroups.com.

Reply via email to