Hi Team,

I am running a Jenkins server on an AWS instance. It has several kinds of 
nodes (AWS EC2 instances, plus Linux and Windows nodes).
We use this Jenkins server to run unit tests (UT) and functional tests to 
qualify the builds.

We frequently hit the "Slave went offline during the build" issue. 
(Note: it is less frequent on the EC2 instances.)

Jenkins version : 1.658
OS : Ubuntu 14.04.2 LTS

I need some help fixing the "Slave went offline during the build" issue.

We see different error messages in the log when a slave goes offline. Of 
those, the following two occur most frequently:

1) SEVERE: I/O error in channel followed by Unexpected termination of the 
channel
2) TcpSlaveAgentListener$ConnectionHandler error followed by 
Protocol:JNLP3-connect (mostly on Windows machines)


Whenever a job fails because the slave went offline, we see one of the 
two messages above.

Please see the logs below for reference.

Note: 1) We raised an internal IT ticket to monitor packet loss between the 
AWS instance and our company network, and IT confirmed that there was no 
packet loss.
      2) We raised an AWS Support ticket to check whether anything needs 
tuning on the AWS side, and AWS confirmed that everything is fine.

We have also tried the following:

1) Connecting the nodes on demand, with a 1-minute delay
2) Disabling "Response Time" in the preventive node monitoring settings

But we are still observing this issue.

It would be great if somebody could guide me in fixing this issue.


Thanks,
Gopal

SEVERE: I/O error in channel followed by Unexpected termination of the channel


Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:42:22 PM 
hudson.remoting.SynchronousCommandTransport$ReaderThread run
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: SEVERE: I/O error in channel 
10_39_19_5_Client_Node
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: java.io.IOException: Unexpected 
termination of the channel
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:50)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: Caused by: java.io.EOFException
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream$PeekInputStream.readFully(ObjectInputStream.java:2335)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream$BlockDataInputStream.readShort(ObjectInputStream.java:2804)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream.readStreamHeader(ObjectInputStream.java:802)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream.<init>(ObjectInputStream.java:299)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.ObjectInputStreamEx.<init>(ObjectInputStreamEx.java:48)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:34)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:48)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]:
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:42:22 PM 
net.bull.javamelody.JavaLogger warn
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: WARNING: exception while collecting 
data
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: 
hudson.remoting.ChannelClosedException: channel is already closed
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.Channel.send(Channel.java:578)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.Request.callAsync(Request.java:205)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.Channel.callAsync(Channel.java:807)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
net.bull.javamelody.RemoteCallHelper.collectDataByNodeName(RemoteCallHelper.java:159)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
net.bull.javamelody.RemoteCallHelper.collectJavaInformationsListByName(RemoteCallHelper.java:183)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
net.bull.javamelody.NodesCollector.collectWithoutErrorsNow(NodesCollector.java:154)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
net.bull.javamelody.NodesCollector.collectWithoutErrors(NodesCollector.java:143)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
net.bull.javamelody.NodesCollector$2.run(NodesCollector.java:111)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
java.util.TimerThread.mainLoop(Timer.java:555)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
java.util.TimerThread.run(Timer.java:505)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: Caused by: java.io.IOException
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.Channel.close(Channel.java:1163)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.slaves.ChannelPinger$1.onDead(ChannelPinger.java:118)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.PingThread.ping(PingThread.java:126)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.PingThread.run(PingThread.java:85)
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: Caused by: 
java.util.concurrent.TimeoutException: Ping started at 1491175712681 hasn't 
completed by 1491175952681
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: #011... 2 more
Apr  2 16:42:22 ip-172-28-5-15 [jenkins]:
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:42:24 PM 
hudson.remoting.SynchronousCommandTransport$ReaderThread run
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: SEVERE: I/O error in channel gkrishnan
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: java.io.IOException: Unexpected 
termination of the channel
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:50)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: Caused by: java.io.EOFException
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream$PeekInputStream.readFully(ObjectInputStream.java:2335)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream$BlockDataInputStream.readShort(ObjectInputStream.java:2804)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream.readStreamHeader(ObjectInputStream.java:802)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream.<init>(ObjectInputStream.java:299)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.ObjectInputStreamEx.<init>(ObjectInputStreamEx.java:48)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.AbstractSynchronousByteArrayCommandTransport.read(AbstractSynchronousByteArrayCommandTransport.java:34)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:48)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]:
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:42:24 PM 
hudson.remoting.SynchronousCommandTransport$ReaderThread run
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: SEVERE: I/O error in channel 
FC23_sai_workstation
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: java.io.IOException: Unexpected 
termination of the channel
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
hudson.remoting.SynchronousCommandTransport$ReaderThread.run(SynchronousCommandTransport.java:50)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: Caused by: java.io.EOFException
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream$PeekInputStream.readFully(ObjectInputStream.java:2335)
Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: #011at 
java.io.ObjectInputStream$BlockDataInputStream.readShort(ObjectInputStream.java:2804)
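
For reference, the two numbers in the TimeoutException above ("Ping started 
at ... hasn't completed by ...") look like milliseconds since the Unix 
epoch. Under that assumption, here is a minimal Python sketch of the gap 
between them. The constant and function names are only illustrative.

```python
# Timestamps copied from the TimeoutException in the log above.
# Assumption: both are milliseconds since the Unix epoch.
PING_STARTED_MS = 1491175712681
PING_DEADLINE_MS = 1491175952681


def gap_seconds(start_ms, end_ms):
    """Return the time between two epoch-millisecond stamps, in seconds."""
    return (end_ms - start_ms) / 1000.0


if __name__ == "__main__":
    # Prints 240.0, i.e. 4 minutes.
    print(gap_seconds(PING_STARTED_MS, PING_DEADLINE_MS))
```

If that reading is right, the channel was closed after a ping went 
unanswered for 4 minutes (the ChannelPinger onDead frame in the trace), 
which suggests a longer stall rather than a momentary drop.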



TcpSlaveAgentListener$ConnectionHandler error followed by 
Protocol:JNLP3-connect (mostly on Windows machines)



Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: INFO: Accepted connection #1,013 from 
/10.36.199.17:4584
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:03 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler error
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: WARNING: Connection #1013 is aborted: 
Unknown protocol:Protocol:JNLP3-connect
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:03 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler run
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: INFO: Accepted connection #1,014 from 
/10.36.199.17:4585
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:03 PM 
jenkins.slaves.DefaultJnlpSlaveReceiver handle
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: INFO: Disconnecting 
win_strawman_instance as we are reconnected from the current peer
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:03 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler run
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: INFO: Accepted connection #1,015 from 
/10.36.200.16:60078
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:03 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler error
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: WARNING: Connection #1015 is aborted: 
Unknown protocol:Protocol:JNLP3-connect
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:03 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler run
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: INFO: Accepted connection #1,016 from 
/10.36.200.16:60079
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:03 PM 
jenkins.slaves.DefaultJnlpSlaveReceiver handle
Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: INFO: Disconnecting 
ATEP_UI_Windows_8_10_36_200_16 as we are reconnected from the current peer
Apr  2 16:27:09 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:09 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler run
Apr  2 16:27:09 ip-172-28-5-15 [jenkins]: INFO: Accepted connection #1,017 from 
/10.36.200.17:60446
Apr  2 16:27:09 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:09 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler run
Apr  2 16:27:09 ip-172-28-5-15 [jenkins]: INFO: Accepted connection #1,018 from 
/10.36.198.17:64140
Apr  2 16:27:09 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:09 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler error
Apr  2 16:27:09 ip-172-28-5-15 [jenkins]: WARNING: Connection #1017 is aborted: 
Unknown protocol:Protocol:JNLP3-connect
Apr  2 16:27:09 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:09 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler error
Apr  2 16:27:09 ip-172-28-5-15 [jenkins]: WARNING: Connection #1018 is aborted: 
Unknown protocol:Protocol:JNLP3-connect
Apr  2 16:27:09 ip-172-28-5-15 [jenkins]: Apr 02, 2017 4:27:09 PM 
hudson.TcpSlaveAgentListener$ConnectionHandler run
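
To show how often each signature occurs, here is a rough Python sketch that 
tallies both messages from the Jenkins log. It assumes each log entry sits 
on a single line (the excerpts above were wrapped by the mail client), and 
the function name tally is only illustrative.

```python
import re
from collections import Counter

# Patterns for the two signatures quoted in this message.
CHANNEL_ERROR = re.compile(r"SEVERE: I/O error in channel\s+(\S+)")
ABORTED = re.compile(r"Connection #[\d,]+ is aborted: Unknown protocol:(\S+)")


def tally(lines):
    """Count channel I/O errors per node and aborted connections per protocol."""
    channel_errors = Counter()
    aborted = Counter()
    for line in lines:
        match = CHANNEL_ERROR.search(line)
        if match:
            channel_errors[match.group(1)] += 1
            continue
        match = ABORTED.search(line)
        if match:
            aborted[match.group(1)] += 1
    return channel_errors, aborted


if __name__ == "__main__":
    # Sample entries taken from the excerpts above, joined onto one line each.
    sample = [
        "Apr  2 16:42:22 ip-172-28-5-15 [jenkins]: SEVERE: I/O error in "
        "channel 10_39_19_5_Client_Node",
        "Apr  2 16:42:24 ip-172-28-5-15 [jenkins]: SEVERE: I/O error in "
        "channel gkrishnan",
        "Apr  2 16:27:03 ip-172-28-5-15 [jenkins]: WARNING: Connection #1013 "
        "is aborted: Unknown protocol:Protocol:JNLP3-connect",
    ]
    errors, aborts = tally(sample)
    print(dict(errors))
    print(dict(aborts))
```

Running it over the full log file instead of the sample would give 
per-node counts, which may help show whether particular nodes are affected 
more than others.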
