Scomocouk opened a new pull request, #3739:
URL: https://github.com/apache/storm/pull/3739

   ## What is the purpose of the change
   
   Fixes STORM-4104 to improve the stability of the Pacemaker server (and the 
Netty server in workers), bringing it closer to the stability it had in 
Storm 1. Utils.handleUncaughtException now also swallows exceptions derived 
from those in the supplied ALLOWED_EXCEPTIONS list, instead of only the exact 
exception classes listed there. For example, instead of only 'swallowing' 
IOException itself, this change also allows SocketException (derived from 
IOException) to be 'swallowed'.
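   
   The difference between the two matching rules can be illustrated with a small, self-contained sketch. This is not the Storm code: the class name, the `ALLOWED` set, and both helper methods are hypothetical and exist only to contrast an exact-class check with a subclass-aware one.

```java
import java.io.IOException;
import java.net.SocketException;
import java.util.Set;

public class AllowedExceptionsSketch {
    // Hypothetical stand-in for ALLOWED_EXCEPTIONS; the real list in Storm may differ.
    static final Set<Class<? extends Throwable>> ALLOWED = Set.of(IOException.class);

    // Exact-match rule: only the listed classes themselves qualify.
    static boolean isAllowedExact(Throwable t) {
        return ALLOWED.contains(t.getClass());
    }

    // Subclass-aware rule: a listed class or anything derived from it qualifies.
    static boolean isAllowedDerived(Throwable t) {
        for (Class<? extends Throwable> allowed : ALLOWED) {
            if (allowed.isAssignableFrom(t.getClass())) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        Throwable reset = new SocketException("Connection reset");
        System.out.println(isAllowedExact(reset));   // prints false
        System.out.println(isAllowedDerived(reset)); // prints true
    }
}
```

   Under the exact-match rule a `SocketException` is not recognised even though `IOException` is listed, whereas the subclass-aware rule accepts it.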
   
   ## How was the change tested
   
   Before this change, it was easy to make the Pacemaker server shut down 
simply by killing a topology:
   2024-11-06 15:21:13.251 [o.a.s.m.n.StormServerHandler] ERROR: server errors in handling the request from /10.44.19.106:59691
   java.net.SocketException: Connection reset
       at sun.nio.ch.SocketChannelImpl.throwConnectionReset(SocketChannelImpl.java:401) ~[?:?]
   <...snip...>
   2024-11-06 15:21:13.270 [o.a.s.m.n.StormServerHandler] INFO: Received error in netty thread.. terminating server...
   
   With this fix, the Pacemaker server logs that it received a SocketException 
and is 'swallowing' it, and the server continues to run:
   2024-11-06 16:19:46.860 [o.a.s.u.Utils] INFO: Swallowing class java.net.SocketException java.net.SocketException: Connection reset
   
   Unit tests were also added for the new behaviour of Utils.handleUncaughtException.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
