[
https://issues.apache.org/jira/browse/STORM-1394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287468#comment-15287468
]
Kevin Conaway commented on STORM-1394:
--------------------------------------
There is some related discussion to this on STORM-946
> Netty Client never continue reconnection when worker started a moment ago.
> --------------------------------------------------------------------------
>
> Key: STORM-1394
> URL: https://issues.apache.org/jira/browse/STORM-1394
> Project: Apache Storm
> Issue Type: Bug
> Components: storm-core
> Reporter: Jingsong Lee
>
> Worker will wait until all connections are ok.
> This is a situation lead to worker hang forever.
> 1.worker A and B started.
> 2.worker A wait all connections are ok.
> 3.worker A connect to B.
> 4.B dead when A is not active(other connections are not ok).
> 5.B is launched by B' supervisor again (Assignment is not changed).
> 6.A hang forever because there is nobody to reconnect the client of A to B.
> We can fix this problem by 2 method.
> 1.Add closeChannelAndReconnect in Client' status
> Or
> 2.Add closeChannelAndReconnect in StormClientHandler' exceptionCaught
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)