[ https://issues.apache.org/jira/browse/HADOOP-11780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15526395#comment-15526395 ]
Kihwal Lee commented on HADOOP-11780:
-------------------------------------

The patch looks good. We have been running with this for a long time. The only difference is the addition of JVM termination, which was added to take care of the other cases reported in HADOOP-13657. [~zhz], are you okay with this patch?

> Prevent IPC reader thread death
> -------------------------------
>
>                 Key: HADOOP-11780
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11780
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: ipc
>    Affects Versions: 2.0.0-alpha
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>            Priority: Critical
>         Attachments: HADOOP-11780.patch
>
>
> Reader threads can die due to a race condition with the responder thread. If the
> server's IPC handler cannot send a response in one write, it delegates
> sending the rest of the response to the responder thread.
> The race occurs when the responder thread gets an exception while writing to the
> socket. The responder closes the socket. This wakes up the reader polling
> on the socket. If a {{CancelledKeyException}} is thrown, which is a runtime
> exception, the reader dies. All connections serviced by that reader are now
> in limbo until the client possibly times out. New connections play roulette
> as to whether they are assigned to a defunct reader.
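
For readers unfamiliar with the failure mode in the issue description, here is a minimal sketch of it using plain java.nio. It is not the code from HADOOP-11780.patch and does not use Hadoop's ipc.Server classes; the class name ReaderGuardSketch and the methods handleKey and demo are hypothetical, and a Pipe stands in for the client socket. It shows that closing a registered channel cancels its selection key, that touching the key afterwards throws the unchecked CancelledKeyException, and that catching it per key lets the polling loop carry on instead of dying.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.channels.CancelledKeyException;
import java.nio.channels.Pipe;
import java.nio.channels.SelectionKey;
import java.nio.channels.Selector;

// Hypothetical illustration only; not taken from the Hadoop patch.
final class ReaderGuardSketch {

    /**
     * Processes one selection key the way a reader loop might.
     * Returns true if the key was usable, false if it had been
     * cancelled by another thread closing the channel.
     */
    static boolean handleKey(SelectionKey key) {
        try {
            // isReadable() throws CancelledKeyException on a cancelled key.
            if (key.isReadable()) {
                // A real reader would read from the connection here.
            }
            return true;
        } catch (CancelledKeyException e) {
            // Runtime exception: if it escaped, the reader thread would die.
            // Swallow it for this one connection and keep the loop alive.
            return false;
        }
    }

    /** Registers a channel, closes it, then handles the stale key. */
    static boolean demo() {
        try (Selector selector = Selector.open()) {
            Pipe pipe = Pipe.open();
            pipe.source().configureBlocking(false);
            SelectionKey key =
                pipe.source().register(selector, SelectionKey.OP_READ);

            // Stands in for the responder closing the socket after a
            // failed write; closing the channel cancels its keys.
            pipe.source().close();
            pipe.sink().close();

            return handleKey(key);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("key still usable: " + demo());
    }
}
```

The guard sits around the per-key work rather than around the whole loop, so one bad connection does not affect the others served by the same reader. Whether to additionally terminate the JVM on unexpected errors, as the comment above mentions, is a separate policy choice that this sketch does not model.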