[ 
https://issues.apache.org/jira/browse/HADOOP-11780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15526395#comment-15526395
 ] 

Kihwal Lee commented on HADOOP-11780:
-------------------------------------

The patch looks good. We have been running with this for a long time. The only 
difference is the addition of jvm termination, which was done to take care of 
other cases reported HADOOP-13657.  [~zhz], are you okay with this patch?

> Prevent IPC reader thread death
> -------------------------------
>
>                 Key: HADOOP-11780
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11780
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: ipc
>    Affects Versions: 2.0.0-alpha
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>            Priority: Critical
>         Attachments: HADOOP-11780.patch
>
>
> Reader threads can die to a race condition with the responder thread.  If the 
> server's ipc handler cannot send a response in one write, it delegates 
> sending the rest of the response to the responder thread.
> The race occurs when the responder thread has an exception writing to the 
> socket.  The responder closes the socket.  This wakes up the reader polling 
> on the socket.  If a {{CancelledKeyException}} is thrown, which is a runtime 
> exception, the reader dies.  All connections serviced by that reader are now 
> in limbo until the client possibly times out.  New connections play roulette 
> as to whether they are assigned to a defunct reader.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to