[ 
https://issues.apache.org/jira/browse/HADOOP-12605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15056880#comment-15056880
 ] 

Xiao Chen commented on HADOOP-12605:
------------------------------------

Hi [~iwasakims],
Thanks for the explanation and the patch. Sorry for my delayed response.

I'm not sure about relaxing the check between 
{{server.getNumOpenConnections()}} and {{maxAccept}} - that would no longer 
check the intention of 'we didn't go over'. I hope the original author [~daryn] 
could take a look and advice.

Also, the failure happens for {{checkBlocking(1, 1, 1)}}, so I'll discuss this 
case here for simplicity. With the {{firstCallLatch}} in place in the test, I 
didn't find anywhere the server can accept more connections. Could you further 
explain your theory?
>From my understanding, the call goes briefly like this: client call -> 
>server's listener -> add to readerQ -> reader take from readerQ -> put to 
>callQ -> handler take from callQ -> call (blocked by the latch). In the (1, 1, 
>1) case we only have 1 reader with readerQ size=1, and callQ size =1, so IMO 
>neither the round robin nor the loop of taking pendingConnections should 
>contribute anything.

I'm trying to hack the test to result in a similar failure but haven't 
succeeded so far. [~iwasakims], would you please share some reproduce code if 
possible? Thanks very much.

> Fix intermittent failure of TestIPC.testIpcWithReaderQueuing
> ------------------------------------------------------------
>
>                 Key: HADOOP-12605
>                 URL: https://issues.apache.org/jira/browse/HADOOP-12605
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: test
>            Reporter: Masatake Iwasaki
>            Assignee: Masatake Iwasaki
>            Priority: Minor
>         Attachments: HADOOP-12605.001.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to