[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802819#action_12802819
 ] 

Qian Ye commented on ZOOKEEPER-650:
-----------------------------------

Hi all, here is some more information about this problem. I found that the 
state of the election ports on the three working servers is strange. For 
example, on server 10.23.150.29:

$ netstat -anp | grep 8888
(Not all processes could be identified, non-owned process info
 will not be shown, you would have to be root to see it all.)
tcp        0      0 0.0.0.0:8888                0.0.0.0:*                   LISTEN      -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23933          CLOSE_WAIT  -
tcp   157577      0 10.23.150.29:8888           10.65.27.21:10482           CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23929          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23672          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23671          CLOSE_WAIT  -
tcp       41      0 10.23.150.29:8888           10.23.247.141:10790         CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23669          CLOSE_WAIT  -
tcp      136      0 10.23.150.29:8888           10.23.247.141:10791         ESTABLISHED -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23668          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23667          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23923          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23666          CLOSE_WAIT  -
tcp       73      0 10.23.150.29:8888           10.23.247.141:10786         CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23664          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23663          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23662          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23661          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23660          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23659          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23656          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23651          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23648          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23647          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23646          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23643          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23642          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23640          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23639          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23638          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23637          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23636          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23635          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23634          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23633          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23630          CLOSE_WAIT  -
tcp        6      0 10.23.150.29:8888           10.23.253.43:23620          CLOSE_WAIT  -
tcp      617      0 10.23.150.29:8888           10.65.27.21:28984           CLOSE_WAIT  -
tcp        0      0 10.23.150.29:10593          10.23.253.43:8888           CLOSE_WAIT  -
tcp       51      0 10.23.150.29:8888           10.23.253.43:23712          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23703          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23702          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23699          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23697          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23695          CLOSE_WAIT  -
tcp        1      0 10.23.150.29:8888           10.65.27.21:29185           CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23694          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23692          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23690          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23688          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23686          CLOSE_WAIT  -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23685          CLOSE_WAIT  -
tcp        0      0 10.23.150.29:8888           10.65.20.68:10581           ESTABLISHED -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23684          CLOSE_WAIT  -

There are so many sockets in the CLOSE_WAIT state that the election port 
cannot accept any more connections; the backlog limit has been reached. Notice 
that the receive queues of most of these sockets are not empty. It seems that 
the process is hung somewhere. The other two working servers are in the same 
situation. I think there is no choice but to restart the servers to get rid of 
the problem, right?
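
For anyone who wants to check the same thing on their own servers, here is a 
rough Python sketch that tallies the socket states on the election port and 
counts the CLOSE_WAIT sockets that still have unread data in the receive queue. 
The function name summarize and the SAMPLE text are only illustrative (SAMPLE 
is a handful of rows taken from the output above), and the script assumes each 
socket is on a single line, so wrapped lines need to be rejoined first.

```python
from collections import Counter


def summarize(netstat_text, port=8888):
    """Tally TCP states for sockets whose local port is `port`.

    Returns (state_counts, close_wait_with_pending_data).
    Assumes `netstat -an` style rows, one socket per line.
    """
    states = Counter()
    pending = 0
    for line in netstat_text.splitlines():
        fields = line.split()
        # Expected columns: proto recv-q send-q local foreign state [pid/program]
        if len(fields) < 6 or not fields[0].startswith("tcp"):
            continue
        recv_q, local, state = fields[1], fields[3], fields[5]
        if not recv_q.isdigit():
            continue
        # Only count sockets bound to the election port on the local side.
        if not local.endswith(":%d" % port):
            continue
        states[state] += 1
        if state == "CLOSE_WAIT" and int(recv_q) > 0:
            pending += 1
    return states, pending


# A few rows copied from the netstat output in this comment.
SAMPLE = """\
tcp        0      0 0.0.0.0:8888                0.0.0.0:*                   LISTEN      -
tcp        9      0 10.23.150.29:8888           10.23.253.43:23933          CLOSE_WAIT  -
tcp   157577      0 10.23.150.29:8888           10.65.27.21:10482           CLOSE_WAIT  -
tcp      136      0 10.23.150.29:8888           10.23.247.141:10791         ESTABLISHED -
tcp        0      0 10.23.150.29:10593          10.23.253.43:8888           CLOSE_WAIT  -
"""

if __name__ == "__main__":
    counts, pending = summarize(SAMPLE)
    print(dict(counts), pending)
```

A large CLOSE_WAIT count with non-zero receive queues would be consistent with 
the process not reading from or closing those sockets, though the script by 
itself cannot tell where the process is stuck.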

> Servers cannot join in quorum
> -----------------------------
>
>                 Key: ZOOKEEPER-650
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-650
>             Project: Zookeeper
>          Issue Type: Bug
>          Components: quorum
>    Affects Versions: 3.2.1
>         Environment: Linux 2.6.9 x86_64
>            Reporter: Qian Ye
>
> Server fails to join ensemble.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
