[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16950685#comment-16950685
 ] 

MachelCheng edited comment on ZOOKEEPER-1330 at 10/14/19 3:13 AM:
------------------------------------------------------------------

Hi,[~hanm]  [~fpj]

In my project, I just encountered this problem. Leader election took 25 
seconds, but the follower took 374 seconds. The log is as follows:

2019-10-12 23:00:20,354 [myid:3] - INFO 
[QuorumPeer[myid=3]/0:0:0:0:0:0:0:0:2181:Leader@380] - LEADING - LEADER 
ELECTION TOOK - 25507
 2019-10-12 23:06:54,692 [myid:1] - INFO 
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:2181:Follower@65] - FOLLOWING - LEADER 
ELECTION TOOK - 374511

 

During this period, the client could not connect to zookeeper server, and there 
are many logs as follows:

WARN [NIOServerCxn.Factory: 0.0.0.0/0.0.0.0:2181:NIOServerCnxn@362] - Exception 
causing close of session 0x0 due to java.io.IOException: ZooKeeperServer not 
running

There are three questions, who can help explain:
 1. Why the time required to select a follower needs to be so long, and the 
time difference from the leader election is so large;
 2. When does zookeeper provide services normally? Need followers‘s election 
process is done?  Or just  the leader election is done?
 3. How to solve this problem?  This is the most important and I want to know 
the most.

 


was (Author: machelcheng):
Hi,[~hanm]  [~fpj]

In my project, I just encountered this problem. Leader election 25 seconds, but 
the follower took 374 seconds. The log is as follows:

2019-10-12 23:00:20,354 [myid:3] - INFO 
[QuorumPeer[myid=3]/0:0:0:0:0:0:0:0:2181:Leader@380] - LEADING - LEADER 
ELECTION TOOK - 25507
2019-10-12 23:06:54,692 [myid:1] - INFO 
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:2181:Follower@65] - FOLLOWING - LEADER 
ELECTION TOOK - 374511

 

During this period, the client could not connect to zookeeper server, and there 
are many logs as follows:

WARN [NIOServerCxn.Factory: 0.0.0.0/0.0.0.0:2181:NIOServerCnxn@362] - Exception 
causing close of session 0x0 due to java.io.IOException: ZooKeeperServer not 
running


There are three questions, who can help explain:
1. Why the time required to select a follower needs to be so long, and the time 
difference from the leader election is so large;
2. When does zookeeper provide services normally? Need followers‘s election 
process is done?  Or just  the leader election is done?
3. How to solve this problem?  This is the most important and I want to know 
the most.

 

> Zookeeper server not serving the client request even after completion of 
> Leader election
> ----------------------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-1330
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1330
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 3.4.0
>         Environment: 3 zk quorum
>            Reporter: amith
>            Priority: Minor
>             Fix For: 3.6.0, 3.5.7
>
>
> Have a cluster of 3 zookeepers
> 90 clients are connected to the server
> leader got killed and started
> the other 2 zookeeper started FLE and Leader was elected
> But its taking nearly 10 sec for this server to server requests and saying 
> "ZooKeeperServer not running" message..?
> Why is this even after Leader election SERVER IS NOT RUNNING !!!!!!!!!!
> 2011-12-19 16:12:29,732 [myid:2] - WARN  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2182:NIOServerCnxn@354] - Exception 
> causing close of session 0x0 due to java.io.IOException: ZooKeeperServer not 
> running
> 2011-12-19 16:12:29,733 [myid:2] - INFO  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2182:NIOServerCnxn@1000] - Closed 
> socket connection for client /10.18.47.148:51965 (no session established for 
> client)
> 2011-12-19 16:12:29,753 [myid:2] - INFO  
> [QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:2182:QuorumPeer@747] - LEADING
> 2011-12-19 16:12:29,762 [myid:2] - INFO  
> [QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:2182:Leader@58] - TCP NoDelay set to: true
> 2011-12-19 16:12:29,765 [myid:2] - INFO  
> [QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:2182:ZooKeeperServer@168] - Created 
> server with tickTime 2000 minSessionTimeout 4000 maxSessionTimeout 40000 
> datadir ../dataDir/version-2 snapdir ../dataDir/version-2
> 2011-12-19 16:12:29,766 [myid:2] - INFO  
> [QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:2182:Leader@294] - LEADING - LEADER 
> ELECTION TOOK - 4663
> 2011-12-19 16:12:29,776 [myid:2] - INFO  
> [QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:2182:FileSnap@83] - Reading snapshot 
> ../dataDir/version-2/snapshot.100013661
> 2011-12-19 16:12:29,831 [myid:2] - INFO  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2182:NIOServerCnxnFactory@213] - 
> Accepted socket connection from /10.18.47.148:51982
> 2011-12-19 16:12:29,831 [myid:2] - WARN  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2182:NIOServerCnxn@354] - Exception 
> causing close of session 0x0 due to java.io.IOException: ZooKeeperServer not 
> running
> 2011-12-19 16:12:29,832 [myid:2] - INFO  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2182:NIOServerCnxn@1000] - Closed 
> socket connection for client /10.18.47.148:51982 (no session established for 
> client)
> 2011-12-19 16:12:29,884 [myid:2] - INFO  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2182:NIOServerCnxnFactory@213] - 
> Accepted socket connection from /10.18.47.148:51989
> 2011-12-19 16:12:29,884 [myid:2] - WARN  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2182:NIOServerCnxn@354] - Exception 
> causing close of session 0x0 due to java.io.IOException: ZooKeeperServer not 
> running



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to