[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Patrick Hunt updated ZOOKEEPER-313:
-----------------------------------

    Fix Version/s: 3.1.1

> Problem with successive leader failures when no client is connected 
> --------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-313
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-313
>             Project: Zookeeper
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 3.0.0, 3.0.1
>         Environment: all
>            Reporter: Sunanda Bera
>             Fix For: 3.1.1
>
>
> Steps to reproduce:
> Create a 3 node cluster . Run some transactions and then stop all clients. 
> Make sure no other clients connect for the duration of the test.
> Let L1 be the current leader. Bring down L1. Let L2 be the leader chosen.  
> Let the third node be N3. Note that this will increase the txn id for N3's 
> snapshot without any  transaction being logged. Now bring up L1 -- same will 
> happen for L1. Now bring down L2.
> Both N3 and L1 now have snapshots with a transaction id greater than the last 
> logged transaction. Whoever is elected leader will try to restore its state 
> from the filesystem and fail.
> One easy workaround is obviously to change the FileTxnSnapLog not to save a 
> snapshot if zxid > last logged zxid. The correct solution is possibly to log 
> a transaction for leader election as well.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to