[jira] [Commented] (ZOOKEEPER-1777) Missing ephemeral nodes in one of the members of the ensemble

JIRA Fri, 04 Oct 2013 02:10:51 -0700

    [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13786011#comment-13786011
 ]


Germán Blanco commented on ZOOKEEPER-1777:
------------------------------------------

Hello Flavio,
I am sure that for me is a worrying case, I can't tell if it is a valid case 
for ZooKeeper in general.
The faults are the following (for a cluster of three nodes):
- One server fails, and stops for a while.
- During this while another server fails and loses all data.
If e.g. you have data in RAM, losing all data with a failure (and not losing it 
with other failures) may be quite frequent.
I agree that this requires failure of 2 out of three nodes, which should not 
happen, but in any case a result of permanent inconsistency is not acceptable 
to me. Nodes can be taken out for maintenance and the administrator will bring 
nodes up and down and never think that if he does so carelessly the system will 
be broken forever when it is taken out of maintenance.

> Missing ephemeral nodes in one of the members of the ensemble
> -------------------------------------------------------------
>
>                 Key: ZOOKEEPER-1777
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1777
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: quorum
>    Affects Versions: 3.4.5
>         Environment: Linux, Java 1.7
>            Reporter: Germán Blanco
>            Assignee: Germán Blanco
>            Priority: Blocker
>             Fix For: 3.4.6, 3.5.0
>
>         Attachments: logs_trunk.tar.gz, snaps.tar, ZOOKEEPER-1777.tar.gz
>
>
> In a 3-servers ensemble, one of the followers doesn't see part of the 
> ephemeral nodes that are present in the leader and the other follower. 
> The 8 missing nodes in "the follower that is not ok" were created in the end 
> of epoch 1, the ensemble is running in epoch 2.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (ZOOKEEPER-1777) Missing ephemeral nodes in one of the members of the ensemble

Reply via email to