Matthew Hertz created ZOOKEEPER-3490:
----------------------------------------
Summary: Zookeeper follower's not reflecting writes (after months)
Key: ZOOKEEPER-3490
URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3490
Project: ZooKeeper
Issue Type: Bug
Affects Versions: 3.4.13
Reporter: Matthew Hertz
Hi,
We have a 3 node Zookeeper cluster. There are a number of znode's on the leader
that are not visible on the followers.
{code:java}
$ zkCli -server <server 1> (follower)
[zk: <server 1>(CONNECTED) 0] get
/pyMkdProducer/SNAP/lock/c4a62c9fdfdc412fac3818bbb2af3a0f__lock__0000000040
abcd.company.com:<built-in function getpid>
cZxid = 0xf00061d68
ctime = Thu Nov 01 12:40:33 GMT 2018
mZxid = 0xf00061d68
mtime = Thu Nov 01 12:40:33 GMT 2018
pZxid = 0xf00061d68
cversion = 0
dataVersion = 0
aclVersion = 0
ephemeralOwner = 0x500be5318d60407
dataLength = 58
numChildren = 0
{code}
{code:java}
$ zkCli -server <server 2> (leader)
[zk: <server2>(CONNECTED) 0] get
/pyMkdProducer/SNAP/lock/c4a62c9fdfdc412fac3818bbb2af3a0f__lock__0000000040
Node does not exist:
/pyMkdProducer/SNAP/lock/c4a62c9fdfdc412fac3818bbb2af3a0f__lock__0000000040
{code}
{code:java}
$ zkCli -server <server 3> (follower)
[zk: <server3>(CONNECTED) 0] get
/pyMkdProducer/SNAP/lock/c4a62c9fdfdc412fac3818bbb2af3a0f__lock__0000000040
abcd.company.com:<built-in function getpid>
cZxid = 0xf00061d68
ctime = Thu Nov 01 12:40:33 GMT 2018
mZxid = 0xf00061d68
mtime = Thu Nov 01 12:40:33 GMT 2018
pZxid = 0xf00061d68
cversion = 0
dataVersion = 0
aclVersion = 0
ephemeralOwner = 0x500be5318d60407
dataLength = 58
numChildren = 0
{code}
These nodes are ephemeral nodes. The sessions no longer exist. There are 6
znodes in this 'inconsistent' state. The cluster is currently connected - there
are no networking partitions currently.
We're at a loss for how to both debug and fix this. Restarting the Zookeeper
followers presumably will not help? Are all nodes ever force-synced from the
leader?
Help would be appreciated.
Thanks
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)