Oh god
root@bhs1-mail03-ds03:~# zgrep "lease" /var/log/ceph/*.gz
/var/log/ceph/ceph-mon.bhs1-mail03-ds03.log.2.gz:2017-08-24 06:39:22.384112 7f44c60f1700 1 mon.bhs1-mail03-ds03@2(peon).paxos(paxos updating c 8973251..8973960) lease_timeout -- calling new election
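To see how often these elections are triggered, the matches can be tallied per day. Below is a minimal sketch, assuming every monitor log line carries a timestamp in the same format as the one above; the function names (`count_lease_timeouts`, `scan_logs`) are made up for illustration and are not part of any Ceph tooling.

```python
import glob
import gzip
import re
from collections import Counter

# Timestamp format assumed from the log line quoted above:
# "2017-08-24 06:39:22.384112"
TIMESTAMP = re.compile(r"(\d{4}-\d{2}-\d{2}) \d{2}:\d{2}:\d{2}\.\d+")


def count_lease_timeouts(lines):
    """Count lines mentioning lease_timeout, grouped by calendar day."""
    per_day = Counter()
    for line in lines:
        if "lease_timeout" not in line:
            continue
        match = TIMESTAMP.search(line)
        if match:
            per_day[match.group(1)] += 1
    return per_day


def scan_logs(pattern="/var/log/ceph/*.gz"):
    """Hypothetical helper: tally lease_timeout events across rotated logs."""
    total = Counter()
    for path in sorted(glob.glob(pattern)):
        # errors="replace" keeps the scan going if a log has stray bytes
        with gzip.open(path, "rt", errors="replace") as handle:
            total.update(count_lease_timeouts(handle))
    return total
```

Calling `scan_logs()` on a monitor host and printing the result sorted by date should show whether the timeouts cluster around a particular time, which would help line them up with other events on the cluster.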
I really need some help through this.
This is happening very frequently and I can't seem to figure out why.
My services rely on CephFS, and when this happens, the MDS suicides.
It's always the same; see the logs from the last occurrence:
host bhs1-mail03-ds03:
2017-08-19 06:35:54.072809 7f44c60f1700 1
Hi David,
thanks for your feedback.
With that in mind, I did remove a 15 TB RBD pool about an hour before this
happened.
I wouldn't think it would be related to this because there was nothing
different going on after I removed it. Not even high system load.
But considering what you said, I
I just want to point out that there are many different types of network
issues that don't involve entire networks: a bad NIC, a bad or loose cable, a
service on a server restarting or modifying the network stack, etc.
That said, there are other things that can prevent an MDS service, or any
service from