Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-13 Thread Jan Friesse
Олег Самойлов wrote: On 13 Aug 2019, at 15:55, Jan Friesse wrote: There is going to be a slightly different solution (set these timeouts based on the corosync token timeout) which I'm working on, but it's a huge amount of work and not super high priority (a workaround exists), so no ETA

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-13 Thread Олег Самойлов
> On 13 Aug 2019, at 15:55, Jan Friesse wrote:
>
> There is going to be a slightly different solution (set these timeouts based on the corosync token timeout) which I'm working on, but it's a huge amount of work and not super high priority (a workaround exists), so no ETA yet.

Is it will b

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-13 Thread Jan Friesse
Олег Самойлов wrote: On 12 Aug 2019, at 8:46, Jan Friesse wrote: Let me try to shed some light on this: - dpd_interval is a qnetd variable that controls how often qnetd walks through the list of all clients (qdevices) and checks the timestamp of the last sent message. If diff between current timestamp an
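
The point that dpd_interval is a sweep period rather than a ping period can be illustrated with a small sketch. This is not qnetd code: the function name, the `timeout_ms` threshold, and the rule "a client is stale once the gap exceeds the threshold" are assumptions made for illustration, since the snippet above is cut off before it states the actual comparison. It only shows that, when staleness is checked once per sweep, a silent client is noticed at the first sweep after the threshold has passed, so a large sweep period delays detection.

```python
def first_detecting_sweep_ms(last_msg_ms: int, timeout_ms: int, dpd_interval_ms: int) -> int:
    """Return the time (ms) of the first sweep that sees the client as stale.

    Assumptions (hypothetical, not taken from qnetd):
    - sweeps run at exact multiples of dpd_interval_ms, starting at 0;
    - a client is stale at sweep time t when t - last_msg_ms > timeout_ms.
    """
    deadline = last_msg_ms + timeout_ms
    # Smallest multiple of the sweep period that is strictly after the deadline.
    return (deadline // dpd_interval_ms + 1) * dpd_interval_ms


if __name__ == "__main__":
    # Same silent client, two different sweep periods.
    for interval in (1000, 60000):
        print(interval, first_detecting_sweep_ms(0, 10000, interval))
```

Under these assumptions, raising the sweep period does not change how often messages are sent; it only postpones the moment at which a silent client is noticed.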

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-13 Thread Олег Самойлов
> On 12 Aug 2019, at 8:46, Jan Friesse wrote:
>
> Let me try to shed some light on this:
>
> - dpd_interval is a qnetd variable that controls how often qnetd walks through the list of all clients (qdevices) and checks the timestamp of the last sent message. If diff between current timestamp and last sent me

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-12 Thread Jan Friesse
Andrei Borzenkov wrote: Sent from iPhone On 12 Aug 2019, at 8:46, Jan Friesse wrote: Олег Самойлов wrote: On 9 Aug 2019, at 9:25, Jan Friesse wrote: Please do not set dpd_interval that high. dpd_interval on the qnetd side is not about how often the ping is sent. Could

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-11 Thread Andrei Borzenkov
Sent from iPhone

> On 12 Aug 2019, at 8:46, Jan Friesse wrote:
>
> Олег Самойлов wrote:
>>> On 9 Aug 2019, at 9:25, Jan Friesse wrote:
>>> Please do not set dpd_interval that high. dpd_interval on the qnetd side is not about how often the ping is sent. Could you please re

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-11 Thread Jan Friesse
Олег Самойлов wrote: On 9 Aug 2019, at 9:25, Jan Friesse wrote: Please do not set dpd_interval that high. dpd_interval on the qnetd side is not about how often the ping is sent. Could you please retry your test with dpd_interval=1000? I'm pretty sure it will work then. Honza Yep.

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-09 Thread Олег Самойлов
> On 9 Aug 2019, at 9:25, Jan Friesse wrote:
> Please do not set dpd_interval that high. dpd_interval on the qnetd side is not about how often the ping is sent. Could you please retry your test with dpd_interval=1000? I'm pretty sure it will work then.
>
> Honza

Yep. As far as I unde

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-09 Thread Jan Friesse
Andrei Borzenkov wrote: On Fri, Aug 9, 2019 at 9:25 AM Jan Friesse wrote: Олег Самойлов wrote: Hello all. I have a test bed with several virtual machines to test pacemaker. I simulate a random failure on one of the nodes. The cluster will be spread over several data centres, so there is no st

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-09 Thread Andrei Borzenkov
On Fri, Aug 9, 2019 at 9:25 AM Jan Friesse wrote:
>
> Олег Самойлов wrote:
> > Hello all.
> >
> > I have a test bed with several virtual machines to test pacemaker. I simulate a random failure on one of the nodes. The cluster will be spread over several data centres, so there is no stonith devi

Re: [ClusterLabs] Strange lost quorum with qdevice

2019-08-08 Thread Jan Friesse
Олег Самойлов wrote: Hello all. I have a test bed with several virtual machines to test pacemaker. I simulate a random failure on one of the nodes. The cluster will be spread over several data centres, so there is no stonith device; instead I use qnetd in the third data centre and a watchdog (softdog)

[ClusterLabs] Strange lost quorum with qdevice

2019-08-08 Thread Олег Самойлов
Hello all. I have a test bed with several virtual machines to test pacemaker. I simulate a random failure on one of the nodes. The cluster will be spread over several data centres, so there is no stonith device; instead I use qnetd in the third data centre and a watchdog (softdog). And sometimes (not always