Re: [ClusterLabs] Redundant ring not recovering after node is back

2018-08-22 Thread Jan Friesse
David, Hello, Im getting crazy about this problem, that I expect to resolve here, with your help guys: I have 2 nodes with Corosync redundant ring feature. Each node has 2 similarly connected/configured NIC's. Both nodes are connected each other by two crossover cables. I believe this is roo

Re: [ClusterLabs] Antw: Re: Spurious node loss in corosync cluster

2018-08-22 Thread Jan Friesse
Prasad, Hi - My systems are single core cpu VMs running on azure platform. I am Ok, now it make sense. I don't think you get too much guarantees in the cloud environment so quite a large scheduling pause simply can happen. Also single core CPU is kind of "unsupported" today. running MySQ

[ClusterLabs] Q: (SLES11 SP4) lrm_rsc_op without last-run?

2018-08-22 Thread Ulrich Windl
Hi! Many years ago I wrote a parser that could format the CIB XML in a flexible way. Today I used it again to print some statistics for "exec-time". Thereby I discovered one operation that has a valid "exec-time", a valid "last-rc-change", but no "last-run". All other operations had "last-run".

Re: [ClusterLabs] Redundant ring not recovering after node is back

2018-08-22 Thread Andrei Borzenkov
22.08.2018 15:53, David Tolosa пишет: > Hello, > Im getting crazy about this problem, that I expect to resolve here, with > your help guys: > > I have 2 nodes with Corosync redundant ring feature. > > Each node has 2 similarly connected/configured NIC's. Both nodes are > connected each other by t

Re: [ClusterLabs] Redundant ring not recovering after node is back

2018-08-22 Thread Emmanuel Gelati
Sorry a Typo I think you are mixing interface with nodelist http://clusterlabs.org/ pacemaker/doc/en-US/Pacemaker/1.1/html/Clusters_from_ Scratch/_sample_corosync_configuration.html 2018-08-22 22:20 GMT+02:00 Emmanuel Gelati : > I think you are missing interface with nodelist http://clusterlabs.

Re: [ClusterLabs] Redundant ring not recovering after node is back

2018-08-22 Thread Emmanuel Gelati
I think you are missing interface with nodelist http://clusterlabs.org/pacemaker/doc/en-US/Pacemaker/1.1/html/Clusters_from_Scratch/_sample_corosync_configuration.html 2018-08-22 14:53 GMT+02:00 David Tolosa : > Hello, > Im getting crazy about this problem, that I expect to resolve here, with > y

[ClusterLabs] Redundant ring not recovering after node is back

2018-08-22 Thread David Tolosa
Hello, Im getting crazy about this problem, that I expect to resolve here, with your help guys: I have 2 nodes with Corosync redundant ring feature. Each node has 2 similarly connected/configured NIC's. Both nodes are connected each other by two crossover cables. I configured both nodes with rrp

Re: [ClusterLabs] Antw: Re: Spurious node loss in corosync cluster

2018-08-22 Thread Prasad Nagaraj
Hi - My systems are single core cpu VMs running on azure platform. I am running MySQL on the nodes that do generate high io load. And my bad , I meant to say 'High CPU load detected' logged by crmd and not corosync. Corosync logs messages like 'Corosync main process was not scheduled for.' kind

Re: [ClusterLabs] Antw: Re: Spurious node loss in corosync cluster

2018-08-22 Thread Ferenc Wágner
Jan Friesse writes: > Is that system VM or physical machine? Because " Corosync main process > was not scheduled for..." is usually happening on VMs where hosts are > highly overloaded. Or when physical hosts use BMC watchdogs. But Prasad didn't encounter such logs in the setup at hand, as far