[Pacemaker] Problems with SBD

2015-01-04 Thread Oriol Mula-Valls
Hi everyone, I have a two node system with SLES 11 SP3 (pacemaker-1.1.9-0.19.102, corosync-1.4.5-0.18.15, sbd-1.1-0.13.153). Since desember we started to have several reboots of the system due to SBD; 22nd, 24th and 26th. Last reboot happened yesterday January 3rd. The message is the same all the

Re: [Pacemaker] Pacemaker + drbd + Cman Error: gfs_controld join connect error: Connection refused error mounting lockproto lock_dlm

2015-01-04 Thread raby
Hi this is the drbdadm dump *# resource pcmAppData on pcmk-1: not ignored, not stacked# defined at /etc/drbd.conf:10resource pcmAppData {on pcmk-1 { device /dev/drbd1 minor 1;disk /dev/mapper/VolGroup-drbd--demo;meta-diskinternal; address

Re: [Pacemaker] Pacemaker + drbd + Cman Error: gfs_controld join connect error: Connection refused error mounting lockproto lock_dlm

2015-01-04 Thread Digimer
You've disabled stonith, which alone is a very bad idea with DRBD and cman. Please enable it, configure and test stonith devices, and then hook DRBD into pacemaker using the 'fence-handler '/path/to/crm-fence-peer.sh' and set 'fencing resource-and-stonith'. Then configure cman to hook into

Re: [Pacemaker] Corosync 1.4.7: zombie (defunct)

2015-01-04 Thread Andrew Beekhof
pacemaker version? it looks familiar but it depends on the version number. On 29 Dec 2014, at 10:24 pm, Sergey Arlashin sergeyarl.maill...@gmail.com wrote: Hi! Recently I've noticed that one of my nodes had OFFLINE status in 'crm status' output. But it actually was not. I could ssh on

Re: [Pacemaker] Corosync 1.4.7: zombie (defunct)

2015-01-04 Thread Sergey Arlashin
Pacemaker 1.1.6 It runs on Ubuntu 12.04 LTS 64bit. Linux lb-node1 3.11.0-23-generic #40~precise1-Ubuntu SMP Wed Jun 4 22:06:36 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux -- Best regards, Sergey Arlashin On Jan 5, 2015, at 7:59 AM, Andrew Beekhof and...@beekhof.net wrote: pacemaker version?