Re: [Pacemaker] failure handling on a cloned resource

2013-05-07 Thread Johan Huysmans
Hi, I only keep a couple of pe-input files, and that pe-input-1 version was already overwritten. I redid my tests as described in my previous mails. At the end of the test it was again written to pe-input1, which is included as an attachment. gr. Johan On 2013-05-07 04:08, Andrew Beekhof

[Pacemaker] R: Frequent SBD triggered server reboots

2013-05-07 Thread andrea cuozzo
Hi, Here are three logs from the last server watchdog-driven reboot on Friday evening (not that I want you to actually dig into them, it's just to update this thread with my new findings), with the SBD watchdog timeout set to 20 seconds. 1) sar.txt is the output of sar -d -p- 2 (two seconds

Re: [Pacemaker] R: Frequent SBD triggered server reboots

2013-05-07 Thread emmanuel segura
Hello Andrea, I think you need to think about what Lars told you (upgrade to SP2), or maybe you can try to use a different LUN for the sbd and use ionice to set the realtime class for the sbd process. 2013/5/7 andrea cuozzo andrea.cuo...@sysma.it Hi, Here are three logs from the last

Re: [Pacemaker] R: Frequent SBD triggered server reboots

2013-05-07 Thread Lars Marowsky-Bree
On 2013-05-07T10:43:55, emmanuel segura emi2f...@gmail.com wrote: Hello Andrea, I think you need to think about what Lars told you (upgrade to SP2), or maybe you can try to use a different LUN for the sbd and use ionice to set the realtime class for the sbd process. SBD is already using

Re: [Pacemaker] Pacemaker core dumps

2013-05-07 Thread Xavier Lashmar
Hey Andrew, that is great news. Do you know when new RPMs with the updated source might be available? We are managing production servers and would rather continue using package management to update them. Otherwise we shall recompile if there is no alternative. Thanks very much for your

[Pacemaker] [Patch] pacemaker-mgmt/hbagent avoid coredump with pacemaker=1.1.8/corosync

2013-05-07 Thread Rainer Brestan
The SNMP agent hbagent from pacemaker-mgmt produces a segmentation fault if used with pacemaker=1.1.8 and corosync. The cause is the function get_cib_fd in the file hbagentv2.c. It tries to get the file descriptor through the function pointer inputfd, which is no longer initialized since the change of IPC to

Re: [Pacemaker] Corosync 2.3 dies randomly

2013-05-07 Thread Robert Parsons
Corosync has a blackbox - did you interrogate that too? I grabbed the latest source from github and the problem remains. Here's some diagnostic output: strace on the corosync process: recvmsg(12, {msg_name(16)={sa_family=AF_INET, sin_port=htons(8999), sin_addr=inet_addr(10.1.4.133)},

Re: [Pacemaker] pacemaker dev lead suggesting s/w upgrade

2013-05-07 Thread Andrew Beekhof
Please keep all questions on the mailing list... I don't have the bandwidth for 1-1 support. At a minimum, include (as attachments) logs from all machines. Also, since you're upgrading, can I suggest you go with 1.1.10-rc2. Even though it's only a release candidate, it's far superior to 1.1.8 --

Re: [Pacemaker] Pacemaker core dumps

2013-05-07 Thread Andrew Beekhof
On 07/05/2013, at 10:42 PM, Xavier Lashmar xlash...@uottawa.ca wrote: Hey Andrew, that is great news. Do you know when new RPMs with the updated source might be available? We're in the release candidate phase for 1.1.10, so when that is released in the coming days. Or you can use make rpm

[Pacemaker] Squid ocf agent. Temporaly unknown error stopping squid daemon.

2013-05-07 Thread Mauricio Esteban
Hello, I'm working on an ACTIVE/PASSIVE proxy cluster on Debian 6.0.7. The cluster has two resources configured, an OCF IPaddr2 agent and a squid OCF agent. The IPaddr2 agent is working fine, but the squid agent shows a temporary error when I manually stop the service (/etc/init.d/squid stop) in

Re: [Pacemaker] crm_mon failed with upgrade failed message

2013-05-07 Thread Andrew Beekhof
On 07/05/2013, at 11:42 PM, Michal Fiala fi...@mobil.cz wrote: Hello, I have updated a corosync/pacemaker cluster; for versions, see below. The cluster is working fine, but when I change the configuration via crm configure edit, crm_mon exits with the error message: Your current configuration could