Re: [Pacemaker] rename all nodes of a cluster

2010-10-05 Thread Andrew Beekhof
Just bring up the cluster with the new names and use crm configure node delete to remove the old names. On Mon, Oct 4, 2010 at 2:12 PM, Karl Rößmann k.roessm...@fkf.mpg.de wrote: Hi Clusterlabs mailing list, I have a running cluster with three nodes. For some reason I had to change all host

Re: [Pacemaker] Fail over algorithm used by Pacemaker

2010-10-05 Thread Andrew Beekhof
On Sun, Oct 3, 2010 at 4:01 PM, hudan studiawan studia...@gmail.com wrote: Hi, I want to start contributing to the Pacemaker project. I have started reading the documentation and trying some basic configurations. I have a question: what kind of algorithm is used by Pacemaker to choose another node when a node

Re: [Pacemaker] cib

2010-10-05 Thread Andrew Beekhof
On Fri, Oct 1, 2010 at 3:45 PM, Shravan Mishra shravan.mis...@gmail.com wrote: Hi, Just a quick question, who generates the very first cib.xml when pacemaker processes are initialized? The cib Thanks Shravan On Thu, Sep 30, 2010 at 4:22 AM, Andrew Beekhof and...@beekhof.net wrote: On

Re: [Pacemaker] Fail-count and failure timeout

2010-10-05 Thread Andrew Beekhof
On Fri, Oct 1, 2010 at 3:40 PM, holger.teut...@fresenius-netcare.com wrote: Hi, I observed the following in pacemaker Versions 1.1.3 and tip up to patch 10258. In a small test environment to study fail-count behavior I have one resource anything doing sleep 600 with monitoring interval 10

Re: [Pacemaker] resources are restarted without obvious reasons

2010-10-05 Thread Andrew Beekhof
On Fri, Oct 1, 2010 at 9:53 AM, Pavlos Parissis pavlos.paris...@gmail.com wrote: Hi, It seems that it happens every time PE wants to check the conf 09:23:55 crmd: [3473]: info: crm_timer_popped: PEngine Recheck Timer (I_PE_CALC) just popped! and then check_rsc_parameters() wants to reset my

Re: [Pacemaker] [Problem or Enhancement]When attrd reboots, a fail count is initialized.

2010-10-05 Thread renayama19661014
Hi Andrew, I registered these contents with Bugzilla as enhancement of the functions. * http://developerbugs.linux-foundation.org/show_bug.cgi?id=2501 Thanks, Hideo Yamauchi. --- renayama19661...@ybb.ne.jp wrote: Hi Andrew, Thank you for comment. Is the change of this attrd and

Re: [Pacemaker] Missing lrm_opstatus

2010-10-05 Thread Andrew Beekhof
Dejan: looks like something in the lrm library. Any idea why the message doesn't contain lrm_opstatus? lrm_targetrc also looks strange. On Thu, Sep 30, 2010 at 9:41 PM, Ron Kerry rke...@sgi.com wrote: Folks - I am seeing the following message sequence that results in a bogus declaration of

Re: [Pacemaker] crm_mon SNMP function

2010-10-05 Thread Michael Schwartzkopff
On Monday 04 October 2010 15:00:25 mathias.enzensber...@knapp.com wrote: Hi all, I use openais/pacemaker v.1.1.2 on SLES 11.1 and would like to use the SNMP function of crm_mon. But this part is documented really scantily (e.g. the part for configuring SNMP notifications is blank). I found

Re: [Pacemaker] resources are restarted without obvious reasons

2010-10-05 Thread Pavlos Parissis
On 5 October 2010 11:15, Andrew Beekhof and...@beekhof.net wrote: On Fri, Oct 1, 2010 at 9:53 AM, Pavlos Parissis pavlos.paris...@gmail.com wrote: Hi, It seems that it happens every time PE wants to check the conf 09:23:55 crmd: [3473]: info: crm_timer_popped: PEngine Recheck Timer

[Pacemaker] init Script fails in 1 of LSB Compatible test

2010-10-05 Thread Pavlos Parissis
Hi, I am thinking of putting sshd under cluster control, and I am checking whether the /etc/init.d/sshd supplied by RedHat 5.4 is LSB compatible. So, I ran the test mentioned here [1] and it fails at test 6: it returns 1 and a failed message. Could this create problems within pacemaker? Regards,

Re: [Pacemaker] Fail-count and failure timeout

2010-10-05 Thread Holger . Teutsch
The resource failed when the sleep expired, i.e. every 600 secs. Now I changed the resource to sleep 7200, failure-timeout 3600, i.e. to values far beyond the recheck-interval of 15m. Now everything behaves as expected. Kind regards Holger Teutsch From:

Re: [Pacemaker] init Script fails in 1 of LSB Compatible test

2010-10-05 Thread Andrew Beekhof
On Tue, Oct 5, 2010 at 12:51 PM, Pavlos Parissis pavlos.paris...@gmail.com wrote: Hi, I am thinking of putting sshd under cluster control, and I am checking whether the /etc/init.d/sshd supplied by RedHat 5.4 is LSB compatible. So, I ran the test mentioned here [1] and it fails at test 6, it

Re: [Pacemaker] init Script fails in 1 of LSB Compatible test

2010-10-05 Thread Pavlos Parissis
On 5 October 2010 13:19, Andrew Beekhof and...@beekhof.net wrote: On Tue, Oct 5, 2010 at 12:51 PM, Pavlos Parissis pavlos.paris...@gmail.com wrote: Hi, I am thinking of putting sshd under cluster control, and I am checking whether the /etc/init.d/sshd supplied by RedHat 5.4 is compatible

Re: [Pacemaker] Dependency on either of two resources

2010-10-05 Thread Vladislav Bogdanov
05.10.2010 12:12, Andrew Beekhof wrote: On Mon, Oct 4, 2010 at 6:31 AM, Vladislav Bogdanov bub...@hoster-ok.com wrote: Hi all, just wondering, is there a way to make a resource depend on (be colocated with) either of two other resources? Not yet. It's something we want to support

[Pacemaker] pacemaker version

2010-10-05 Thread Shravan Mishra
Hi, I would like to know which one I should use if I have to choose between pacemaker 1.0 and 1.1. Thanks Shravan

[Pacemaker] Online and Offline status when doing crm_mon

2010-10-05 Thread Mike A Meyer
We are set up as a two-node active/passive cluster using pacemaker/corosync. We shut down pacemaker/corosync on both nodes and changed the uname -n on our nodes to show the short name instead of the FQDN. We started up pacemaker/corosync, and ever since we did that, when we run the crm_mon command,

[Pacemaker] how to test network access and fail over accordingly?

2010-10-05 Thread Craig Hurley
Hello, I have a 2 node cluster, running DRBD, heartbeat and pacemaker in active/passive mode. On both nodes, eth0 is connected to the main network, eth1 is used to connect the nodes directly to each other. The nodes share a virtual IP address on eth0. Pacemaker is also controlling a custom