Re: [Pacemaker] pacemaker version

2010-10-07 Thread Andrew Beekhof
On Wed, Oct 6, 2010 at 1:51 PM, Vadym Chepkov vchep...@gmail.com wrote: On Oct 6, 2010, at 2:48 AM, Andrew Beekhof wrote: On Tue, Oct 5, 2010 at 7:53 PM, Shravan Mishra shravan.mis...@gmail.com wrote: Hi, I was interested in knowing that if I have to choose between pacemaker 1.0 vs 1.1

Re: [Pacemaker] pacemaker version

2010-10-07 Thread Andrew Beekhof
On Wed, Oct 6, 2010 at 5:04 PM, Gianluca Cecchi gianluca.cec...@gmail.com wrote: On Wed, Oct 6, 2010 at 4:25 PM, Shravan Mishra shravan.mis...@gmail.com wrote: That is what I heard too, that's the reason for this question. In June, inside a complex thread regarding colocation -inf, Andrew

[Pacemaker] [Problem]The monitor that start-delay is long does not stop.

2010-10-07 Thread renayama19661014
Hi, I performed the following steps to verify the report posted to the mailing list. * http://www.gossamer-threads.com/lists/linuxha/pacemaker/66939 Step1) I prepare a cib.xml with a monitor whose start-delay is set to more than five minutes. Step2) I start two nodes and send the cib. Last updated: Thu

Re: [Pacemaker] resource stop timeout broken in 1.0 branch tip

2010-10-07 Thread Andrew Beekhof
On Wed, Oct 6, 2010 at 11:29 AM, Keisuke MORI keisuke.mori...@gmail.com wrote: 2010/10/6 Andrew Beekhof and...@beekhof.net: Are there more changesets that need to be backported regarding this issue? There are, now that Andreas brought the problem to my attention :-)

Re: [Pacemaker] how to test network access and fail over accordingly?

2010-10-07 Thread Andrew Beekhof
On Wed, Oct 6, 2010 at 10:21 PM, Craig Hurley li...@thehurley.com wrote: I tried using ping instead of pingd and I added number to the evaluation, I get the same results :/ primitive p_ping ocf:pacemaker:ping params host_list=172.20.0.254 clone c_ping p_ping meta globally-unique=false

Re: [Pacemaker] crm resource move doesn't move the resource

2010-10-07 Thread Andrew Beekhof
On Sat, Oct 2, 2010 at 6:31 PM, Pavlos Parissis pavlos.paris...@gmail.com wrote: Hi, I am having the same issue again, on a different set of 3 nodes. When I try to manually fail over the resource group to the standby node, the ms-drbd resource is not moved as well and as a result the resource

Re: [Pacemaker] Problem with log level

2010-10-07 Thread Andrew Beekhof
Could you look for CRM Hg Version: in the logs please? Perhaps the logging macro was broken in that version. Strange. On Tue, Oct 5, 2010 at 1:51 PM, Eberhard Kuemmerle e.kuemme...@fz-juelich.de wrote: Hi, I use pacemaker 1.1.2.1 + corosync 1.2.1 (on openSuse 11.3). Logging is configured in

Re: [Pacemaker] syslog-ng as resource / how to make sure it gets restarted

2010-10-07 Thread Andrew Beekhof
On Fri, Oct 1, 2010 at 9:41 AM, Koch, Sebastian sebastian.k...@netzwerk.de wrote: Hi Andrew, thanks for your answer. I still need syslog-ng to restart on all nodes after the ClusterIp moved. I tried it like this: Resource: primitive res_SyslogNG lsb:syslog-ng \     op monitor

Re: [Pacemaker] Missing lrm_opstatus

2010-10-07 Thread Andrew Beekhof
On Tue, Oct 5, 2010 at 1:50 PM, Dejan Muhamedagic deja...@fastmail.fm wrote: Hi, On Tue, Oct 05, 2010 at 11:18:37AM +0200, Andrew Beekhof wrote: Dejan: looks like something in the lrm library. Any idea why the message doesn't contain lrm_opstatus? Because this monitor operation never ran.

Re: [Pacemaker] [Problem]The monitor that start-delay is long does not stop.

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 8:39 AM, renayama19661...@ybb.ne.jp wrote: Hi, I performed the following steps to verify the report posted to the mailing list. * http://www.gossamer-threads.com/lists/linuxha/pacemaker/66939 Step1) I prepare a cib.xml with a monitor whose start-delay is set to more than five minutes.

Re: [Pacemaker] Election Timeout and node became the Pending state.

2010-10-07 Thread Andrew Beekhof
On Tue, Oct 5, 2010 at 6:44 AM, renayama19661...@ybb.ne.jp wrote: Hi, We tested a complicated node failure scenario, and an Election Timeout error occurred. * Pacemaker:pacemaker-1.0.9.1 * heartbeat-3.0.3-2.3.el5 * cluster-glue:cluster-glue-1.0.6-1.6.el5 *

Re: [Pacemaker] Can somebody please explain pengine's urge to move all resources?

2010-10-07 Thread Raoul Bhatia [IPAX]
On 10/06/2010 11:16 AM, Keisuke MORI wrote: This should have been fixed with this: http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/5fe02f48c47b The patch has already been backported to the 1.0 repository and will be included in 1.0.10. Will you test with the tip of 1.0 repository if you

Re: [Pacemaker] Missing lrm_opstatus

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 11:13 AM, Dejan Muhamedagic deja...@fastmail.fm wrote: On Thu, Oct 07, 2010 at 09:49:05AM +0200, Andrew Beekhof wrote: On Tue, Oct 5, 2010 at 1:50 PM, Dejan Muhamedagic deja...@fastmail.fm wrote: Hi, On Tue, Oct 05, 2010 at 11:18:37AM +0200, Andrew Beekhof wrote:

Re: [Pacemaker] Can somebody please explain pengine's urge to move all resources?

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 11:02 AM, Raoul Bhatia [IPAX] r.bha...@ipax.at wrote: On 10/06/2010 11:16 AM, Keisuke MORI wrote: This should have been fixed with this: http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/5fe02f48c47b The patch has already been backported to the 1.0 repository and will

Re: [Pacemaker] Backports from 1.1 to 1.0

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 10:55 AM, Raoul Bhatia [IPAX] r.bha...@ipax.at wrote: hi all, do you have any further information, eta, repository, etc. in regard of the backported patches from 1.1 to 1.0? I saw a bunch go into stable-1.0 the other day, so I think they're done. I just need to find

Re: [Pacemaker] About behavior in Action Lost.

2010-10-07 Thread Keisuke MORI
Andrew, 2010/9/23 Andrew Beekhof and...@beekhof.net: Pushed as: http://hg.clusterlabs.org/pacemaker/1.1/rev/8433015faf18 Not sure about applying to 1.0 though, it's a dramatic change in behavior. I would like to backport this to 1.0. Would you agree with this? Without this the failed node

Re: [Pacemaker] About behavior in Action Lost.

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 11:48 AM, Keisuke MORI keisuke.mori...@gmail.com wrote: Andrew, 2010/9/23 Andrew Beekhof and...@beekhof.net: Pushed as: http://hg.clusterlabs.org/pacemaker/1.1/rev/8433015faf18 Not sure about applying to 1.0 though, it's a dramatic change in behavior. I would like

Re: [Pacemaker] Problem with log level

2010-10-07 Thread Eberhard Kümmerle
I have solved the problem. There was an error in corosync.conf in a section before the logging section, so the logging section wasn't interpreted correctly. Thank you!

Re: [Pacemaker] starting a xen-domU depending on available hardware-resources using SysInfo-RA

2010-10-07 Thread Dejan Muhamedagic
Hi, On Thu, Sep 30, 2010 at 08:52:16AM -0400, Vadym Chepkov wrote: On Sep 30, 2010, at 2:35 AM, Sascha Reimann wrote: Hi Dejan, it's working fine with the amount of free ram as the score and a bigger default-resource-stickiness: primitive v01 ocf:heartbeat:Xen \ params

Re: [Pacemaker] stonith resource issue

2010-10-07 Thread Dejan Muhamedagic
Hi, On Wed, Oct 06, 2010 at 01:32:06PM -0400, Shravan Mishra wrote: Please find hb_report. hb_report couldn't find the logs, probably because you have both syslog and to-file logging. Anyway, it could be that stuff such as external/safe/ipmi cannot work, i.e. that you can't create

Re: [Pacemaker] Missing lrm_opstatus

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 6:06 PM, Ron Kerry rke...@sgi.com wrote: On 10/7/2010 8:00 AM, Andrew Beekhof wrote: On Thu, Oct 7, 2010 at 11:13 AM, Dejan Muhamedagic deja...@fastmail.fm wrote:   On Thu, Oct 07, 2010 at 09:49:05AM +0200, Andrew Beekhof wrote:   On Tue, Oct 5, 2010 at 1:50 PM, Dejan

Re: [Pacemaker] pacemaker version

2010-10-07 Thread Pavlos Parissis
On 7 October 2010 08:33, Andrew Beekhof and...@beekhof.net wrote: On Wed, Oct 6, 2010 at 5:04 PM, Gianluca Cecchi gianluca.cec...@gmail.com wrote: On Wed, Oct 6, 2010 at 4:25 PM, Shravan Mishra shravan.mis...@gmail.com wrote: That is what I heard too, that's the reason for this question.

[Pacemaker] stonith pacemaker problem

2010-10-07 Thread Shravan Mishra
Hi, Description of my environment: corosync=1.2.8 pacemaker=1.1.3 Linux= 2.6.29.6-0.6.smp.gcc4.1.x86_64 #1 SMP We are having a problem with our pacemaker which is continuously canceling the monitoring operation of our stonith devices. We ran: stonith -d -t external/safe/ipmi

Re: [Pacemaker] [Problem]The monitor that start-delay is long does not stop.

2010-10-07 Thread renayama19661014
Hi Andrew, Thank you for your comment. Funnily enough I was just looking at that message and saw that the code relevant to this one looked wrong too. I believe this should fix the issue: http://hg.clusterlabs.org/pacemaker/1.1/rev/e06810256413 I filed the logs and further details in Bugzilla.

Re: [Pacemaker] crm resource move doesn't move the resource

2010-10-07 Thread jiaju liu
the latest from code Mercurial? Maybe you should clear failcount 1.1 or 1.2 branch?

Re: [Pacemaker] crm resource move doesn't move the resource

2010-10-07 Thread Pavlos Parissis
On 8 October 2010 04:26, jiaju liu liujiaj...@yahoo.com.cn wrote: Message: 2 Date: Thu, 7 Oct 2010 21:58:29 +0200 From: Pavlos Parissis pavlos.paris...@gmail.com To: The Pacemaker cluster resource manager

Re: [Pacemaker] pacemaker version

2010-10-07 Thread Andrew Beekhof
On Thu, Oct 7, 2010 at 10:10 PM, Pavlos Parissis pavlos.paris...@gmail.com wrote: On 7 October 2010 08:33, Andrew Beekhof and...@beekhof.net wrote: On Wed, Oct 6, 2010 at 5:04 PM, Gianluca Cecchi gianluca.cec...@gmail.com wrote: On Wed, Oct 6, 2010 at 4:25 PM, Shravan Mishra