Re: [Pacemaker] Multi-level ACLs for the CIB

2010-02-22 Thread Yan Gao
Hi Andrew, On 02/08/10 17:48, Andrew Beekhof wrote: On Thu, Feb 4, 2010 at 5:24 PM, Yan Gao y...@novell.com wrote: And put exclusions for things like passwords before the read for the whole cib? Yes. We should specify any deny and write objects before it. I like the syntax now, but my

[Pacemaker] crm_mon with sending mails

2010-02-22 Thread Tom Tux
Hi all, I started crm_mon in the background to send me mails when a cluster-event occurs: crm_mon -d -H [smtp-server] -T [my_mailaddress] But when I manually stop or start a resource or take one node offline and online again, I did not receive an email. My question hereby: On which

Re: [Pacemaker] crm_mon with sending mails

2010-02-22 Thread Florian Haas
Check if you built with libesmtp-devel installed. Check if you have libesmtp installed. Then, run crm_mon -N - -H [smtp-server] -T [my_mailaddress] and trigger a resource transition. If it's not sending out an email, it will print an error message detailing the reason. Hope this helps.

[Pacemaker] HB 3.0.2 + Pacemaker 1.0.7-1 NOT working :(

2010-02-22 Thread Angie T. Muhammad
Hello I have the following packages installed on 2-node cluster: - pacemaker-1.0.7-1.el5 - pacemaker-libs-1.0.7-1.el5 - heartbeat-3.0.2-2.el5 - heartbeat-libs-3.0.2-2.el5 - cluster-glue-1.0.1-1.el5 - cluster-glue-libs-1.0.1-1.el5 - corosync-1.1.2-1.el5 - resource-agents-1.0.1-1.el5

[Pacemaker] heartbeat stop hangs sometimes

2010-02-22 Thread Markus M.
Hello, sometimes heartbeat stop seems to hang (latest packages from clusterlabs.org, RHEL5 x86_64, 2-node cluster with only one node running). The last lines from ha-debug are like this: Feb 22 12:52:48 dbprod21 ccm: [24053]: info: client (pid=24058) removed from ccm Feb 22 12:52:48 dbprod21

Re: [Pacemaker] HB 3.0.2 + Pacemaker 1.0.7-1 NOT working :(

2010-02-22 Thread Florian Haas
man ha.cf Note that ucast directives which go to the local machine are effectively ignored. This allows the ha.cf directives on all machines to be identical. Your ha.cf _must_ be identical across nodes. You got your ucast directives wrong. You need two of them. Hope this helps, Florian
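
The advice above (two ucast directives, one file shared by both nodes) can be sketched as an ha.cf fragment. This is only an illustration: the interface name and the 192.0.2.x addresses are placeholders, not values from the thread.

```
# /etc/ha.d/ha.cf, same file on both nodes
# Placeholder interface and addresses; substitute your own.
# Address of node1; heartbeat on node1 ignores this line.
ucast eth0 192.0.2.1
# Address of node2; heartbeat on node2 ignores this line.
ucast eth0 192.0.2.2
```

Because each node skips the ucast line that points at itself, the file can be copied unchanged between the nodes.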

Re: [Pacemaker] Problems settings up MySQL failover

2010-02-22 Thread Lior Marantenboim
Yes, both scripts work perfectly well manually (the ones that come in /usr/lib/ocf/resource.d/heartbeat/). 2010/2/19 Angie T. Muhammad angie.taw...@gmail.com On Fri, Feb 19, 2010 at 7:19 PM, Lior Marantenboim lior...@gmail.com wrote: Hello everyone I'm having trouble trying to set up a

Re: [Pacemaker] HB 3.0.2 + Pacemaker 1.0.7-1 NOT working :(

2010-02-22 Thread Angie T. Muhammad
Hello Florian, I got your point, so I switched to the mcast directive and copied ha.cf using scp from node1 to node2. Both files now look like this: keepalive 2 deadtime 30 warntime 10 initdead 120 mcast eth0

Re: [Pacemaker] HB 3.0.2 + Pacemaker 1.0.7-1 NOT working :(

2010-02-22 Thread Angie T. Muhammad
Finally solved it: it is *crm respawn* instead of *crm on*, though the latter used to work fine with heartbeat 3.0.1 and multicast! Thanks, Florian, for taking the time to help -- On Mon, Feb 22,
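
For reference, the fix reported above amounts to a one-line change in ha.cf. The fragment below is a sketch assembled from the values quoted earlier in this thread; the mcast line is truncated in this listing, so it is left out rather than guessed.

```
# ha.cf fragment (sketch; timing values as quoted earlier in the thread)
keepalive 2
deadtime 30
warntime 10
initdead 120
# The communication directive (mcast or ucast) goes here.
# Enable Pacemaker under Heartbeat; the poster reports that
# "crm on" stopped working for them with Heartbeat 3.0.2.
crm respawn
```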

Re: [Pacemaker] [SOLVED] crm_mon with sending mails

2010-02-22 Thread Tom Tux
Hi Florian, thanks for your hint. The problem was that crm_mon tried to send mail over TCP port 587. I also specified port 25 within the mail-host option, like this: --mail-option=[smtp_server:25] This works fine. Thanks a lot, Tom 2010/2/22 Florian Haas florian.h...@linbit.com: Check
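
The resolution above comes down to appending the port to the SMTP host passed to crm_mon. A minimal sketch, using the -d, -H and -T flags quoted earlier in this thread: the helper name, host name and mail address are hypothetical, and the function only prints the command so it can be checked before it is run on a cluster node.

```shell
#!/bin/sh
# Hypothetical helper: prints the crm_mon command line instead of running it.
# -d, -H and -T are the flags quoted in the thread; host and address are placeholders.
crm_mon_mail_cmd() {
    smtp_host=$1   # SMTP relay, e.g. smtp.example.com
    smtp_port=$2   # 25 in the thread, since the default attempt went to 587
    mail_to=$3     # recipient of the cluster notifications
    printf 'crm_mon -d -H %s:%s -T %s\n' "$smtp_host" "$smtp_port" "$mail_to"
}

crm_mon_mail_cmd smtp.example.com 25 admin@example.com
```

Running the printed command on a node requires a crm_mon built with libesmtp support, as noted earlier in the thread.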

Re: [Pacemaker] cli configuration and resource sets

2010-02-22 Thread Dejan Muhamedagic
Hi, On Fri, Feb 19, 2010 at 05:27:51PM +0100, Cristian Mammoli - Apra Sistemi wrote: Andrew Beekhof wrote: On Fri, Feb 19, 2010 at 1:06 PM, Cristian Mammoli - Apra Sistemi c.mamm...@apra.it wrote: Dejan Muhamedagic wrote: Sorry, didn't read careful enough, was distracted by the actions in

Re: [Pacemaker] cli configuration and resource sets

2010-02-22 Thread Dejan Muhamedagic
Hi again, On Fri, Feb 19, 2010 at 05:27:51PM +0100, Cristian Mammoli - Apra Sistemi wrote: Andrew Beekhof wrote: On Fri, Feb 19, 2010 at 1:06 PM, Cristian Mammoli - Apra Sistemi c.mamm...@apra.it wrote: Dejan Muhamedagic wrote: Sorry, didn't read careful enough, was distracted by the

[Pacemaker] gfs2: ping_pong test fails

2010-02-22 Thread Raphael Daum
dear all, the ping_pong test (which can be found here: http://wiki.samba.org/index.php/Ping_pong) fails with pacemaker 1.0.7 and gfs_controld.pcmk (3.0.7) on a gfs2 filesystem. the fs is mounted by the cluster infrastructure, dlm_controld.pcmk and gfs_controld.pcmk are running. here comes the

Re: [Pacemaker] Problems settings up MySQL failover

2010-02-22 Thread Dejan Muhamedagic
Hi, On Fri, Feb 19, 2010 at 03:08:00PM -0300, Javier Charne wrote: Lior Marantenboim wrote: Hello everyone I'm having trouble trying to set up a simple MySQL failover (make Pacemaker start it on one node, and if it shuts down, start it on the other). CRM shows as if MySQL is not

Re: [Pacemaker] Problems settings up MySQL failover

2010-02-22 Thread Dejan Muhamedagic
Hi, On Fri, Feb 19, 2010 at 02:19:15PM -0300, Lior Marantenboim wrote: Hello everyone I'm having trouble trying to set up a simple MySQL failover (make Pacemaker start it on one node, and if it shuts down, start it on the other). CRM shows as if MySQL is not running but it actually is, and

[Pacemaker] Problem with pingd.

2010-02-22 Thread Jayakrishnan
Sir, I have set up a 2-node cluster with heartbeat 2.99 and pacemaker 1.05. I am using Ubuntu 9.10. Both packages are installed from the Ubuntu karmic repository. My packages are: heartbeat 2.99.2+sles11r9-5ubuntu1 heartbeat-common 2.99.2+sles11r9-5ubuntu1

Re: [Pacemaker] Problems settings up MySQL failover

2010-02-22 Thread Lior Marantenboim
Apparently committing works, even if it complains that the slb resource does not exist. I have it working now with SLB instead of ocf:heartbeat. The next thing I'm trying to do is, if the cluster detects that the primary node has MySQL down, to switch its service IP to the other node. Is

Re: [Pacemaker] Problem with pingd.

2010-02-22 Thread Andrew Beekhof
On Mon, Feb 22, 2010 at 6:46 PM, Jayakrishnan jayakrishnan...@gmail.com wrote: Sir, I have set up a 2-node cluster with heartbeat 2.99 and pacemaker 1.05. I am using Ubuntu 9.10. Both packages are installed from the Ubuntu karmic repository. My packages are: heartbeat

Re: [Pacemaker] heartbeat stop hangs sometimes

2010-02-22 Thread Andrew Beekhof
On Mon, Feb 22, 2010 at 5:10 PM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Mon, Feb 22, 2010 at 01:00:29PM +0100, Markus M. wrote: Hello, sometimes heartbeat stop seems to hang (latest packages from clusterlabs.org, RHEL5 x86_64, 2-node cluster with only one node running). The last

Re: [Pacemaker] Multi-level ACLs for the CIB

2010-02-22 Thread Andrew Beekhof
On Mon, Feb 22, 2010 at 8:58 AM, Yan Gao y...@novell.com wrote: Hi Andrew, On 02/08/10 17:48, Andrew Beekhof wrote: On Thu, Feb 4, 2010 at 5:24 PM, Yan Gao y...@novell.com wrote: And put exclusions for things like passwords before the read for the whole cib? Yes. We should specify any

Re: [Pacemaker] All dependend resources are restarted when Clone Set changes status on ONE machine

2010-02-22 Thread Andrew Beekhof
On Sat, Feb 13, 2010 at 9:14 PM, Sebastian Urban urban...@gmail.com wrote: Hi. I have two servers (cipvm1, cipvm2) and want to configure automatic failover of virtual machines (KVM via libvirt). Both servers share a virtual block device via DRBD in dual primary mode. I use an OCFS2

Re: [Pacemaker] We cannot stop specific clone resources by a crm_resource order.

2010-02-22 Thread Andrew Beekhof
On Wed, Feb 17, 2010 at 9:11 AM, renayama19661...@ybb.ne.jp wrote: Hi, We cannot stop specific clone resources by a crm_resource order. Last updated: Wed Feb 17 16:56:26 2010 Stack: openais Current DC: srv01 - partition with quorum Version:

Re: [Pacemaker] gfs2: ping_pong test fails

2010-02-22 Thread Andrew Beekhof
On Mon, Feb 22, 2010 at 4:12 PM, Raphael Daum raphael.d...@kurier.at wrote: dear all, the ping_pong test (which can be found here: http://wiki.samba.org/index.php/Ping_pong) fails with pacemaker 1.0.7 and gfs_controld.pcmk (3.0.7) on a gfs2 filesystem. the fs is mounted by the cluster

Re: [Pacemaker] heartbeat stop hangs sometimes

2010-02-22 Thread Lars Ellenberg
On Mon, Feb 22, 2010 at 08:46:23PM +0100, Andrew Beekhof wrote: On Mon, Feb 22, 2010 at 5:10 PM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Mon, Feb 22, 2010 at 01:00:29PM +0100, Markus M. wrote: Hello, sometimes heartbeat stop seems to hang (latest packages from

Re: [Pacemaker] We cannot stop specific clone resources by a crm_resource order.

2010-02-22 Thread renayama19661014
Hi Andrew, Can we stop one clone by a crm_resource command and a crm command? Or is it necessary to set -INFINITY in a rule? Right, a location rule. Is it by design that a clone instance cannot be stopped individually? Correct. You can however do: crm configure location
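
The command quoted above is cut off in this listing and is not reconstructed here. As a separate illustration of the technique being discussed (keeping a clone off one node with a -INFINITY location constraint), a crm configuration fragment might look like the sketch below. The constraint id, the clone name clnExample and the node name srv02 are all hypothetical.

```
# Sketch only: clnExample and srv02 are placeholder names.
# A -inf score forbids any instance of the clone from running on srv02.
location ban-clnExample-on-srv02 clnExample -inf: srv02
```

Deleting the constraint again allows the clone instance back onto that node.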

Re: [Pacemaker] Problem with pingd.

2010-02-22 Thread Jayakrishnan
Sir, I am afraid to ask you, but how can I tell pacemaker to compare as a number instead of a string? I changed -inf: to -1 in the pingd location constraint but the same problem persists. I also changed the global resource stickiness to 1, but it is still not working. With thanks, Jayakrishnan.L On Tue, Feb

Re: [Pacemaker] [PATCH CTS] minor patch to call key_for_node from isolate_node()

2010-02-22 Thread Andrew Beekhof
Pushed On Tue, Feb 23, 2010 at 4:02 AM, Angus Salkeld asalk...@redhat.com wrote: Hi As the subject suggests. -Angus ___ Pacemaker mailing list Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Re: [Pacemaker] Problem with pingd.

2010-02-22 Thread Andrew Beekhof
http://www.clusterlabs.org/doc/en-US/Pacemaker/1.0/html/Pacemaker_Explained/ch-rules.html#id2083551 Look for the type field. On Tue, Feb 23, 2010 at 5:08 AM, Jayakrishnan jayakrishnan...@gmail.com wrote: Sir, I am afraid to ask you but how can I tell pacemaker to compare as number instead of

Re: [Pacemaker] Problem with pingd.

2010-02-22 Thread Jayakrishnan
Sir, Could you explain that a bit more? I have been reading the same document for 2 days and can't specify the type as integer. It is showing "Parsing error, do you want to edit it again". With lots of thanks, Jayakrishnan. L On Tue, Feb 23, 2010 at 12:51 PM, Andrew Beekhof and...@beekhof.net
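
As a hedged illustration of the numeric comparison discussed in this thread: in the crm shell, an expression can carry a type prefix, and `number:` requests a numeric rather than string comparison. The sketch below assumes a pingd attribute named pingd and uses placeholder names for the constraint and the resource; it is not taken from the poster's configuration.

```
# Sketch only: loc-connected and my_resource are placeholder names.
# Ban the resource where pingd is undefined or numerically <= 0.
location loc-connected my_resource \
    rule -inf: not_defined pingd or pingd number:lte 0
```

In the XML form this should correspond to type="number" on the expression element. That the 1.0 schema expects `number` rather than `integer` is an assumption here, although it would be consistent with the parsing error reported above.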