Re: [Pacemaker] crm_mon and pingd

2010-11-11 Thread Keisuke MORI
No objections. I've pushed the changeset below: http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/53132ed532ea But it would still be preferable to rely on -A, particularly if you want to use two or more pingd resources or change the attribute name. I'd also agree for the filtering feature as an

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread Andrew Beekhof
This might be a little better: diff -r dd75da218e4f configure.ac --- a/configure.ac Fri Oct 29 12:12:45 2010 +0200 +++ b/configure.ac Tue Nov 09 13:20:55 2010 +0100 @@ -654,7 +654,7 @@ AC_MSG_RESULT(using $GLIBCONFIG) AC_CHECK_LIB(glib-2.0, g_hash_table_get_values) if test

[Pacemaker] Multistate Resources is not promoted automatically

2010-11-11 Thread Simon Jansen
Hi everybody, I wrote a multistate resource agent to manage the Rsyslog service in a master/slave setup. In the master role the service is started with a predefined rsyslog_master.conf. In the slave role with a predefined rsyslog_slave.conf configuration file. The start operation of the agent

Re: [Pacemaker] How can I restart a clone resource on a specific node ?

2010-11-11 Thread Andrew Beekhof
On Tue, Nov 9, 2010 at 5:56 PM, oaidel oai...@cc.in2p3.fr wrote: Hello, I have a cluster 2 nodes (RL 5.2) using pacemaker (1.0.9) with corosync configuration crm_config cluster_property_set id=cib-bootstrap-options nvpair id=cib-bootstrap-options-dc-version name=dc-version

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread nozawat
Hi Andrew, Thanks for a revision. I confirmed completion of compiling it. I revised it a little, I attach a patch. Regards, Tomo 2010/11/11 Andrew Beekhof and...@beekhof.net This might be a little better: diff -r dd75da218e4f configure.ac --- a/configure.ac Fri Oct 29 12:12:45

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 10:26 AM, nozawat noza...@gmail.com wrote: Hi Andrew, Thanks for a revision. I confirmed completion of compiling it. I revised it a little, I attach a patch. Thanks! Did you try running it? Regards, Tomo 2010/11/11 Andrew Beekhof and...@beekhof.net This

Re: [Pacemaker] Multistate Resources is not promoted automatically

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 9:59 AM, Simon Jansen simon.jans...@googlemail.com wrote: Hi everybody, I wrote a multistate resource agent to manage the Rsyslog service in a master/slave setup. In the master role the service is started with a predefined rsyslog_master.conf. In the slave role with a

Re: [Pacemaker] Balancing of clone resources (globally-unique=true)

2010-11-11 Thread Andrew Beekhof
On Wed, Nov 10, 2010 at 1:51 PM, Chris Picton ch...@ecntelecoms.com wrote: On Wed, 10 Nov 2010 09:32:00 +0100, Andrew Beekhof wrote: what version is this? This is 1.0.9 Odd. I wouldn't have expected this behavior. Can you attach the output from cibadmin -Ql please? On Tue, Nov 9, 2010

Re: [Pacemaker] crazy notify with master/slave of group

2010-11-11 Thread Andrew Beekhof
On Tue, Nov 2, 2010 at 1:04 AM, Alan Jones falanclus...@gmail.com wrote: Hi, Pacemaker 1.0.9.1, Corosync 1.2.7 I have a sane master/slave configuration that gives me normal looking notify() calls when I standby each node in turn. However, when I configure the master/slave on a group of three

Re: [Pacemaker] Multistate Resources is not promoted automatically

2010-11-11 Thread Dejan Muhamedagic
Hi, On Thu, Nov 11, 2010 at 11:11:14AM +0100, Andrew Beekhof wrote: On Thu, Nov 11, 2010 at 9:59 AM, Simon Jansen simon.jans...@googlemail.com wrote: Hi everybody, I wrote a multistate resource agent to manage the Rsyslog service in a master/slave setup. In the master role the service

Re: [Pacemaker] Cotrolled migration of resources

2010-11-11 Thread Andrew Beekhof
On Fri, Oct 29, 2010 at 4:18 AM, Matthew F. Hymowitz mhymow...@gmpnet.net wrote: I am building a two node active/passive cluster supporting iSCSITarget, iSCSILogicalUnit and ocf:linbit:drbd. I can hard power down a primary node (i.e. pull the power cord) and all my resources migrate correctly

Re: [Pacemaker] using xml for rules

2010-11-11 Thread Andrew Beekhof
On Sun, Oct 17, 2010 at 7:26 PM, Pavlos Parissis pavlos.paris...@gmail.com wrote: Hi, I am trying to make a rule to control the failback on the resources. I want during working days from 06:00 to 23:00 and on weekend from 08:00 to 16:00 to have resource-stickiness 1000 and on the left hours
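The time windows described in this post can be sketched in Python as a quick sanity check before encoding them as rules. This is only an illustration of the schedule, not Pacemaker rule syntax: the function name `in_sticky_window` is hypothetical, and treating the end of each window as exclusive is an assumption. The stickiness value for the remaining hours is cut off in the snippet above, so it is not modelled here.

```python
from datetime import datetime, time


def in_sticky_window(ts: datetime) -> bool:
    """Return True when ts falls inside the windows named in the post.

    Working days (Monday to Friday): 06:00 to 23:00.
    Weekend (Saturday, Sunday): 08:00 to 16:00.
    Assumption: the end of each window is exclusive.
    """
    if ts.weekday() < 5:  # 0 = Monday ... 4 = Friday
        start, end = time(6, 0), time(23, 0)
    else:
        start, end = time(8, 0), time(16, 0)
    return start <= ts.time() < end


if __name__ == "__main__":
    # 2010-10-18 was a Monday, 2010-10-17 a Sunday.
    print(in_sticky_window(datetime(2010, 10, 18, 7, 0)))
    print(in_sticky_window(datetime(2010, 10, 17, 17, 0)))
```

Inside the window the post asks for resource-stickiness 1000; a check like this only helps confirm which window a given timestamp belongs to.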

Re: [Pacemaker] stand_alone_ping stop Node start

2010-11-11 Thread Andrew Beekhof
On Thu, Oct 14, 2010 at 3:47 AM, jiaju liu liujiaj...@yahoo.com.cn wrote: Hi I reboot my node, and it appears node2 pingd: [3932]: info: stand_alone_ping: Node 192.168.10.100 is unreachable (read) and the node could not start 192.168.10.100 is ib network I will start ib after the node

Re: [Pacemaker] Infinite fail-count and migration-threshold after node fail-back

2010-11-11 Thread Andrew Beekhof
On Mon, Oct 11, 2010 at 9:40 AM, Dan Frincu dfri...@streamwide.ro wrote: Hi all, I've managed to make this setup work, basically the issue with a symmetric-cluster=false and specifying the resources' location manually means that the resources will always obey the location constraint, and (as

Re: [Pacemaker] Multistate Resources is not promoted automatically

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 11:38 AM, Simon Jansen simon.jans...@googlemail.com wrote: Hi Andrew, thank you for your answer. Does the ocf:heartbeat:Rsyslog script call crm_master? It needs to to tell pacemaker which instance to promote. Yes it does. But I forgot to call crm_master with the

Re: [Pacemaker] [Problem]Number of times control of the fail-count is late.

2010-11-11 Thread Andrew Beekhof
On Wed, Nov 10, 2010 at 5:20 AM, renayama19661...@ybb.ne.jp wrote: Hi, We built a two-node cluster and set migration-threshold to 2. We confirmed the following phenomenon with this procedure. Step1) Start two nodes and send config5.crm. (The clnDiskd-resources is

Re: [Pacemaker] [Question]About the recovery procedure from the state that a node was divided.

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 4, 2010 at 2:44 AM, renayama19661...@ybb.ne.jp wrote: Hi All, We tested it about the recovery procedure from the state that a node was divided. (As for four nodes, three nodes are active, and one node is constitution of the standby.) It is the restoration from a state

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread nozawat
Hi Andrew, I'm sorry. The contents of the core file were wrong. The correct contents are as follows. - $ gdb /usr/lib64/heartbeat/pengine core.27920 GNU gdb Fedora (6.8-37.el5) Copyright (C) 2008 Free Software Foundation, Inc. License GPLv3+: GNU GPL version 3 or later

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 12:31 PM, nozawat noza...@gmail.com wrote: Hi Andrew, I ran it. However, an error was output. I suspect that glib is not working properly. I attached ha-log. It seems a library cannot be read properly. The contents of the core are as follows. you'll need

Re: [Pacemaker] Infinite fail-count and migration-threshold after node fail-back

2010-11-11 Thread Dan Frincu
Hi, Andrew Beekhof wrote: On Mon, Oct 11, 2010 at 9:40 AM, Dan Frincu dfri...@streamwide.ro wrote: Hi all, I've managed to make this setup work, basically the issue with a symmetric-cluster=false and specifying the resources' location manually means that the resources will always obey the

[Pacemaker] drbd-xen and fencing

2010-11-11 Thread Vadym Chepkov
Hi, I posted a less elaborate version of this question to the drbd mailing list, but, unfortunately, didn't get a reply; maybe the audience of this list has more experience. I am trying to make xen live migration work reliably, but wasn't successful so far. Here is the problem. In a cluster

Re: [Pacemaker] Infinite fail-count and migration-threshold after node fail-back

2010-11-11 Thread Pavlos Parissis
On 11 November 2010 13:04, Dan Frincu dfri...@streamwide.ro wrote: Hi, Andrew Beekhof wrote: On Mon, Oct 11, 2010 at 9:40 AM, Dan Frincu dfri...@streamwide.ro wrote: Hi all, I've managed to make this setup work, basically the issue with a symmetric-cluster=false and specifying the

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread nozawat
Hi Andrew, Sorry, pengine output a core. (gdb) where #0 0x2b247fa8b53a in g_list_nth_data () from /lib64/libglib-2.0.so.0 #1 0x2b247ebc5027 in g_hash_table_iter_next (iter=0x7fff1568e4c0, key=0x0, value=0x7fff1568e4e0) at ../include/crm/common/util.h:348 #2 0x2b247ebc9301

Re: [Pacemaker] Pacemaker-1.1.4, when?

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 3:50 PM, nozawat noza...@gmail.com wrote: Hi Andrew, Sorry, pengine output a core. could you go up to frame #1 and run: print *iter print *values (gdb) where #0 0x2b247fa8b53a in g_list_nth_data () from /lib64/libglib-2.0.so.0 #1 0x2b247ebc5027 in

Re: [Pacemaker] Infinite fail-count and migration-threshold after node fail-back

2010-11-11 Thread Dan Frincu
Hi, Pavlos Parissis wrote: On 11 November 2010 13:04, Dan Frincu dfri...@streamwide.ro wrote: Hi, Andrew Beekhof wrote: On Mon, Oct 11, 2010 at 9:40 AM, Dan Frincu dfri...@streamwide.ro wrote: Hi all, I've managed to make this setup work, basically the issue with a

Re: [Pacemaker] stand_alone_ping stop Node start

2010-11-11 Thread jiaju liu
?product=Pacemaker -- next part -- An HTML attachment was scrubbed... URL: http://oss.clusterlabs.org/pipermail/pacemaker/attachments/2010/4d7f3ea1/attachment-0001.htm -- Message: 3 Date: Thu, 11 Nov 2010 11:38:24 +0100 From: Simon Jansen

[Pacemaker] start filesystem like this is right?

2010-11-11 Thread jiaju liu
start resource steps step(1) crm configure primitive vol_mpath0 ocf:heartbeat:Filesystem meta target-role=stopped params device=/dev/mapper/mpath0 directory=/mnt/mapper/mpath0 fstype='lustre' op start timeout=300s  op stop timeout=120s op monitor timeout=120s interval=60s op notify timeout=60s

Re: [Pacemaker] Balancing of clone resources (globally-unique=true)

2010-11-11 Thread Chris Picton
I have attached the output as requested On Thu, 11 Nov 2010 11:21:51 +0100, Andrew Beekhof wrote: what version is this? This is 1.0.9 Odd. I wouldn't have expected this behavior. Can you attach the output from cibadmin -Ql please? On Tue, Nov 9, 2010 at 5:51 PM, Chris Picton

Re: [Pacemaker] symmetric anti-collocation

2010-11-11 Thread Andrew Beekhof
On Thu, Nov 11, 2010 at 9:02 PM, Alan Jones falanclus...@gmail.com wrote: How do I express symmetric anti-collocation in Pacemaker 1.0.9.1? I'd like to write two rules: colocation X-Y -2: X Y colocation Y-X -2: Y X the second one is implied by the first and is therefore redundant The idea