Re: [Pacemaker] chicken-egg-problem with libvirtd and a VM within cluster

2012-10-12 Thread Florian Haas
On Fri, Oct 12, 2012 at 3:18 AM, Andrew Beekhof and...@beekhof.net wrote: This has been a topic that has popped up occasionally over the years. Unfortunately we still don't have a good answer for you. The least worst practice has been to have the RA return OCF_STOPPED for non-recurring

Re: [Pacemaker] FYI/RFC: Name for 'system service' alias

2012-06-26 Thread Florian Haas
On Mon, Jun 25, 2012 at 1:40 PM, Andrew Beekhof and...@beekhof.net wrote: I've added the concept of a 'system service' that expands to whatever standard the local machine supports. So you could say, in xml, primitive id=Magic class=system type=mysql and the cluster would use 'lsb' on RHEL,

Re: [Pacemaker] How to write a master/slave resource-script

2012-06-16 Thread Florian Haas
On 06/15/12 16:37, Andrew Beekhof wrote: On Fri, Jun 15, 2012 at 12:19 AM, Stallmann, Andreas astallm...@conet.de wrote: Hi! Excuse my blindness; I found the „Stateful“ script, which is obviously the template / skeleton I was looking for. Unfortunately it comes without explanation. Does

Re: [Pacemaker] General question about pacemaker

2012-06-10 Thread Florian Haas
On Sun, Jun 10, 2012 at 3:07 PM, Stefan Günther smguent...@web.de wrote: Hello, I have a general question about the features of pacemaker. We are planning to set up an HA solution with pacemaker, corosync and drbd. After a failure of the master and its later recovery, drbd will sync the data

Re: [Pacemaker] Problem with state: UNCLEAN (OFFLINE)

2012-06-08 Thread Florian Haas
On Fri, Jun 8, 2012 at 1:01 PM, Juan M. Sierra jmsie...@cica.es wrote: Problem with state: UNCLEAN (OFFLINE) Hello, I'm trying to get up a directord service with pacemaker. But, I found a problem with the unclean (offline) state. The initial state of my cluster was this: Online: [ node2

Re: [Pacemaker] KVM DRBD and Pacemaker

2012-06-05 Thread Florian Haas
On Tue, Jun 5, 2012 at 1:55 AM, Cliff Massey cliffm...@cliffmassey.com wrote: My config is: http://pastebin.com/5qYiHe56 Yep, you completely forgot your order and colo constraints. You need those to tie your foo-kvm primitive to its corresponding ms-foo master/slave set.
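For readers skimming this index, the two constraints mentioned in that reply might look roughly like the following crm shell sketch. The resource names foo-kvm and ms-foo come from the message; the constraint IDs are hypothetical, and the poster's actual configuration (in the pastebin) may call for different names or scores.

```
# Hypothetical constraint IDs; resource names as used in the reply above.
# Run foo-kvm only on the node where ms-foo holds the Master role.
colocation foo-kvm-on-ms-foo-master inf: foo-kvm ms-foo:Master
# Start foo-kvm only after ms-foo has been promoted.
order ms-foo-promote-before-foo-kvm inf: ms-foo:promote foo-kvm:start
```

Both lines would be entered in crm configure; the colocation pins the VM to the DRBD Master, and the order makes sure promotion completes before the VM starts.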

Re: [Pacemaker] Announce: pcs / pcs-gui (Pacemaker/Corosync Configuration System)

2012-06-05 Thread Florian Haas
On Mon, Jun 4, 2012 at 3:21 AM, Andrew Beekhof and...@beekhof.net wrote: On Sat, Jun 2, 2012 at 12:56 AM, Florian Haas flor...@hastexo.com wrote: On Fri, Jun 1, 2012 at 1:40 AM, Chris Feist cfe...@redhat.com wrote: I'd like to announce the existence of the Pacemaker/Corosync configuration

Re: [Pacemaker] Announce: pcs / pcs-gui (Pacemaker/Corosync Configuration System)

2012-06-05 Thread Florian Haas
On Mon, Jun 4, 2012 at 1:02 PM, Lars Marowsky-Bree l...@suse.com wrote: I am getting a slightly defensive-to-aggressive vibe from your response to Florian. Can we tune that down? I much prefer to do the shouting at each other in person, because then the gestures come across much more vividly

Re: [Pacemaker] Announce: pcs / pcs-gui (Pacemaker/Corosync Configuration System)

2012-06-05 Thread Florian Haas
On Tue, Jun 5, 2012 at 1:43 AM, Andrew Beekhof and...@beekhof.net wrote: On Mon, Jun 4, 2012 at 9:02 PM, Lars Marowsky-Bree l...@suse.com wrote: On 2012-06-04T11:21:57, Andrew Beekhof and...@beekhof.net wrote: Hi Andrew, I am getting a slightly defensive-to-aggressive vibe from your response

Re: [Pacemaker] [Help] Pacemaker + Oracle Listener

2012-06-05 Thread Florian Haas
On Wed, Jun 6, 2012 at 12:44 AM, Paul Damken zen.su...@gmail.com wrote: I'm facing issues with my cluster setup. N+1 Pacemaker Hosting Oracle 11g Instances. Node name azteca I cannot get oralsnr to start my DB listener, it refuses on both nodes. Oracle RA is starting first, after all File

Re: [Pacemaker] KVM DRBD and Pacemaker

2012-06-04 Thread Florian Haas
On Mon, Jun 4, 2012 at 9:51 PM, Cliff Massey cliffm...@cliffmassey.com wrote:  I am trying to setup a cluster consisting of KVM DRBD and pacemaker. Without pacemaker DRBD and KVM are working. I can even stop everything on one node, promote the other to drbd primary and start the KVM machine

Re: [Pacemaker] [RFC] [Patch] DC node preferences (dc-priority)

2012-05-25 Thread Florian Haas
On Fri, May 25, 2012 at 10:45 AM, Lars Ellenberg lars.ellenb...@linbit.com wrote: Sorry, sent too early. That would not catch the case of cluster partitions joining, only the pacemaker startup with fully connected cluster communication already up. I thought about a dc-priority default of

Re: [Pacemaker] [RFC] [Patch] DC node preferences (dc-priority)

2012-05-25 Thread Florian Haas
On Fri, May 25, 2012 at 11:38 AM, Lars Ellenberg lars.ellenb...@linbit.com wrote: On Fri, May 25, 2012 at 11:15:32AM +0200, Florian Haas wrote: On Fri, May 25, 2012 at 10:45 AM, Lars Ellenberg lars.ellenb...@linbit.com wrote: Sorry, sent too early. That would not catch the case of cluster

Re: [Pacemaker] DRBD LVM EXT4 NFS performance

2012-05-21 Thread Florian Haas
On Sun, May 20, 2012 at 12:05 PM, Christoph Bartoschek po...@pontohonk.de wrote: Hi, we have a two node setup with drbd below LVM and an Ext4 filesystem that is shared via NFS. The system shows low performance and lots of timeouts resulting in unnecessary failovers from pacemaker. The

Re: [Pacemaker] Is synchronizing rmtab needed?

2012-05-21 Thread Florian Haas
On Mon, May 21, 2012 at 1:36 AM, Christoph Bartoschek po...@pontohonk.de wrote: Hi, we currently have the problem that when the NFS server is highly used the heartbeat:exportfs monitor script fails with a timeout because it cannot write the rmtab to the exported filesystem within the given

Re: [Pacemaker] question about stonith:external/libvirt

2012-05-21 Thread Florian Haas
On Sun, May 20, 2012 at 6:40 AM, Matthew O'Connor m...@ecsorl.com wrote: After using the tutorial on the Hastexo site for setting up stonith via libvirt, I believe I have it working correctly...but...some strange things are happening.  I have two nodes, with shared storage provided by a

Re: [Pacemaker] question about stonith:external/libvirt

2012-05-21 Thread Florian Haas
On Mon, May 21, 2012 at 8:14 PM, Matthew O'Connor m...@ecsorl.com wrote: On 05/21/2012 05:43 AM, Florian Haas wrote: Does it have fencing resource-and-stonith in the DRBD configuration, and stonith_admin-fence-peer.sh as its fence-peer handler? That was the problem.  Totally forgot to update

Re: [Pacemaker] Can Corosync bind to two networks

2012-05-12 Thread Florian Haas
On Sat, May 12, 2012 at 2:49 AM, Steve Davidson steve.david...@pearl.com wrote: We want to run the Corosync heartbeat on the private net and, as a backup heartbeat, allow Corosync heartbeat on our public net as well. Thus in /etc/corosync/corosync.conf we need something like:

Re: [Pacemaker] pacemaker+ocfs2 +RAC

2012-04-05 Thread Florian Haas
On Mon, Apr 2, 2012 at 7:00 AM, Ruwan Fernando ruwanm...@gmail.com wrote: Hi, I was required to build oracle cluster so I configured pacemaker+corosync+drbd+ocfs2 and built Active-active cluster. Why? pacemaker+corosync+drbd+xfs+oracle works just fine and is fully integrated with Pacemaker.

Re: [Pacemaker] socket is incremented after running crm shell

2012-04-03 Thread Florian Haas
On Tue, Apr 3, 2012 at 5:53 PM, David Vossel dvos...@redhat.com wrote: I see the same thing.  I'm using the latest pacemaker source from the master branch, so this definitely still exists.  For me the file leak occurs every time I issue a cibadmin --replace --xml-file command.  The shell is

Re: [Pacemaker] Pacemaker 1.1.7 now available

2012-04-02 Thread Florian Haas
On Mon, Apr 2, 2012 at 11:33 AM, Andrew Beekhof and...@beekhof.net wrote: On Fri, Mar 30, 2012 at 8:33 PM, Florian Haas flor...@hastexo.com wrote: On Fri, Mar 30, 2012 at 10:37 AM, Andrew Beekhof and...@beekhof.net wrote: I blogged about it, which automatically got sent to twitter, and I

Re: [Pacemaker] Corosync with puppet

2012-04-02 Thread Florian Haas
On Mon, Apr 2, 2012 at 11:34 AM, Hugo Deprez hugo.dep...@gmail.com wrote: Dear community, I am using a puppet mode in order to manage my cluster. I get a weird thing with the start stop of the corosync daemon. When I modify the corosync.conf file, puppet is asked to restart / reload

Re: [Pacemaker] OCF_RESKEY_CRM_meta_{ordered,notify,interleave}

2012-04-02 Thread Florian Haas
On Mon, Apr 2, 2012 at 11:54 AM, Andrew Beekhof and...@beekhof.net wrote: On Fri, Mar 30, 2012 at 7:34 PM, Florian Haas flor...@hastexo.com wrote: On Fri, Mar 30, 2012 at 1:12 AM, Andrew Beekhof and...@beekhof.net wrote: Because it was felt that RAs shouldn't need to know. Those options change

Re: [Pacemaker] OCF_RESKEY_CRM_meta_{ordered,notify,interleave}

2012-04-02 Thread Florian Haas
On Mon, Apr 2, 2012 at 12:32 PM, Andrew Beekhof and...@beekhof.net wrote: Well, but you did read the technical reason I presented here? Yes, and it boiled down to don't let the user hang themselves. Which is a noble goal, I just don't like the way we're achieving it. Why not advertise the

Re: [Pacemaker] Migration of lower resource causes dependent resources to restart

2012-03-30 Thread Florian Haas
On Thu, Mar 29, 2012 at 8:35 AM, Andrew Beekhof and...@beekhof.net wrote: On Thu, Mar 29, 2012 at 5:28 PM, Vladislav Bogdanov bub...@hoster-ok.com wrote: Hi Andrew, all, Pacemaker restarts resources when resource they depend on (ordering only, no colocation) is migrated. I mean that when I

Re: [Pacemaker] OCF_RESKEY_CRM_meta_{ordered,notify,interleave}

2012-03-30 Thread Florian Haas
On Fri, Mar 30, 2012 at 1:12 AM, Andrew Beekhof and...@beekhof.net wrote: Because it was felt that RAs shouldn't need to know. Those options change pacemaker's behaviour, not the RAs. But subsequently, in lf#2391, you convinced us to add notify since it allowed the drbd agent to error out if

Re: [Pacemaker] Pacemaker 1.1.7 now available

2012-03-30 Thread Florian Haas
On Fri, Mar 30, 2012 at 10:37 AM, Andrew Beekhof and...@beekhof.net wrote: I blogged about it, which automatically got sent to twitter, and I updated the IRC channel topic, but alas I forgot to mention it here :-) So in case you missed it, 1.1.7 is finally out. Special mention is due to

Re: [Pacemaker] Nodes not rejoining cluster

2012-03-30 Thread Florian Haas
On Fri, Mar 30, 2012 at 5:38 PM, Gregg Stock gr...@damagecontrolusa.com wrote: I took the last 200 lines of each. Can you check the health of the Corosync membership, as per this URL? http://www.hastexo.com/resources/hints-and-kinks/checking-corosync-cluster-membership Do _all_ nodes agree on

Re: [Pacemaker] Nodes not rejoining cluster

2012-03-30 Thread Florian Haas
On Fri, Mar 30, 2012 at 6:09 PM, Gregg Stock gr...@damagecontrolusa.com wrote: That looks good. They were all the same and had the correct ip addresses. So you've got both healthy rings, and all 5 nodes have 5 members in the membership list? Then this would make it a Pacemaker problem. IIUC the

Re: [Pacemaker] manually failing back resources when set sticky

2012-03-30 Thread Florian Haas
On Fri, Mar 30, 2012 at 8:26 PM, Brian J. Murrell br...@interlinx.bc.ca wrote: In my cluster configuration, each resource can be run on one of two node and I designate a primary and a secondary using location constraints such as: location FOO-primary FOO 20: bar1 location FOO-secondary FOO

Re: [Pacemaker] Issue with ordering

2012-03-29 Thread Florian Haas
On Thu, Mar 29, 2012 at 10:07 AM, Vladislav Bogdanov bub...@hoster-ok.com wrote: Hi Andrew, all, I'm continuing experiments with lustre on stacked drbd, and see following problem: At the risk of going off topic, can you explain *why* you want to do this? If you need a distributed, replicated

Re: [Pacemaker] Issue with ordering

2012-03-29 Thread Florian Haas
On Thu, Mar 29, 2012 at 11:40 AM, Vladislav Bogdanov bub...@hoster-ok.com wrote: Hi Florian, 29.03.2012 11:54, Florian Haas wrote: On Thu, Mar 29, 2012 at 10:07 AM, Vladislav Bogdanov bub...@hoster-ok.com wrote: Hi Andrew, all, I'm continuing experiments with lustre on stacked drbd

Re: [Pacemaker] resources show as running on all nodes right after adding them

2012-03-28 Thread Florian Haas
On Wed, Mar 28, 2012 at 4:26 PM, Brian J. Murrell br...@interlinx.bc.ca wrote: We seem to have occasion where we find crm_resource reporting that a resource is running on more (usually all!) nodes when we query right after adding it: # crm_resource --resource chalkfs-OST_3 --locate

Re: [Pacemaker] resources show as running on all nodes right after adding them

2012-03-28 Thread Florian Haas
On Wed, Mar 28, 2012 at 5:07 PM, Brian J. Murrell br...@interlinx.bc.ca wrote: On 12-03-28 10:39 AM, Florian Haas wrote: Probably because your resource agent reports OCF_SUCCESS on a probe operation To be clear, is this the status $OP in the agent? Nope, monitor. Of course, in your

[Pacemaker] High Performance High Availability Guide: new community documentation project

2012-03-23 Thread Florian Haas
Hi everyone, for those interested in contributing to a community documentation project focusing on performance optimization in high availability clusters, please take a look at the following URLs: https://github.com/fghaas/hp-ha-guide (GitHub repo) http://www.hastexo.com/node/173 (blog post --

Re: [Pacemaker] Resource-level fencing without stonith?

2012-03-23 Thread Florian Haas
On Fri, Mar 23, 2012 at 6:07 PM, Lajos Pajtek lajospaj...@yahoo.com wrote: Hi, I am building a two-node, active-standby cluster with shared storage. I think I got the basic primitives right, but fencing, implemented using SCSI persistent reservations, gives me some headache. First, I am

Re: [Pacemaker] Resource Agent ethmonitor

2012-03-21 Thread Florian Haas
On Tue, Mar 20, 2012 at 4:18 PM, Fiorenza Meini fme...@esseweb.eu wrote: Hi there, has anybody configured successfully the RA specified in the object of the message? I got this error: if_eth0_monitor_0 (node=fw1, call=2297, rc=-2, status=Timed Out): unknown exec error Your ethmonitor RA

Re: [Pacemaker] [Openstack] Howto Nova setup with HA?

2012-03-21 Thread Florian Haas
Hi everyone, apologies for the cross-post; I believe this might be interesting to people on both the openstack and the pacemaker lists. Please see below. On Tue, Feb 14, 2012 at 9:07 AM, i3D.net - Tristan van Bokkem tristanvanbok...@i3d.nl wrote: Hi Stackers, It seems running Openstack

Re: [Pacemaker] How can I preview the shadow configuration?

2012-03-20 Thread Florian Haas
On Tue, Mar 20, 2012 at 11:15 AM, Rasto Levrinc rasto.levr...@gmail.com wrote: 2012/3/20 Mars gu gukaicoros...@163.com: Hi, I want to execute the command, but this problem occurred: [root@h10_148 ~]# ptest -bash: ptest: command not found How can I preview the shadow configuration? ptest

Re: [Pacemaker] Using shadow configurations noninteractively

2012-03-19 Thread Florian Haas
On Mon, Mar 19, 2012 at 8:00 PM, Phil Frost p...@macprofessionals.com wrote: I'm attempting to automate my cluster configuration with Puppet. I'm already using Puppet to manage the configuration of my Xen domains. I'd like to instruct puppet to apply the configuration (via cibadmin) to a

Re: [Pacemaker] How to setup STONITH in a 2-node active/passive linux HA pacemaker cluster?

2012-03-19 Thread Florian Haas
On Mon, Mar 19, 2012 at 8:14 PM, Mathias Nestler mathias.nest...@barzahlen.de wrote: Hi everyone, I am trying to setup an active/passive (2 nodes) Linux-HA cluster with corosync and pacemaker to hold a PostgreSQL-Database up and running. It works via DRBD and a service-ip. If node1 fails,

Re: [Pacemaker] Using shadow configurations noninteractively

2012-03-19 Thread Florian Haas
On Mon, Mar 19, 2012 at 9:00 PM, Phil Frost p...@macprofessionals.com wrote: On Mar 19, 2012, at 15:22 , Florian Haas wrote: On Mon, Mar 19, 2012 at 8:00 PM, Phil Frost p...@macprofessionals.com wrote: I'm attempting to automate my cluster configuration with Puppet. I'm already using Puppet

Re: [Pacemaker] offtopic scalable block-device

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 10:13 AM, ruslan usifov ruslan.usi...@gmail.com wrote: Hello, I am searching for a solution for a scalable block device (a disk that can be extended if we add machines to the cluster). The only thing I have found acceptable for my task is Ceph + RBD, but in my tests Ceph is very unstable (regularly crash

Re: [Pacemaker] offtopic scalable block-device

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 11:06 AM, Vladislav Bogdanov bub...@hoster-ok.com wrote: 16.03.2012 12:13, ruslan usifov wrote: Hello, I am searching for a solution for a scalable block device (a disk that can be extended if we add machines to the cluster). The only thing I have found acceptable for my task is Ceph + RBD, but Ceph

Re: [Pacemaker] offtopic scalable block-device

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 11:14 AM, Lars Marowsky-Bree l...@suse.com wrote: On 2012-03-16T11:13:17, Florian Haas flor...@hastexo.com wrote: Which Ceph version are you using? Both the Ceph daemons and RBD are fully integrated into Pacemaker in upstream git. https://github.com/ceph/ceph/tree

Re: [Pacemaker] offtopic scalable block-device

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 12:50 PM, ruslan usifov ruslan.usi...@gmail.com wrote: On crash I have the following stack trace How about taking that to the ceph-devel list? Florian

Re: [Pacemaker] offtopic scalable block-device

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 12:42 PM, Lars Marowsky-Bree l...@suse.com wrote: On 2012-03-16T11:28:36, Florian Haas flor...@hastexo.com wrote: is there a reason for integrating ceph with pacemaker? ceph does internal monitoring of OSTs etc anyway, doesn't it? Assuming you're referring to OSDs

Re: [Pacemaker] offtopic scalable block-device

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 12:24 PM, ruslan usifov ruslan.usi...@gmail.com wrote: Lustre looks very cool and stable, but it doesn't provide a scalable block device (Ceph allows it through RBD) and requires a patched kernel (I can't find more modern patched kernels for Ubuntu Lucid), so I think that it

Re: [Pacemaker] offtopic scalable block-device

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 4:55 PM, Lars Marowsky-Bree l...@suse.com wrote: On 2012-03-16T13:36:34, Florian Haas flor...@hastexo.com wrote: Would this not be more readily served by a simple while loop doing the monitoring, even if systemd/upstart aren't around? Pacemaker is kind of a heavy

Re: [Pacemaker] getting started - crm hangs when adding resources, even crm ra classes hangs

2012-03-14 Thread Florian Haas
On Wed, Mar 14, 2012 at 2:16 PM, Dejan Muhamedagic deja...@fastmail.fm wrote: Hi, On Tue, Mar 13, 2012 at 05:59:35PM -0400, Phillip Frost wrote: On Mar 13, 2012, at 2:21 PM, Jake Smith wrote: From: Phillip Frost p...@macprofessionals.com Subject: [Pacemaker] getting started - crm hangs

Re: [Pacemaker] getting started - crm hangs when adding resources, even crm ra classes hangs

2012-03-14 Thread Florian Haas
On Wed, Mar 14, 2012 at 2:37 PM, Phillip Frost p...@macprofessionals.com wrote: On Mar 14, 2012, at 9:25 AM, Florian Haas wrote: Do you have upstart at all? In that case, the debian package shouldn't have the upstart enabled when building cluster-glue. The current cluster-glue package

Re: [Pacemaker] getting started - crm hangs when adding resources, even crm ra classes hangs

2012-03-14 Thread Florian Haas
On Wed, Mar 14, 2012 at 4:58 PM, Phillip Frost p...@macprofessionals.com wrote: Can you confirm that you're running the ~bpo60+2 (note trailing 2) build, that you're actually running an lrmd binary from that version (meaning: that you properly killed your lrmd prior to installing that

Re: [Pacemaker] 1.1.6 rpm build for RHEL5

2012-03-10 Thread Florian Haas
On Sat, Mar 10, 2012 at 12:39 AM, Larry Brigman larry.brig...@gmail.com wrote: I have looked and cannot seem to find the pre-built 1.1.6 rpm set in the clusterlabs repo. It ships with RHEL/CentOS 6.2. On RHEL 5 however, 1.1.6 doesn't build. If you don't want to wait for 1.1.7, you'll either

Re: [Pacemaker] DRBD M/S Promotion

2012-03-10 Thread Florian Haas
On Fri, Mar 9, 2012 at 11:24 PM, Scott Piazza scott.pia...@bespokess.com wrote: I have a two-node active/passive pacemaker cluster running with a single DRBD resource set up as master-slave.  Today, we restarted both servers in the cluster, and when they came back up, both started pacemaker and

Re: [Pacemaker] Surprisingly fast start of resources on cluster failover.

2012-03-07 Thread Florian Haas
On Tue, Mar 6, 2012 at 1:49 PM, Florian Crouzat gen...@floriancrouzat.net wrote: I have Florian's rsyslog config: https://github.com/fghaas/pacemaker/blob/syslog/extra/rsyslog/pacemaker.conf.in I should mention that that rsyslog configuration is no longer being considered for upstream

[Pacemaker] What's the exact booth revision that ships in SLES 11 SP2?

2012-03-06 Thread Florian Haas
Jiaju, would you mind pushing your git tags to your GitHub booth repo? Currently, as far as I can see, there are no tags in that repo at all. It would be nice to be able to find out what exactly is the git revision that you guys ship in SP2. Thanks! Cheers, Florian

Re: [Pacemaker] stonith in a virtual cluster

2012-02-29 Thread Florian Haas
Jean-François, I realize I'm late to this discussion, however allow me to chime in here anyhow: On Mon, Feb 27, 2012 at 11:45 PM, Jean-Francois Malouin jean-francois.malo...@bic.mni.mcgill.ca wrote: Have you looked at fence_virt? http://www.clusterlabs.org/wiki/Guest_Fencing Yes I did. I

[Pacemaker] OCFS2 in Pacemaker, post Corosync 2.0

2012-02-29 Thread Florian Haas
Andrew, just a quick question out of curiosity: the ocf:pacemaker:o2cb resource and ocfs2_controld.pcmk require the OpenAIS CKPT service which is currently deprecated (as all of OpenAIS) and going away completely (IIUC) with Corosync 2.0. Does that mean that OCFS2 will be unsupported from

Re: [Pacemaker] Upstart resources

2012-02-28 Thread Florian Haas
2012/2/27 Ante Karamatić ante.karama...@canonical.com: On 27.02.2012 12:27, Florian Haas wrote: Alas, to the best of my knowledge the only way to change a specific job's respawn policy is by modifying its job definition. Likewise, that's the only way to enable or disable starting on system

Re: [Pacemaker] Question about master/slave resource promotion

2012-02-25 Thread Florian Haas
On Sat, Feb 25, 2012 at 12:31 AM, David Vossel dvos...@redhat.com wrote: Hey, I have a 2 node cluster with a multi-state master/slave resource. When the multi-state resources start up on each node they enter the Slave role.  At that point I can't figure out how to promote the resource to

Re: [Pacemaker] Last chance to object to the syntax for cluster tickets (multi-site clusters)

2012-02-24 Thread Florian Haas
On 02/24/12 02:53, Andrew Beekhof wrote: We're about to lock in the syntax for cluster tickets (used for multi-site clusters). The syntax rules are at: https://github.com/gao-yan/pacemaker/commit/9e492f6231df2d8dd548f111a2490f02822b29ea And its use, along with some examples, can be

Re: [Pacemaker] Pacemaker will not mount ocfs2

2012-02-24 Thread Florian Haas
On 02/24/12 08:50, Johan Rosing Bergkvist wrote: Hi Just an update. So I upgraded to pacemaker 1.1.6 and tried to configure it all again, without dlm. It didn't work, I still got the OCF_ERR_INSTALLED so I started looking through the setup and found that I didn't specify the drbd.conf path.

Re: [Pacemaker] Pacemaker will not mount ocfs2

2012-02-24 Thread Florian Haas
On 02/24/12 09:21, Johan Rosing Bergkvist wrote: Sorry, parameter, you're right. But still, it didn't mount until I added the drbdconf parameter. primitive clusterDRBD ocf:linbit:drbd \ params drbd_resource=cluster-ocfs *drbdconf=/etc/drbd.conf *#This is what I added \ op

[Pacemaker] DRBD, Fedora, and systemd (tangent off of Re: Upstart resources)

2012-02-24 Thread Florian Haas
On 02/23/12 23:48, Andrew Beekhof wrote: On Thu, Feb 23, 2012 at 6:31 PM, Ante Karamatic iv...@ubuntu.com wrote: On 23.02.2012 00:10, Andrew Beekhof wrote: Do you still have LSB scripts on a machine that's using upstart? Yes, some LSB scripts can't be easily converted to upstart jobs. Or,

Re: [Pacemaker] Pacemaker will not mount ocfs2

2012-02-21 Thread Florian Haas
On 02/21/12 13:39, Johan wrote: I've been following this http://publications.jbfavre.org/virtualisation/cluster-xen-corosync-pacemaker-drbd-ocfs2.en tutorial on how to set up a Pacemaker Xen cluster. I'm all new to this, so please bear with me. The big problem is that when I get to the

Re: [Pacemaker] Requesting re-sync

2012-02-21 Thread Florian Haas
On Tue, Feb 21, 2012 at 3:57 PM, Pieter Baele pieter.ba...@gmail.com wrote: After upgrading a node (RHEL 6.1 to 6.2), my /var/log/messages grows really really fast because of this error, what can be wrong? So you upgraded just one node, and the other is still unchanged? Can you give the

Re: [Pacemaker] Upstart resources

2012-02-21 Thread Florian Haas
Jake, sorry, I missed your original post due to travel; let me toss in one more thing here: On Tue, Feb 21, 2012 at 3:32 PM, Jake Smith jsm...@argotec.com wrote: Are upstart jobs expected to conform to the LSB spec with regards to exit codes, etc? Is there any reference documentation using

Re: [Pacemaker] Pacemaker will not mount ocfs2

2012-02-21 Thread Florian Haas
On Tue, Feb 21, 2012 at 4:22 PM, Dejan Muhamedagic deja...@fastmail.fm wrote: Hi, On Tue, Feb 21, 2012 at 02:26:31PM +0100, Florian Haas wrote: On 02/21/12 13:39, Johan wrote: I keep getting the:  info: RA output: (Cluster-FS-Mount:1:start:stderr) FATAL: Module scsi_hostadapter

Re: [Pacemaker] problems with cman + corosync + pacemaker in debian

2012-02-19 Thread Florian Haas
On 02/18/12 10:59, diego fanesi wrote: are you saying I can install drbd + gfs2 + pacemaker without using cman? It seems that gfs2 depends on cman... Only on RHEL/CentOS/Fedora. Not on Debian. I want to realize active/active cluster and I'm following the document cluster from scratch that

Re: [Pacemaker] Resource inter-dependency without being a 'group'

2012-02-18 Thread Florian Haas
On Sat, Feb 18, 2012 at 7:19 PM, David Coulson da...@davidcoulson.net wrote: I have an active/active LVS cluster, which uses pacemaker for managing IP resources. Currently I have one environment running on it which utilizes ~30 IP addresses, so a group was created so all resources could be

Re: [Pacemaker] problems with cman + corosync + pacemaker in debian

2012-02-17 Thread Florian Haas
On Sun, Feb 12, 2012 at 10:01 PM, diego fanesi diego.fan...@gmail.com wrote: Hi, I'm trying to install corosync with pacemaker using drbd + gfs2 with cman support. Why? GFS2 with dual-Primary DRBD with Pacemaker 1.1.6 is working very well in squeeze-backports with the dlm_controld.pcmk and

Re: [Pacemaker] Percona Replication Manager

2012-02-10 Thread Florian Haas
On Fri, Feb 10, 2012 at 1:38 PM, Nick Khamis sym...@gmail.com wrote: May I ask where the original blog resides? The one with the berserk blog comments http://www.lmgtfy.com/?q=percona+replication+manager&l=1 SCNR. :) Florian

Re: [Pacemaker] LVM Setup

2012-01-26 Thread Florian Haas
On Wed, Jan 25, 2012 at 6:49 PM, Gregg Stock gr...@damagecontrolusa.com wrote: Hi, I'm trying to setup a 5 node cluster, the same topology as described in Roll Your Own Cloud: Enterprise Virtualization with KVM, DRBD, iSCSI and Pacemaker

Re: [Pacemaker] MySQL Master-Master replication with Corosync and Pacemaker

2012-01-25 Thread Florian Haas
On Thu, Jan 26, 2012 at 12:43 AM, Peter Scott pe...@psdt.com wrote: Hello. Our problem is that a Corosync restart on the idle machine in a 2-node cluster shuts down the mysqld process there and we need it to stay up for replication. Well if you just want to restart Corosync by administrative

Re: [Pacemaker] [PATCH 0/2] rsyslog/logrotate configuration snippets

2012-01-15 Thread Florian Haas
On Sun, Jan 15, 2012 at 9:27 PM, Andrew Beekhof and...@beekhof.net wrote: On Thu, Jan 12, 2012 at 11:01 PM, Florian Haas flor...@hastexo.com wrote: On Thu, Jan 5, 2012 at 10:15 PM, Florian Haas flor...@hastexo.com wrote: Florian Haas (2):      extra: add rsyslog configuration snippet

Re: [Pacemaker] [PATCH 0/2] rsyslog/logrotate configuration snippets

2012-01-15 Thread Florian Haas
On Mon, Jan 16, 2012 at 10:59 AM, Andrew Beekhof and...@beekhof.net wrote: By Nuclear, I meant nothing at all from Pacemaker. Which is not what it does. If that's what you want, there's a far easier way to achieve this and keep usable logs around for debugging, set facility to none and add a

Re: [Pacemaker] need cluster-wide variables

2012-01-12 Thread Florian Haas
On Tue, Jan 10, 2012 at 10:24 PM, Arnold Krille arn...@arnoldarts.de wrote: Is it possible for slaves to modify their score for promotion? I think that would be an interesting feature. Probably something like that could already be achieved with dependency-rules and variables. But I think a

Re: [Pacemaker] [PATCH 0/2] rsyslog/logrotate configuration snippets

2012-01-12 Thread Florian Haas
On Thu, Jan 5, 2012 at 10:15 PM, Florian Haas flor...@hastexo.com wrote: Florian Haas (2):      extra: add rsyslog configuration snippet      extra: add logrotate configuration snippet  configure.ac                      |    4 +++  extra/Makefile.am                 |    2 +-  extra

Re: [Pacemaker] [PATCH 0/2] rsyslog/logrotate configuration snippets

2012-01-12 Thread Florian Haas
On Thu, Jan 12, 2012 at 2:15 PM, Vladislav Bogdanov bub...@hoster-ok.com wrote: I marked that message as Important and will include into my builds even if it does not go upstream. One question - does it break default hb_report and crm_report behavior? Good point. I presume it would make sense

Re: [Pacemaker] Configuring 3rd Node as Quorum Node in 2 Node Cluster

2012-01-10 Thread Florian Haas
On Wed, Jan 11, 2012 at 1:44 AM, Andrew Beekhof and...@beekhof.net wrote: On Wed, Jan 11, 2012 at 3:30 AM, Andrew Martin amar...@xes-inc.com wrote: 3. Limit the DRBD, nfs, and smbd resources to only node1 and node2 by adding a location rule for the g_nfs group (which includes p_fs_drbd0

Re: [Pacemaker] Cannot Create Primitive in CRM Shell

2012-01-09 Thread Florian Haas
On Mon, Jan 9, 2012 at 11:42 AM, Dan Frincu df.clus...@gmail.com wrote: Hi, On Fri, Jan 6, 2012 at 11:24 PM, Andrew Martin amar...@xes-inc.com wrote: Hello, I am working with DRBD + Heartbeat + Pacemaker to create a 2-node highly-available cluster. I have been following this official guide

Re: [Pacemaker] Resource ping fails on passive node after upgrading to second nic

2012-01-09 Thread Florian Haas
Stefan, sorry, your report triggers a complete -EPARSE in my brain. On Mon, Jan 9, 2012 at 10:38 AM, Senftleben, Stefan (itsc) stefan.senftle...@itsc.de wrote: Hello everybody, last week I installed and configured in each cluster node a second network interface. After configuring the

Re: [Pacemaker] Resource ping fails on passive node after upgrading to second nic

2012-01-09 Thread Florian Haas
On Mon, Jan 9, 2012 at 2:01 PM, Senftleben, Stefan (itsc) stefan.senftle...@itsc.de wrote: This is the cibadmin dump of the active one: http://pastebin.com/Yg4Jsaxy You would see this in a crm_mon -rf: Failed actions: pri_ping:1_start_0 (node=lxds05, call=-1, rc=1, status=Timed Out):

Re: [Pacemaker] syslog full of redundand link messages

2012-01-09 Thread Florian Haas
On Mon, Jan 9, 2012 at 3:15 PM, Attila Megyeri amegy...@minerva-soft.com wrote: Hi, I might be taking something wrong, but, bindnetaddr: 10.100.1.255 does not mean it will listen on this address, but will listen on every interface where this mask matches. This is just to make the config

[Pacemaker] [PATCH 1/2] extra: add rsyslog configuration snippet

2012-01-05 Thread Florian Haas
--- configure.ac|4 extra/Makefile.am |2 +- extra/rsyslog/Makefile.am |5 + extra/rsyslog/pacemaker.conf.in | 39 +++ 4 files changed, 49 insertions(+), 1 deletions(-) create mode 100644

[Pacemaker] [PATCH 0/2] rsyslog/logrotate configuration snippets

2012-01-05 Thread Florian Haas
character from qb_ipc_response_header definition (2012-01-03 11:38:46 +1100) are also available in my git repository at: git://github.com/fghaas/pacemaker syslog Florian Haas (2): extra: add rsyslog configuration snippet extra: add logrotate configuration snippet configure.ac

[Pacemaker] [PATCH 2/2] extra: add logrotate configuration snippet

2012-01-05 Thread Florian Haas
---
 extra/Makefile.am | 2 +-
 extra/logrotate/Makefile.am | 5 +
 extra/logrotate/pacemaker.conf.in | 7 +++
 3 files changed, 13 insertions(+), 1 deletions(-)
 create mode 100644 extra/logrotate/Makefile.am
 create mode 100644

Re: [Pacemaker] Patch: use NFSv4 with RA nfsserver

2011-12-27 Thread Florian Haas
On Tue, Dec 27, 2011 at 12:05 PM, Vogt Josef josef.v...@telecom.li wrote: Hi all, I wrote a patch to the resource agent nfsserver which deals with NFSv4 (see attachment). It's now possible to use either NFSv3 or NFSv4 with this resource agent. Any specific reason for not using exportfs?

Re: [Pacemaker] Patch: use NFSv4 with RA nfsserver

2011-12-27 Thread Florian Haas
On Tue, Dec 27, 2011 at 3:30 PM, Vogt Josef josef.v...@telecom.li wrote: Just a question here: I couldn't get it to work without setting the gracetime - which isn't set in the exportfs RA. Are you sure this works as expected? Thanks, good input. I'd be happy to add that (as in,

Re: [Pacemaker] OCFS2 problems when connectivity lost

2011-12-21 Thread Florian Haas
2011/12/21 Ivan Savčić | Epix ivan.sav...@epix.rs: Hello, We are having a problem with a 3-node cluster based on Pacemaker/Corosync with 2 primary DRBD+OCFS2 nodes and a quorum node. Nodes run on Debian Squeeze, all packages are from the stable branch except for Corosync (which is from

Re: [Pacemaker] More then one stonith-resource on one node

2011-12-20 Thread Florian Haas
On Tue, Dec 20, 2011 at 3:42 PM, Marc K. marcus.k...@stuttgart.de wrote: Hello all, I found an older posting from September this year, with the same problem: - a two node cluster - every node has two power supplies - power supply one is connected to wti-powerswitch 1 - power supply two

Re: [Pacemaker] Doc: Resource templates

2011-12-12 Thread Florian Haas
On Mon, Dec 12, 2011 at 10:04 AM, Gao,Yan y...@suse.com wrote: On 12/12/11 15:55, Gao,Yan wrote: Hi, As some people have noticed, we've provided a new feature Resource templates since pacemaker-1.1.6. I made a document about it which is meant to be included into Pacemaker_Explained. I

Re: [Pacemaker] Doc: Resource templates

2011-12-12 Thread Florian Haas
On Mon, Dec 12, 2011 at 11:20 AM, Gao,Yan y...@suse.com wrote: On 12/12/11 17:52, Florian Haas wrote: On Mon, Dec 12, 2011 at 10:36 AM, Gao,Yan y...@suse.com wrote: On 12/12/11 17:16, Florian Haas wrote: On Mon, Dec 12, 2011 at 10:04 AM, Gao,Yan y...@suse.com wrote: On 12/12/11 15:55, Gao,Yan

Re: [Pacemaker] right way to update resource configuration on a live cluster?

2011-12-09 Thread Florian Haas
On Fri, Dec 9, 2011 at 10:25 PM, MA Martin Andrews (5542) mandr...@ag.com wrote: I have several heartbeat clusters running Centos 5 and heartbeat 2.1.4. Argll. Please: http://www.linux-ha.org/doc/users-guide/_upgrading_from_crm_enabled_heartbeat_2_1_clusters.html Is this procedure

Re: [Pacemaker] Fw: Unable to start pacemaker due to WARN: do_cib_control: Couldn't complete CIB registration [In reply to]

2011-12-06 Thread Florian Haas
Hi Graham, On Tue, Dec 6, 2011 at 8:06 AM, Graham Rawolle rawol...@daintreesystems.com wrote: I too am having all sorts of dramas getting pacemaker to start. Andrew you mentioned the new way “ver:1” to start the pacemaker daemons. The problem is that the two packaged versions of pacemaker

Re: [Pacemaker] CMAN - Pacemaker - Porftpd setup

2011-12-06 Thread Florian Haas
Hello, On Tue, Dec 6, 2011 at 2:36 PM, Bensch, Kobus kobus.ben...@bauerservices.co.uk wrote: colocation ftpsite-with-webip inf: ActiveFTPSite WebIP colocation website-with-ip inf: ActiveFTPSite WebIP order apache-after-ip inf: WebIP ActiveFTPSite order propftpd-after-webip inf: WebIP

Re: [Pacemaker] CMAN - Pacemaker - Porftpd setup

2011-12-06 Thread Florian Haas
On Tue, Dec 6, 2011 at 3:16 PM, Bensch, Kobus kobus.ben...@bauerservices.co.uk wrote: Hi Florian Thanks for the reply. 1.) No reason. I can get rid of one of each Did you, and if so has it changed the situation? 2.) The result of crm_mon -rf OK, no failcounts. Can you create a CIB dump

Re: [Pacemaker] Where to install applications

2011-12-02 Thread Florian Haas
On Fri, Dec 2, 2011 at 5:35 PM, Charles DeVoe scarecrow...@yahoo.com wrote: We are building a 4 node active/active cluster, which I believe is the same as High Performance. Not quite. That's still an HA cluster with some scale-out capability. HPC is a slightly different ballgame. The

Re: [Pacemaker] managing config files as resources

2011-12-01 Thread Florian Haas
Hi Larry, On Thu, Dec 1, 2011 at 6:59 PM, Larry Brigman larry.brig...@gmail.com wrote: Is there a method to manage individual files as resources? Which RA would be used and any pointer as to how to configure it would be great. Specifically we need to sync some files between nodes that have

Re: [Pacemaker] managing config files as resources

2011-12-01 Thread Florian Haas
On Thu, Dec 1, 2011 at 10:45 PM, Larry Brigman larry.brig...@gmail.com wrote: On Thu, Dec 1, 2011 at 1:42 PM, Florian Haas flor...@hastexo.com wrote: On Thu, Dec 1, 2011 at 10:35 PM, Larry Brigman larry.brig...@gmail.com wrote: Yes, the files can be changed dynamically - mostly by a user
