Re: [Pacemaker] cibadmin -Q: Call cib_query failed (-62): Timer expired

2013-10-01 Thread Andrew Beekhof
On 28/09/2013, at 5:37 AM, Radoslaw Garbacz wrote: > The problem was actually of a different nature - nothing to do with > cib_shm. The logs showed later on that the connection to cib was > established, just the corosync configuration file didn't have a proper > quorum section, which caused the

Re: [Pacemaker] Corosync won't recover when a node fails

2013-10-01 Thread Andrew Beekhof
On 02/10/2013, at 5:24 AM, David Parker wrote: > Thanks, I did a little Googling and found the git repository for pcs. pcs won't help you rebuild pacemaker with cman support (or corosync 2.x support) turned on though. > Is there any way to make a two-node cluster work with the stock Debian

Re: [Pacemaker] Problems when quorum lost for a short period of time

2013-10-01 Thread Andrew Beekhof
On 02/10/2013, at 6:26 AM, Lev Sidorenko wrote: > Hello All! > > I have a 4-node cluster setup. > > It is actually 2 nodes for main+standby and another two nodes just to > provide quorum. 1 extra would have been enough > > So, all resources run on the main node but only DRBD-slave runs on

Re: [Pacemaker] custom resource-agent

2013-10-01 Thread Peter Romfeld
yes i need to use the aws command, i am using a VPC, after issuing the command i get a "true" statement and it's done so i only want pacemaker to issue the one-shot command at failover. Here is what i have atm: (i know it's still dirty, just learning pacemaker) primitive drbd_mysql ocf:linbit:drbd

Re: [Pacemaker] custom resource-agent

2013-10-01 Thread David Lang
the aws command is making the call to inform aws, if you were to bring up the address without making the aws command, would it work? If you are on a Virtual Private Cloud (VPC), it may, but I didn't think it would. If you can make it work without the aws command, then you can just use the stan

Re: [Pacemaker] custom resource-agent

2013-10-01 Thread Peter Romfeld
Hey, when i change the secondary IP by hand or with external script on an Ubuntu instance I just need: /etc/network/interfaces auto eth0 iface eth0 inet dhcp address 192.168.32.12 netmask 255.255.240.0 gateway 192.168.32.1 up ip addr add 192.168.32.11/20 dev eth0 and then run the s

[Pacemaker] Problems when quorum lost for a short period of time

2013-10-01 Thread Lev Sidorenko
Hello All! I have a 4-node cluster setup. It is actually 2 nodes for main+standby and another two nodes just to provide quorum. So, all resources run on the main node but only DRBD-slave runs on the standby node. I have no-quorum-policy="stop" So, sometimes main node loses connection to the

Re: [Pacemaker] Corosync won't recover when a node fails

2013-10-01 Thread David Parker
Thanks, I did a little Googling and found the git repository for pcs. Is there any way to make a two-node cluster work with the stock Debian packages, though? It seems odd that this would be impossible. On Tue, Oct 1, 2013 at 3:16 PM, Larry Brigman wrote: > pcs is another package you will need

Re: [Pacemaker] Corosync won't recover when a node fails

2013-10-01 Thread Larry Brigman
pcs is another package you will need to install. On Oct 1, 2013 9:04 AM, "David Parker" wrote: > Hello, > > Sorry for the delay in my reply. I've been doing a lot of > experimentation, but so far I've had no luck. > > Thanks for the suggestion, but it seems I'm not able to use CMAN. I'm > runni

Re: [Pacemaker] custom resource-agent

2013-10-01 Thread David Lang
On Tue, 1 Oct 2013, Dejan Muhamedagic wrote: On Tue, Oct 01, 2013 at 10:07:12AM -0700, David Lang wrote: On Tue, 1 Oct 2013, Dejan Muhamedagic wrote: On Tue, Oct 01, 2013 at 07:22:20AM -0700, David Lang wrote: On Tue, 1 Oct 2013, Dejan Muhamedagic wrote: Hi David, On Mon, Sep 30, 2013 at

Re: [Pacemaker] custom resource-agent

2013-10-01 Thread Dejan Muhamedagic
On Tue, Oct 01, 2013 at 10:07:12AM -0700, David Lang wrote: > On Tue, 1 Oct 2013, Dejan Muhamedagic wrote: > > >On Tue, Oct 01, 2013 at 07:22:20AM -0700, David Lang wrote: > >>On Tue, 1 Oct 2013, Dejan Muhamedagic wrote: > >> > >>>Hi David, > >>> > >>>On Mon, Sep 30, 2013 at 12:41:23PM -0700, Davi

Re: [Pacemaker] Bug? Resources running with realtime priority - possibly causing monitor timeouts

2013-10-01 Thread Dejan Muhamedagic
Hi, On Tue, Oct 01, 2013 at 11:07:35AM +0200, Joschi Brauchle wrote: > Hello everyone, > > on two (recently upgraded) SLES11SP3 machines, we are running an > active/passive NFS fileserver and several other high availability > services using corosync + pacemaker (see version numbers below). > > W

Re: [Pacemaker] custom resource-agent

2013-10-01 Thread David Lang
On Tue, 1 Oct 2013, Dejan Muhamedagic wrote: On Tue, Oct 01, 2013 at 07:22:20AM -0700, David Lang wrote: On Tue, 1 Oct 2013, Dejan Muhamedagic wrote: Hi David, On Mon, Sep 30, 2013 at 12:41:23PM -0700, David Lang wrote: On Mon, 30 Sep 2013, David Lang wrote: On Mon, 30 Sep 2013, Michael S

Re: [Pacemaker] DRBD Master/Slave in a 3 node cluster

2013-10-01 Thread James Oakley
On Tuesday, October 1, 2013, Stefan Botter wrote: > I have a quite similar setup, currently running on stock 12.2. I have > a test > system just updated to 12.3, with the ha-clustering:Stable, and it > fails > with STONITH enabled almost instantly, due to certain segfaults in > the > stonith reso

Re: [Pacemaker] custom resource-agent

2013-10-01 Thread Dejan Muhamedagic
On Tue, Oct 01, 2013 at 07:22:20AM -0700, David Lang wrote: > On Tue, 1 Oct 2013, Dejan Muhamedagic wrote: > > >Hi David, > > > >On Mon, Sep 30, 2013 at 12:41:23PM -0700, David Lang wrote: > >>On Mon, 30 Sep 2013, David Lang wrote: > >> > >>>On Mon, 30 Sep 2013, Michael Schwartzkopff wrote: > >>>

Re: [Pacemaker] Corosync won't recover when a node fails

2013-10-01 Thread David Parker
Hello, Sorry for the delay in my reply. I've been doing a lot of experimentation, but so far I've had no luck. Thanks for the suggestion, but it seems I'm not able to use CMAN. I'm running Debian Wheezy with Corosync and Pacemaker installed via apt-get. When I installed CMAN and set up a clust

Re: [Pacemaker] Private node IPs, public VIP and routing

2013-10-01 Thread David Lang
On Tue, 1 Oct 2013, Dennis Jacobfeuerborn wrote: Hi, create a routing resource and group it together with the public VIP. Hm, thanks for the pointer. While this would work with a regular static route the problem here is that the default route has to be replaced i.e. the old one has to be rem

Re: [Pacemaker] custom resource-agent

2013-10-01 Thread David Lang
On Tue, 1 Oct 2013, Dejan Muhamedagic wrote: Hi David, On Mon, Sep 30, 2013 at 12:41:23PM -0700, David Lang wrote: On Mon, 30 Sep 2013, David Lang wrote: On Mon, 30 Sep 2013, Michael Schwartzkopff wrote: Am Montag, 30. September 2013, 21:12:56 schrieb Peter Romfeld: I am working in AWS i

Re: [Pacemaker] Private node IPs, public VIP and routing

2013-10-01 Thread Dennis Jacobfeuerborn
On 30.09.2013 13:24, Michael Schwartzkopff wrote: Am Montag, 30. September 2013, 13:11:28 schrieb Dennis Jacobfeuerborn: > Hi, > I am in the process of setting up a redundant NFS storage system and I'm > wondering how to best manage the systems/IPs. > The goal is that the nodes each get a

Re: [Pacemaker] DRBD Master/Slave in a 3 node cluster

2013-10-01 Thread Stefan Botter
Hi Dejan, On Tue, 1 Oct 2013 13:57:12 +0200 Dejan Muhamedagic wrote: > On Tue, Oct 01, 2013 at 09:26:14AM +0200, Stefan Botter wrote: > > I have a quite similar setup, currently running on stock 12.2. I > > have a test system just updated to 12.3, with the > > ha-clustering:Stable, and it fails

Re: [Pacemaker] DRBD Master/Slave in a 3 node cluster

2013-10-01 Thread Dejan Muhamedagic
Hi Stefan, On Tue, Oct 01, 2013 at 09:26:14AM +0200, Stefan Botter wrote: > Hi James, > > On Mon, 30 Sep 2013 12:31:52 -0700 > James Oakley wrote: > > > I am having some trouble with DRBD Master/Slave resources in a 3-node > > cluster. > > > > I am using the Pacemaker packages from ha-clusteri

Re: [Pacemaker] custom resource-agent

2013-10-01 Thread Dejan Muhamedagic
Hi David, On Mon, Sep 30, 2013 at 12:41:23PM -0700, David Lang wrote: > On Mon, 30 Sep 2013, David Lang wrote: > > >On Mon, 30 Sep 2013, Michael Schwartzkopff wrote: > > > >>Am Montag, 30. September 2013, 21:12:56 schrieb Peter Romfeld: > >>>I am working in AWS i cant just use a VIP i need to use

[Pacemaker] How to disallow resources to re-run after they were stopped

2013-10-01 Thread Mistina Michal
Dear all. I am using the ping resource agent to check if the resources can run on the node. I have a 2-node cluster. If the destination is down, resources are also down on that particular node. If the destination can be pinged again the resources boot again on the node. My question is, how can I achiev

[Pacemaker] Bug? Resources running with realtime priority - possibly causing monitor timeouts

2013-10-01 Thread Joschi Brauchle
Hello everyone, on two (recently upgraded) SLES11SP3 machines, we are running an active/passive NFS fileserver and several other high availability services using corosync + pacemaker (see version numbers below). We are having severe problems with resource monitors timing out during our syste

Re: [Pacemaker] DRBD Master/Slave in a 3 node cluster

2013-10-01 Thread Stefan Botter
Hi James, On Mon, 30 Sep 2013 12:31:52 -0700 James Oakley wrote: > I am having some trouble with DRBD Master/Slave resources in a 3-node > cluster. > > I am using the Pacemaker packages from ha-clustering:Stable on > openSUSE 12.3. I was going to try the packages from Unstable to see > if they