[Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-20 Thread Lars Ellenberg
Recent discussions with Dejan made me again more prominently aware of a few issues we probably all know about, but usually dismis as having not much relevance in the real-world. The facts: * a pidfile typically only stores a pid * a pidfile may stale, not properly cleaned up when the pid

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-20 Thread Alan Robertson
For the Assimilation code I use the full pathname of the binary from /proc to tell if it's one of mine. That's not perfect if you're using an interpreted language. It works quite well for compiled languages. On 10/20/2014 01:17 PM, Lars Ellenberg wrote: Recent discussions with Dejan made me

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-20 Thread Lars Ellenberg
On Mon, Oct 20, 2014 at 03:04:31PM -0600, Alan Robertson wrote: On 10/20/2014 02:52 PM, Alan Robertson wrote: For the Assimilation code I use the full pathname of the binary from /proc to tell if it's one of mine. That's not perfect if you're using an interpreted language. It works quite

Re: [Linux-ha-dev] RFC: pidfile handling; current worst case: stop failure and node level fencing

2014-10-20 Thread Alan Robertson
On 10/20/2014 03:21 PM, Lars Ellenberg wrote: On Mon, Oct 20, 2014 at 03:04:31PM -0600, Alan Robertson wrote: On 10/20/2014 02:52 PM, Alan Robertson wrote: For the Assimilation code I use the full pathname of the binary from /proc to tell if it's one of mine. That's not perfect if you're

[Linux-HA] Remote node attributes support in crmsh

2014-10-20 Thread Vladislav Bogdanov
Hi Kristoffer, do you plan to add support for recently added remote node attributes feature to chmsh? Currently (at least as of 2.1, and I do not see anything relevant in the git log) crmsh fails to update CIB if it contains node attributes for remote (bare-metal) node, complaining that

Re: [Linux-HA] Remote node attributes support in crmsh

2014-10-20 Thread Dejan Muhamedagic
Hi Vladislav, On Mon, Oct 20, 2014 at 09:03:40AM +0300, Vladislav Bogdanov wrote: Hi Kristoffer, do you plan to add support for recently added remote node attributes feature to chmsh? Currently (at least as of 2.1, and I do not see anything relevant in the git log) crmsh fails to update

Re: [Linux-HA] Remote node attributes support in crmsh

2014-10-20 Thread Vladislav Bogdanov
20.10.2014 18:23, Dejan Muhamedagic wrote: Hi Vladislav, Hi Dejan! On Mon, Oct 20, 2014 at 09:03:40AM +0300, Vladislav Bogdanov wrote: Hi Kristoffer, do you plan to add support for recently added remote node attributes feature to chmsh? Currently (at least as of 2.1, and I do not see

[Linux-HA] New user can't get cman to recognize other systems

2014-10-20 Thread John Scalia
Hi all, I'm trying to build my first ever HA cluster and I'm using 3 VMs running CentOS 6.5. I followed the instructions to the letter at: http://clusterlabs.org/quickstart-redhat.html and everything appears to start normally, but if I run cman_tool nodes -a, I only see: Node StsInc

[Linux-HA] AUTO: Robert Koeppl ist außer Haus. Robert Koeppl is out of office (Rückkehr am 23.10.2014)

2014-10-20 Thread Robert . Koeppl
Ich kehre zurück am 23.10.2014. Hinweis: Dies ist eine automatische Antwort auf Ihre Nachricht Re: [Linux-HA] Remote node attributes support in crmsh gesendet am 20.10.2014 17:23:46. Diese ist die einzige Benachrichtigung, die Sie empfangen werden, während diese Person abwesend ist.

Re: [Linux-HA] New user can't get cman to recognize other systems

2014-10-20 Thread Digimer
On 20/10/14 02:50 PM, John Scalia wrote: Hi all, I'm trying to build my first ever HA cluster and I'm using 3 VMs running CentOS 6.5. I followed the instructions to the letter at: http://clusterlabs.org/quickstart-redhat.html and everything appears to start normally, but if I run cman_tool

Re: [Linux-HA] New user can't get cman to recognize other systems

2014-10-20 Thread John Scalia
Sure, and thanks for helping. Here's the /etc/cluster/cluster.conf file and it is identical on all three systems: cluster config_version=11 name=pgdb_cluster fence_daemon/ clusternodes clusternode name=csgha1 nodeid=1 fence method name=pcmk-redirect device

Re: [Linux-HA] New user can't get cman to recognize other systems

2014-10-20 Thread Maciej Rostański
Hello, In my experience such problems were the effect of my mistakes, such as not having all hosts in /etc/hosts file. Check this, please, I know it sounds simple. Also, commands: pcs cluster setup --name clustername node1 node2 node3 pcs cluster enable pcs cluster start are much more pleasant

Re: [Linux-HA] New user can't get cman to recognize other systems

2014-10-20 Thread John Scalia
Thanks, but on centOS are you saying to use pcs cluster start rather than using service cman start and service pacemaker start? I was just going by the tutorial, which doesn't mention this. On 10/20/2014 3:44 PM, Maciej Rostański wrote: Hello, In my experience such problems were the effect of

Re: [Linux-HA] New user can't get cman to recognize other systems

2014-10-20 Thread Maciej Rostański
Well, with 6.4 and 6.5 (which I like a lot) there is this specific situation - no more crm, only pcs and ccs, but on the other hand, stack with cman (which is being replaced by corosync 2.0 now). So the documentation found on various sites is rarely handy... 2014-10-20 22:17 GMT+02:00 John Scalia

Re: [Linux-HA] New user can't get cman to recognize other systems

2014-10-20 Thread Andrew Beekhof
On 21 Oct 2014, at 7:17 am, John Scalia jayknowsu...@gmail.com wrote: Thanks, but on centOS are you saying to use pcs cluster start rather than using service cman start and service pacemaker start? I was just going by the tutorial, which doesn't mention this. 'service pacemaker start'

Re: [Linux-HA] New user can't get cman to recognize other systems

2014-10-20 Thread jayknowsunix
OK, got it. Sent from my iPad On Oct 20, 2014, at 10:10 PM, Andrew Beekhof and...@beekhof.net wrote: On 21 Oct 2014, at 7:17 am, John Scalia jayknowsu...@gmail.com wrote: Thanks, but on centOS are you saying to use pcs cluster start rather than using service cman start and service