Re: [Pacemaker] 1) attrd, crmd, cib, stonithd going to 100% CPU after standby 2) monitoring bug 3) meta failure-timeout issue

2011-10-03 Thread Proskurin Kirill
On 10/03/2011 05:32 AM, Andrew Beekhof wrote: corosync-1.4.1 pacemaker-1.1.5 pacemaker runs with ver: 1 2) This one is scary. I twice run on situation then pacemaker thinks what resource is started but it is not. RA is misbehaving. Pacemaker will only consider a resource running if the RA

Re: [Pacemaker] pacemaker/dlm problems

2011-10-03 Thread Andrew Beekhof
On Mon, Oct 3, 2011 at 3:34 PM, Vladislav Bogdanov bub...@hoster-ok.com wrote: 03.10.2011 04:41, Andrew Beekhof wrote: [...] If pacemaker fully finish processing of one membership change - elect new DC on a quorate partition, and do not try to take over dc role (or release it) on a

Re: [Pacemaker] pacemaker/dlm problems

2011-10-03 Thread Vladislav Bogdanov
03.10.2011 10:56, Andrew Beekhof wrote: On Mon, Oct 3, 2011 at 3:34 PM, Vladislav Bogdanovbub...@hoster-ok.com wrote: 03.10.2011 04:41, Andrew Beekhof wrote: [...] If pacemaker fully finish processing of one membership change - elect new DC on a quorate partition, and do not try to take over

Re: [Pacemaker] Trouble with ordering

2011-10-03 Thread Gerald Vogt
On 03.10.11 03:47, Serge Dubrouski wrote: As I wrote before: you should be able to test this easily by sending a STOP signal to the named process. At least in this situation I see that the rndc stop doesn't return before those 60s. Indeed you are right. Thanks for catching.

[Pacemaker] unmanaged a resource

2011-10-03 Thread Hugo Deprez
Dear community, I have a cluster with DRBD and apache2 resources. I wanted to do some test with apache2, I knew that my test could break apache2. Hence I did a #crm resource unamange apache2 The resource was unmanage at the crm_mon. When I did my test apache2 didn't restart but corosync try

Re: [Pacemaker] Master won't get promoted

2011-10-03 Thread Charles Richard
Hi, Thanks for the answer below, that clears up why that wasn't working without a stonith device. I'm wondering if i would need a stonith device if we plan to have 2 redundant nic interfaces on each node (connected to a different switch) for the lan connection plus one nic for the drbd sync

Re: [Pacemaker] Stonith / Fencing

2011-10-03 Thread Fiorenza Meini
Il 29/09/2011 09:13, Andrew Beekhof ha scritto: On Wed, Sep 28, 2011 at 5:52 PM, Fiorenza Meinifme...@esseweb.eu wrote: Hi there, I'm working on stonith on my test cluster. It has, to me, a strange behaviour: when the condition to fence the other node happens, is it normal that both

Re: [Pacemaker] Fencing xen-server host

2011-10-03 Thread Fiorenza Meini
Il 22/09/2011 12:23, Dejan Muhamedagic ha scritto: Hi, On Thu, Sep 22, 2011 at 10:56:57AM +0200, Fiorenza Meini wrote: Hi there, I found this: http://code.google.com/p/fence-xenserver/wiki/Installation I installed on my test cluster system and from command line it works properly, but I have

Re: [Pacemaker] Master won't get promoted

2011-10-03 Thread Dejan Muhamedagic
Hi, On Mon, Oct 03, 2011 at 10:27:56AM -0300, Charles Richard wrote: Hi, Thanks for the answer below, that clears up why that wasn't working without a stonith device. I'm wondering if i would need a stonith device if we plan to have 2 redundant nic interfaces on each node (connected to a

Re: [Pacemaker] 4 servers; different resources on different servers?

2011-10-03 Thread Dejan Muhamedagic
Hi, On Sun, Oct 02, 2011 at 10:20:34AM +0200, Tomasz Chmielewski wrote: I want to build a cluster of 4 servers (servers A, B, C, D), with different resources on them (mysql, webserver): webservers: A B C D mysql: A B Is there a way to make Pacemaker/heartbeat to let assign MySQL IP

Re: [Pacemaker] Concurrent runs of 'crm configure primitive' interfering

2011-10-03 Thread Dejan Muhamedagic
On Wed, Sep 28, 2011 at 10:52:16AM -0400, Brian J. Murrell wrote: On 11-09-28 10:20 AM, Dejan Muhamedagic wrote: Hi, Hi, I'm really not sure. Need to investigate this area more. Well, I am experimenting with cibadmin. It's certainly not as nice and shiny as crm shell though. :-)

Re: [Pacemaker] Dual-Primary DRBD with OCFS2 on SLES 11 SP1

2011-10-03 Thread Dejan Muhamedagic
Hi, On Thu, Sep 29, 2011 at 04:06:10PM +0100, darren.mans...@opengi.co.uk wrote: Sorry for top-posting, I'm Outlook-afflicted. Poor you This is also my problem; In the full production environment there will be low-level hardware fencing by means of IBM RSA/ASM but this is a VMware test

Re: [Pacemaker] Dual-Primary DRBD with OCFS2 on SLES 11 SP1

2011-10-03 Thread Dejan Muhamedagic
Hi, On Thu, Sep 29, 2011 at 10:47:33AM -0400, Nick Khamis wrote: Hello Dejan, Sorry to hijack, I am also working on the same type of setup as a prototype. What is the best way to get stonith included for VM setups? Maybe an SSH stonith? external/libvirt, though somebody said that that

Re: [Pacemaker] Announce: LCMC (Linux Cluster Management Console)

2011-10-03 Thread Rasto Levrinc
On Mon, Oct 3, 2011 at 10:07 AM, Kulovits Christian - OS ITSC christian.kulov...@austrian.com wrote: Hi Rasto Both DRBD Console 9.9 and LCMC 1.0 have the same problem, there is only a tab for the last connected Cluster. The others connected Clusters can only be selected for disconnect from

Re: [Pacemaker] concurrent uses of cibadmin: Signon to CIB failed: connection failed

2011-10-03 Thread Lars Ellenberg
On Thu, Sep 29, 2011 at 03:45:32PM -0400, Brian J. Murrell wrote: So, in another thread there was a discussion of using cibadmin to mitigate possible concurrency issue of crm shell. I have written a test program to test that theory and unfortunately cibadmin falls down in the face of heavy

Re: [Pacemaker] Dual-Primary DRBD with OCFS2 on SLES 11 SP1

2011-10-03 Thread Vladislav Bogdanov
Hi, 29.09.2011 17:47, Nick Khamis wrote: Hello Dejan, Sorry to hijack, I am also working on the same type of setup as a prototype. What is the best way to get stonith included for VM setups? Maybe an SSH stonith? Again, this is just for the prototype. You may look at fence-virt. I use

Re: [Pacemaker] pacemaker/dlm problems

2011-10-03 Thread Andrew Beekhof
On Mon, Oct 3, 2011 at 7:29 PM, Vladislav Bogdanov bub...@hoster-ok.com wrote: 03.10.2011 10:56, Andrew Beekhof wrote: On Mon, Oct 3, 2011 at 3:34 PM, Vladislav Bogdanovbub...@hoster-ok.com  wrote: 03.10.2011 04:41, Andrew Beekhof wrote: [...] If pacemaker fully finish processing of one

Re: [Pacemaker] 4 servers; different resources on different servers?

2011-10-03 Thread Tim Serong
On 04/10/11 04:06, Nick Khamis wrote: I forgot to ask, for creating an asymmetric cluster, do the services (mysql, apache etc..) have to be installed on all the nodes. Probably. Pacemaker will still try to probe resources on all nodes, to ensure they're not running, then the RA will return

Re: [Pacemaker] Running two clusters on same node

2011-10-03 Thread Med Hmici
Hi Andrew, First thank you for getting back to me. The strategy is for a split controlled upgrade. Thus having to momentarily create an isolated cluster, with different software setup, before taking over the official port. This is at an early stage of thinking. So I might have missed some

Re: [Pacemaker] Trouble with ordering

2011-10-03 Thread Serge Dubrouski
On Mon, Oct 3, 2011 at 7:16 AM, Gerald Vogt v...@spamcop.net wrote: On 03.10.11 03:47, Serge Dubrouski wrote: As I wrote before: you should be able to test this easily by sending a STOP signal to the named process. At least in this situation I see that the rndc stop doesn't