Re: [Linux-HA] Question about IPaddr

2011-02-04 Thread Max
Robert, > ... > You should find it on > http://lists.linux-ha.org/pipermail/linux-ha/2011-February/thread.html > by tomorrow or so. (title: 'Add real monitoring capabilities to IPaddr2 > resource agent') [Cross post this as you wish] > ... >> >> ip addr show [interface] >> >> and look for the e

[Linux-HA] NFSv4 with Heartbeat and DRBD

2011-02-04 Thread Ricardo Botelho de Sousa
Hello All! We have implemented a solution based around heartbeat v3 and DRBD. While everything seems to work very well we have some difficulty with regard to the time it takes for the NFS service to become fully available. How long it is expected for a graceful fail-over with NFSv4 to take?

Re: [Linux-HA] Question about IPaddr

2011-02-04 Thread Robert Euhus
Hi, I have just sent an initial draft for real 'monitoring' in IPaddr2 to the linux-ha-dev mailing list. Sorry for cross posting, but I thought you might be interested to test and/or give some feedback there :) You should find it on http://lists.linux-ha.org/pipermail/linux-ha/2011-February/thr

Re: [Linux-HA] Antw: Re: pacemaker/HealthCPU

2011-02-04 Thread Dimitri Maziuk
Lars Ellenberg wrote: > On Fri, Feb 04, 2011 at 02:35:35PM +0100, Ulrich Windl wrote: >> Lars, >> >> you are right, and I saw that my guess to use /proc/stat was wrong. top is >> slow in getting the current CPU usage. So basically I wondered if you need >> the CPU usage at all. If you'd switch to

[Linux-HA] On RHEL6 / Use of agent fence_ipmilan in Pacemaker configuration but with new action "diag"

2011-02-04 Thread Alain.Moulle
Hi, I've to test a new option of fence_ipmilan added by RH , which is "diag" , meaning that the fence will send via the IPMI a DIAG signal to the node to be fenced so that it enters a dump phase. With fence_ipmilan executed manually with -o diag , it works fine. In pacemaker, I've set action=

Re: [Linux-HA] Antw: Re: pacemaker/HealthCPU

2011-02-04 Thread Lars Ellenberg
On Fri, Feb 04, 2011 at 02:35:35PM +0100, Ulrich Windl wrote: > Lars, > > you are right, and I saw that my guess to use /proc/stat was wrong. top is > slow in getting the current CPU usage. So basically I wondered if you need > the CPU usage at all. If you'd switch to "load", you could get it a

Re: [Linux-HA] Antw: Re: pacemaker/HealthCPU

2011-02-04 Thread Michael Schwartzkopff
On Friday 04 February 2011 14:35:35 Ulrich Windl wrote: > Lars, > > you are right, and I saw that my guess to use /proc/stat was wrong. top is > slow in getting the current CPU usage. So basically I wondered if you need > the CPU usage at all. If you'd switch to "load", you could get it a lot > fa

[Linux-HA] Antw: Re: pacemaker/HealthCPU

2011-02-04 Thread Ulrich Windl
Lars, you are right, and I saw that my guess to use /proc/stat was wrong. top is slow in getting the current CPU usage. So basically I wondered if you need the CPU usage at all. If you'd switch to "load", you could get it a lot faster. To be honest: I wondered what "HealthCPU" would monitor abo

Re: [Linux-HA] pacemaker/HealthCPU

2011-02-04 Thread Lars Ellenberg
On Thu, Feb 03, 2011 at 01:09:04PM +0100, Michael Schwartzkopff wrote: > On Thursday 03 February 2011 12:35:34 Ulrich Windl wrote: > > Hi! > > > > I'm starting to explore Linux-HA. Examining one of the monitors, I think > > things could be made much more efficient. For example: To get the percent

[Linux-HA] resource not restarted due to score value

2011-02-04 Thread Haussecker, Armin
Hi, in our 2-node-cluster we have a clone resource StorGr1 and two primitive resources DummyVM1 and DummyVM2. StorGr1 should be started before DummyVM1 and DummyVM2 due to order constraints. StorGr1 clone was started on both cluster nodes goat1 and sheep1. DummyVM1 and DummyVM2 were both started

[Linux-HA] Cannot add more than 4 host to ping test list

2011-02-04 Thread Stratos Zolotas
Hello to all. I have a very simple setup (trying to learn) with two nodes (Active/Passive) and a simple resource (failover ip). I'm trying to setup the cluster to change the running node for the resource when ping fails. I have four hosts in my host list and everything is working nice. The resour

Re: [Linux-HA] failed actions of heartbeat..

2011-02-04 Thread Erik Dobák
i think i figured it out now, and it works stable now for a week: the jboss resource was missing this option: *pstring* ="^/usr/jdk/latest/bin/java.-Dprogram.name=run.sh.*10.100.102.105" i did not define it at first because it was stated its optional but when i did read the jboss run script in re

[Linux-HA] OCF_RESKEY_CRM_meta_timeout not matching monitor timeout meta-data

2011-02-04 Thread Brett Delle Grazie
Hi, Apologies for cross-posting but I'm not sure where this problem resides. I'm running: corosync-1.2.7-1.1.el5.x86_64 corosynclib-1.2.7-1.1.el5.x86_64 cluster-glue-1.0.6-1.6.el5.x86_64 cluster-glue-libs-1.0.6-1.6.el5.x86_64 pacemaker-1.0.10-1.4.el5.x86_64 pacemaker-libs-1.0.10-1.4.el5.x86_64 re

Re: [Linux-HA] corosync + pacemaker on FC13 problem

2011-02-04 Thread Andrew Beekhof
On Fri, Feb 4, 2011 at 8:25 AM, Linux Cook wrote: > hi! > > I need help into my corosync and pacemaker configuration. Corosync is > supposed to be starting pacemaker by itself but its not what's happening > whenever I start Corosync. Process shows: > > ps -fax| grep corosync > >  9032 ?        Ssl