[Linux-ha-dev] New master/slave resource agent for DB2 databases in HADR (High Availability Disaster Recovery) mode

2011-02-04 Thread Holger Teutsch
Hi, I submit a resource agent (db2hadr) managing a DB2 HADR (High Availability Disaster Recovery) pair as master/slave resource for review and possible inclusion into the project. As a second step I volunteer to add some collateral documentation to the ha-linux Wiki for this and the db2 agent.

Re: [Linux-ha-dev] slapd ocf resource agent

2011-02-04 Thread jeroen
Hi, I've updated the resource agent so that it tries to establish an LDAP connection. Most of the issues where resolved, but I haven't had the time yet to test with other OSes so I changed /bin/sh to /bin/bash. I'll try to remove bashisms as soon as possible. Jeroen slapd.sh Description:

[Linux-ha-dev] Add real monitoring capabilities to IPaddr2 resource agent

2011-02-04 Thread Robert Euhus
Hi there, I have extended the monitor operation of the IPaddr2 resource agent to have some real monitoring capabilities. Depending on the OCF_CHECK_LEVEL it will do some of the following test: check wether: - the ARP cache for the interface is not empty. - packets are received on the

Re: [Linux-ha-dev] slapd ocf resource agent

2011-02-04 Thread Lars Ellenberg
On Fri, Feb 04, 2011 at 04:16:23PM +0100, jer...@intuxicated.org wrote: Hi, I've updated the resource agent so that it tries to establish an LDAP connection. Most of the issues where resolved, but I haven't had the time yet to test with other OSes so I changed /bin/sh to /bin/bash. I'll try

Re: [Linux-ha-dev] Add real monitoring capabilities to IPaddr2 resource agent

2011-02-04 Thread Lars Ellenberg
+# get the link status on $NIC +# returns UP or DOWN or whatever ip reports (UNKNOWN?) +get_link_status () { + $IP2UTIL -o link show dev $NIC \ + | sed 's/.* state \([^ ]*\) .*/\1/' +} + +# returns the number of received rx packets on $NIC +get_rx_packets () { +

Re: [Linux-HA] corosync + pacemaker on FC13 problem

2011-02-04 Thread Andrew Beekhof
On Fri, Feb 4, 2011 at 8:25 AM, Linux Cook linuxc...@gmail.com wrote: hi! I need help into my corosync and pacemaker configuration. Corosync is supposed to be starting pacemaker by itself but its not what's happening whenever I start Corosync. Process shows: ps -fax| grep corosync  9032 ?

[Linux-HA] OCF_RESKEY_CRM_meta_timeout not matching monitor timeout meta-data

2011-02-04 Thread Brett Delle Grazie
Hi, Apologies for cross-posting but I'm not sure where this problem resides. I'm running: corosync-1.2.7-1.1.el5.x86_64 corosynclib-1.2.7-1.1.el5.x86_64 cluster-glue-1.0.6-1.6.el5.x86_64 cluster-glue-libs-1.0.6-1.6.el5.x86_64 pacemaker-1.0.10-1.4.el5.x86_64 pacemaker-libs-1.0.10-1.4.el5.x86_64

Re: [Linux-HA] failed actions of heartbeat..

2011-02-04 Thread Erik Dobák
i think i figured it out now, and it works stable now for a week: the jboss resource was missing this option: *pstring* =^/usr/jdk/latest/bin/java.-Dprogram.name=run.sh.*10.100.102.105 i did not define it at first because it was stated its optional but when i did read the jboss run script in

[Linux-HA] Cannot add more than 4 host to ping test list

2011-02-04 Thread Stratos Zolotas
Hello to all. I have a very simple setup (trying to learn) with two nodes (Active/Passive) and a simple resource (failover ip). I'm trying to setup the cluster to change the running node for the resource when ping fails. I have four hosts in my host list and everything is working nice. The

[Linux-HA] resource not restarted due to score value

2011-02-04 Thread Haussecker, Armin
Hi, in our 2-node-cluster we have a clone resource StorGr1 and two primitive resources DummyVM1 and DummyVM2. StorGr1 should be started before DummyVM1 and DummyVM2 due to order constraints. StorGr1 clone was started on both cluster nodes goat1 and sheep1. DummyVM1 and DummyVM2 were both started

Re: [Linux-HA] pacemaker/HealthCPU

2011-02-04 Thread Lars Ellenberg
On Thu, Feb 03, 2011 at 01:09:04PM +0100, Michael Schwartzkopff wrote: On Thursday 03 February 2011 12:35:34 Ulrich Windl wrote: Hi! I'm starting to explore Linux-HA. Examining one of the monitors, I think things could be made much more efficient. For example: To get the percent of

[Linux-HA] Antw: Re: pacemaker/HealthCPU

2011-02-04 Thread Ulrich Windl
Lars, you are right, and I saw that my guess to use /proc/stat was wrong. top is slow in getting the current CPU usage. So basically I wondered if you need the CPU usage at all. If you'd switch to load, you could get it a lot faster. To be honest: I wondered what HealthCPU would monitor about

Re: [Linux-HA] Antw: Re: pacemaker/HealthCPU

2011-02-04 Thread Michael Schwartzkopff
On Friday 04 February 2011 14:35:35 Ulrich Windl wrote: Lars, you are right, and I saw that my guess to use /proc/stat was wrong. top is slow in getting the current CPU usage. So basically I wondered if you need the CPU usage at all. If you'd switch to load, you could get it a lot faster.

[Linux-HA] On RHEL6 / Use of agent fence_ipmilan in Pacemaker configuration but with new action diag

2011-02-04 Thread Alain.Moulle
Hi, I've to test a new option of fence_ipmilan added by RH , which is diag , meaning that the fence will send via the IPMI a DIAG signal to the node to be fenced so that it enters a dump phase. With fence_ipmilan executed manually with -o diag , it works fine. In pacemaker, I've set

Re: [Linux-HA] Antw: Re: pacemaker/HealthCPU

2011-02-04 Thread Dimitri Maziuk
Lars Ellenberg wrote: On Fri, Feb 04, 2011 at 02:35:35PM +0100, Ulrich Windl wrote: Lars, you are right, and I saw that my guess to use /proc/stat was wrong. top is slow in getting the current CPU usage. So basically I wondered if you need the CPU usage at all. If you'd switch to load, you

Re: [Linux-HA] Question about IPaddr

2011-02-04 Thread Robert Euhus
Hi, I have just sent an initial draft for real 'monitoring' in IPaddr2 to the linux-ha-dev mailing list. Sorry for cross posting, but I thought you might be interested to test and/or give some feedback there :) You should find it on

[Linux-HA] NFSv4 with Heartbeat and DRBD

2011-02-04 Thread Ricardo Botelho de Sousa
Hello All! We have implemented a solution based around heartbeat v3 and DRBD. While everything seems to work very well we have some difficulty with regard to the time it takes for the NFS service to become fully available. How long it is expected for a graceful fail-over with NFSv4 to

Re: [Linux-HA] Question about IPaddr

2011-02-04 Thread Max
Robert, ... You should find it on http://lists.linux-ha.org/pipermail/linux-ha/2011-February/thread.html by tomorrow or so. (title: 'Add real monitoring capabilities to IPaddr2 resource agent') [Cross post this as you wish] ... ip addr show [interface] and look for the existence of