Re: [Linux-HA] heartbeat2 groups

2007-05-24 Thread Andrew Beekhof
On 5/23/07, Jacob Leaver [EMAIL PROTECTED] wrote: Hello all, I am using heartbeat 2 with crm, since it seemed to be the thing to do. I'm only using a small fraction of the feature set, since this is my loadbalancer, heartbeat2 handles failover and ldirectord is managing the ipvs stuff. My

Re: [Linux-HA] Re: Writing thirdparty program to manage heartbeat

2007-05-24 Thread Andrew Beekhof
On 5/23/07, Tao Yu [EMAIL PROTECTED] wrote: Is it a practical way to call cibadmin/crm_resource/etc.. from the other program? sure. or you can connect directly to the CIB (from any host over a TLS connection) and send XML instructions On 5/23/07, Tao Yu [EMAIL PROTECTED] wrote: Hi, I

Re: [Linux-HA] Problems to add a monitor operation

2007-05-24 Thread Andrew Beekhof
On 5/23/07, Dejan Muhamedagic [EMAIL PROTECTED] wrote: On Tue, May 22, 2007 at 04:43:49PM -0300, [EMAIL PROTECTED] wrote: Hi Everyone, I'm trying to add monitor operations for some resources I've configured in the cib.xml, but when I try to do it using the GUI, I'm getting some errors:

Re: [Linux-HA] Problems to add a monitor operation

2007-05-24 Thread Andrew Beekhof
On 5/23/07, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: Hi Dejan, Thanks for your help! I've tried to add the monitor operation as you explained, but got the following error: 1) cibadmin -Q -o resources > rsc.xml s0580crmdb2pr1:~ # cat rsc.xml <resources> <group ordered="true" description="Grupo

Re: [Linux-HA] issue with management of heartbeat.pid file

2007-05-24 Thread Kevin Jamieson
Brian Reichert wrote: What I tracked down was that if the box powered down too quickly for heartbeat to clean up, a PID file was left in place: ... But, there's no check to assure the recorded PID is not stale. Have others seen this? This code seems to be in 2.0.8 as well... FYI, I filed

Re: [Linux-HA] Re: Writing thirdparty program to manage heartbeat

2007-05-24 Thread Andrew Beekhof
On 5/24/07, Max Hofer [EMAIL PROTECTED] wrote: On Wednesday 23 May 2007, Tao Yu wrote: Is it a practical way to call cibadmin/crm_resource/etc.. from the other program? On 5/23/07, Tao Yu [EMAIL PROTECTED] wrote: Hi, I am trying to write some thirdparty programs to manage heartbeat.

Re: [Linux-HA] issue with management of heartbeat.pid file

2007-05-24 Thread Andrew Beekhof
On 5/24/07, Kevin Jamieson [EMAIL PROTECTED] wrote: Brian Reichert wrote: What I tracked down was that if the box powered down too quickly for heartbeat to clean up, a PID file was left in place: ... But, there's no check to assure the recorded PID is not stale. Have others seen this?

Re: [Linux-HA] ld error while running ConfigureMe make

2007-05-24 Thread David Lee
On Tue, 22 May 2007, jake brown wrote: Is there a way to manually tell it not to install the quorumd? That ought not to be necessary. I have been trying to build linux-ha 2.0.8 under solaris10 x86 and every time I build it I get this error any clue? gnutls_x509_crt_init

[Linux-HA] Bonding and simplify

2007-05-24 Thread oohurbert
Hi, everybody, I've studied heartbeat for several weeks, and found it has perfect and overall functions. But it is a little big for me. System description: Two node, Redhat EL4, heartbeat-2.0.8, same hardware and software configure, same processes running, but different

Re: [Linux-HA] Bonding and simplify

2007-05-24 Thread Andrew Beekhof
my eyes! my eyes! please, no html emails :-) On 5/24/07, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: Hi, everybody, I've studied heartbeat for several weeks, and found it has perfect and overall functions. But it is a little big for me. System description: Two node, Redhat

Re: [Linux-HA] issue with management of heartbeat.pid file

2007-05-24 Thread Brian Reichert
On Thu, May 24, 2007 at 12:21:13AM -0700, Kevin Jamieson wrote: Brian Reichert wrote: What I tracked down was that if the box powered down too quickly for heartbeat to clean up, a PID file was left in place: ... But, there's no check to assure the recorded PID is not stale. Have

Re: [Linux-HA] issue with management of heartbeat.pid file

2007-05-24 Thread Andrew Beekhof
On 5/24/07, Brian Reichert [EMAIL PROTECTED] wrote: On Thu, May 24, 2007 at 12:21:13AM -0700, Kevin Jamieson wrote: Brian Reichert wrote: What I tracked down was that if the box powered down too quickly for heartbeat to clean up, a PID file was left in place: ... But, there's no check

Re: [Linux-HA] issue with management of heartbeat.pid file

2007-05-24 Thread Carson Gaspar
David Lee wrote: Andrew: Thanks for doing that, especially the concern for the non-Linux systems. That concern is much appreciated. Alas: The test -e that attempts to do this is itself non-portable (so bad shell syntax etc.). I think the relevant (for this context) flags that seem to be

Re: [Linux-HA] issue with management of heartbeat.pid file

2007-05-24 Thread Andrew Beekhof
On 5/24/07, Carson Gaspar [EMAIL PROTECTED] wrote: David Lee wrote: Andrew: Thanks for doing that, especially the concern for the non-Linux systems. That concern is much appreciated. Alas: The test -e that attempts to do this is itself non-portable (so bad shell syntax etc.). I think

Re: [Linux-HA] pingd problems

2007-05-24 Thread fabiomm
Hi Andrew!! Thanks a lot for the tip. I thought we did not have any update to heartbeat from Novell, but looking at Yast online update ( https://you.novell.com/update/x86_64/update/SUSE-SLES/10/rpm/src/) I could find the heartbeat 2.0.7 available for download. After updating it and setting the

Re: [Linux-HA] issue with management of heartbeat.pid file

2007-05-24 Thread Brian Reichert
On Thu, May 24, 2007 at 06:05:27PM +0200, Andrew Beekhof wrote: I don't see what the checks in /proc gets you at this point; how is that different than ( kill(pid,0) != -1 ) && ( pid != getpid() )? I'm not used to using /proc for this stuff. it tells you if the pid has been reused by
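For readers following the stale-PID discussion, here is a minimal C sketch of the kill(pid, 0) style check quoted in this message. It is an illustration only, not the code in heartbeat.c and not the patch being tested in this thread; the helper name pid_is_live_other is hypothetical. As the reply points out, a check like this cannot tell whether the PID has been reused by an unrelated process, which is what the /proc inspection is meant to address.

```c
#define _POSIX_C_SOURCE 200809L

#include <errno.h>
#include <signal.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper (not from heartbeat.c): returns 1 if `pid` names a
 * live process other than the caller, and 0 otherwise.
 *
 * kill(pid, 0) sends no signal; it only performs the existence and
 * permission checks. It cannot detect that the PID now belongs to an
 * unrelated program.
 */
static int pid_is_live_other(pid_t pid)
{
    /* Reject invalid values and our own PID (a stale file may hold it). */
    if (pid <= 0 || pid == getpid()) {
        return 0;
    }

    if (kill(pid, 0) == 0) {
        return 1; /* process exists and we may signal it */
    }

    /* EPERM means the process exists but belongs to another user. */
    return errno == EPERM;
}
```

A caller would read the number recorded in the PID file, pass it to this helper, and treat a return of 0 as a stale file that is safe to overwrite.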

Re: [Linux-HA] issue with management of heartbeat.pid file

2007-05-24 Thread Brian Reichert
On Tue, May 22, 2007 at 05:26:04PM -0400, Brian Reichert wrote: I've been testing auto_failback in our 2.0.7-based cluster, and have found sometimes failback doesn't occur. We're managing a virtual IP via a haresources file on a Red Hat 4 box. What I tracked down was that if the box

[Linux-HA] resend: issue with management of heartbeat.pid file

2007-05-24 Thread Brian Reichert
Re-sending, this time without the large attachments that blocked its posting. I can provide those straces, if folks want. - Forwarded message from Brian Reichert [EMAIL PROTECTED] - Date: Wed, 23 May 2007 15:24:45 -0400 From: Brian Reichert [EMAIL PROTECTED] To: General Linux-HA

Re: [Linux-HA] Problems to add a monitor operation

2007-05-24 Thread fabiomm
Hi Andrew!! Hi Dejan!! Thanks a lot for the help! As I said in another topic I've posted, Novell has made available an update for heartbeat from 2.0.5 to 2.0.7. After updating the package this problem was solved too, now I can add it using the GUI! :) Thanks a lot for the help! Best Regards,

[Linux-HA] More simple solution

2007-05-24 Thread huijun lu
Hi,everybody, I've studied heartbeat for several weeks, and found it has perfect and overall functions. But it is a little big for me. System description: Two node, Redhat EL4, heartbeat-2.0.8, same hardware and software configure, same processes running, but different behavior(active node

Re: [Linux-HA] resend: issue with management of heartbeat.pid file

2007-05-24 Thread Brian Reichert
On Thu, May 24, 2007 at 03:29:45PM -0400, Brian Reichert wrote: This is the patch to 2.0.8 that I'm testing: # diff -U3 heartbeat/heartbeat.c.orig heartbeat/heartbeat.c --- heartbeat/heartbeat.c.orig 2007-01-11 21:57:05.0 -0500 +++ heartbeat/heartbeat.c 2007-05-23

RE: [Linux-HA] Problem with CRM multiple failovers in testing

2007-05-24 Thread Peter Mueller
a v2-style cluster with a 19200bps serial link? I think that in itself is an interesting attempt ;-) I left it at default (from RPM) figuring that was safest. I understand now this is recommended to be maximum speed. However, it is not obvious. I am using supermicro 6015p-8TR servers. No

[Solved] RE: [Linux-HA] Problem with CRM multiple failovers in testing

2007-05-24 Thread Peter Mueller
a v2-style cluster with a 19200bps serial link? I think that in itself is an interesting attempt ;-) baud 230400 now. Thnx for documentation http://www.linux-ha.org/ha.cf#baud. I don't really see a recommendation for higher baud in documentation for crm=yes clusters. Is this a good place to