[Linux-HA] unsubscribe

Grabowski , Jürgen Tue, 15 May 2007 06:37:47 -0700

unsubscribe


Mit freundlichen Grüßen
Zweckverband Kommunale Datenverarbeitung Oldenburg (KDO)
Der Verbandsgeschäftsführer



i. A. Jürgen Grabowski
Systemtechniker UNIX
Elsässer Straße 66
26121 Oldenburg
Telefon  0441 9714-238
Telefax  0441 9714-17238
E-Mail   [EMAIL PROTECTED]
Internet http://www.kdo.de 

 



Der Inhalt dieser E-Mail ist vertraulich und ausschließlich für den im 
Adressfeld bezeichneten Adressaten bestimmt. Wenn Sie nicht der vorgesehene 
Adressat dieser E-Mail oder dessen Vertreter sein sollten, so beachten Sie 
bitte, dass jede Form der Kenntnisnahme, Veröffentlichung, Vervielfältigung 
oder Weitergabe des Inhalts dieser E-Mail unzulässig ist. Wir bitten Sie, sich 
in diesem Fall mit dem Absender der E-Mail in Verbindung zu setzen und diese 
Mail sowie alle Kopien davon zu vernichten.

 
 
 

-----Ursprüngliche Nachricht-----
Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Im Auftrag von [EMAIL 
PROTECTED]
Gesendet: Dienstag, 15. Mai 2007 15:07
An: [email protected]
Betreff: Linux-HA Digest, Vol 42, Issue 54

Send Linux-HA mailing list submissions to
        [email protected]

To subscribe or unsubscribe via the World Wide Web, visit
        http://lists.linux-ha.org/mailman/listinfo/linux-ha
or, via email, send a message with subject or body 'help' to
        [EMAIL PROTECTED]

You can reach the person managing the list at
        [EMAIL PROTECTED]

When replying, please edit your Subject line so it is more specific than "Re: 
Contents of Linux-HA digest..."


Today's Topics:

   1. Re: Remove node from cluster (Max Hofer)
   2. Bonding for High Availability (Jan Kalcic)
   3. Re: Bonding for High Availability (Diego Julian Remolina)
   4. Re: Re: Fwd: Linux-HA Bug report (Dan Gahlinger)
   5. Re: NewToHA2    (Eric Marcus)
   6. What heartbeat version to install now ? (Benjamin Watine)
   7. RE: MySQL Master Master (Benjamin Lawetz)


----------------------------------------------------------------------

Message: 1
Date: Tue, 15 May 2007 12:40:48 +0200
From: Max Hofer <[EMAIL PROTECTED]>
Subject: Re: [Linux-HA] Remove node from cluster
To: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain;  charset="ansi_x3.4-1968"

Mariusz, could you please integrate this info in the Wiki?

On Tuesday 15 May 2007, Mariusz Mart wrote:
> Work fine:)
> Thanks
> 
> >>> "Andrew Beekhof" <[EMAIL PROTECTED]> 07-05-15 10:05 >>>
> On 5/12/07, Mariusz Mart <[EMAIL PROTECTED]> wrote:
> > Hi!
> >
> > I have 3 node cluster and I would like to remove one node from CRM 
> > database. Is there a simple solution for that?
> > I can just stop it and not use, but it will be better if there is no 
> > extra node in hb_gui;)
> 
> you need to stop the node
> then you need to run hb_delnode (i'm not sure of the options) then you 
> need to delete the node from the CIB using
>    cibadmin -D -o nodes -X '<node uname="yournodename"/>'
> and
>    cibadmin -D -o status -X '<node_status uname="yournodename"/>'


------------------------------

Message: 2
Date: Tue, 15 May 2007 11:50:42 +0200
From: Jan Kalcic <[EMAIL PROTECTED]>
Subject: [Linux-HA] Bonding for High Availability
To: General Linux-HA mailing list <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1

Hi People,

I got my cluster running with heartbeat2 but using haresources file. The 
resources I've created are IPAddr, drbd and apache but doesn't matter.
What I need is double the network interface eth0 on each machine, create a bond 
interface on this and then create virtual IPAddr on the the bond interface. Is 
this possibile?
What I'm looking for is create a High Availability environment in a multiple 
switch topology network.

Regards,
Jan


------------------------------

Message: 3
Date: Tue, 15 May 2007 07:13:50 -0400
From: Diego Julian Remolina <[EMAIL PROTECTED]>
Subject: Re: [Linux-HA] Bonding for High Availability
To: General Linux-HA mailing list <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi Jan,

It is very possible. I do this on a machine running RHEL4. I actually go a step 
further and do tagged VLANs (802.1q) on top of the bonded interface, so my 
configuration is like this:

[EMAIL PROTECTED] network-scripts]# cat ifcfg-eth0 DEVICE=eth0 ONBOOT=yes 
BOOTPROTO=none MASTER=bond0 SLAVE=yes USERCTL=no

[EMAIL PROTECTED] network-scripts]# cat ifcfg-eth1
DEVICE=eth1
ONBOOT=yes
BOOTPROTO=none
MASTER=bond0
SLAVE=yes
USERCTL=no

[EMAIL PROTECTED] network-scripts]# cat ifcfg-eth2
DEVICE=eth2
ONBOOT=yes
BOOTPROTO=none
MASTER=bond0
SLAVE=yes
USERCTL=no

[EMAIL PROTECTED] network-scripts]# cat ifcfg-bond0 DEVICE=bond0 ONBOOT=yes 
BOOTPROTO=none USERCTL=no

[EMAIL PROTECTED] network-scripts]# cat ifcfg-bond0.431
DEVICE=bond0.431
BOOTPROTO=static
ONBOOT=yes
IPADDR=X.Y.131.21
NETMASK=255.255.255.0
TYPE=Ethernet
VLAN=yes

[EMAIL PROTECTED] network-scripts]# cat ifcfg-bond0.432
DEVICE=bond0.432
BOOTPROTO=static
ONBOOT=yes
IPADDR=X.Y.132.21
NETMASK=255.255.255.0
TYPE=Ethernet
VLAN=yes

[EMAIL PROTECTED] network-scripts]# cat ifcfg-bond0.433
DEVICE=bond0.433
BOOTPROTO=static
ONBOOT=no
IPADDR=X.Y.133.21
NETMASK=255.255.255.0
TYPE=Ethernet
VLAN=yes

For the haresources or cib.xml file you must use the IPaddr2 script for the 
virtual IPs as shown here (It is of importance to define the different VLANs on 
top of the bonded interface in my case):

[EMAIL PROTECTED] network-scripts]# cat /etc/ha.d/haresources node1_fqdn 
IPaddr2::X.Y.131.20/24/bond0.431:0  IPaddr2::X.Y.132.20/24/bond0.432:0
IPaddr2::X.Y.133.20/24/bond0.433:0 drbddisk::export 
Filesystem::/dev/drbd0::/export::ext3::rw,usrquota,grpquota,acl,user_xattr 
saslauthd cyrus-imapd clamd mimedefang sendmail node2_fqdn 
IPaddr2::X.Y.131.19/24/bond0.431:0 IPaddr2::X.Y.132.19/24/bond0.432:1
IPaddr2::X.Y.145.133/24/bond0.433:2 drbddisk::web 
Filesystem::/dev/drbd1::/web::ext3::rw,usrquota,grpquota,acl,user_xattr mysqld 
httpd

If you have only bonding and use one virtual IP, then the IPaddr script should 
be enough, since you will only end up having bond0 and bond0:0 as your virtual 
interface. The need of the IPaddr2 script arises when you need to determine the 
name of the virtual interfaces so that heartbeat stops and starts the correct 
virtual IPs correctly.

You must also check your kernel and the bonding modes available for your 
network cards. Some bonding modes require special switch configurations. Some 
modes give you better throughput and other modes are oriented towards 
availability. Check the readme files for the bonding module in your kernel 
documentation.

HTH,

Diego


Jan Kalcic wrote:
> Hi People,
> 
> I got my cluster running with heartbeat2 but using haresources file. 
> The resources I've created are IPAddr, drbd and apache but doesn't matter.
> What I need is double the network interface eth0 on each machine, 
> create a bond interface on this and then create virtual IPAddr on the 
> the bond interface. Is this possibile?
> What I'm looking for is create a High Availability environment in a 
> multiple switch topology network.
> 
> Regards,
> Jan
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems



------------------------------

Message: 4
Date: Tue, 15 May 2007 08:16:30 -0400
From: "Dan Gahlinger" <[EMAIL PROTECTED]>
Subject: Re: [Linux-HA] Re: Fwd: Linux-HA Bug report
To: "General Linux-HA mailing list" <[email protected]>
Message-ID:
        <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

how about just 2.0.8.5 ?

Although I suppose it's possible this 2.0.9 that you built will "become" the 
"real" 2.0.9 at some point?

Dan.

On 5/15/07, Lars Marowsky-Bree <[EMAIL PROTECTED]> wrote:
>
> On 2007-05-15T10:47:43, Andrew Beekhof <[EMAIL PROTECTED]> wrote:
>
> > In fairness to SUSE, it was solely my doing while I learnt how to 
> > use the openSUSE build service.  The version there isnt an official 
> > SUSE release either.
> >
> > To be honest I didnt know what to call it... 2.0.8 seemed wrong 
> > (since there was an extra 4-5 months development in there) but there 
> > is no real 2.0.9 either... oh well.
>
> Well, the version in mercurial does identify itself as 2.0.9 when build.
> Maybe, to reflect the intermediate state, 2.0.8.999 would have been 
> better.
>
>
> Sincerely,
>     Lars
>
> --
> Teamlead Kernel, SuSE Labs, Research and Development SUSE LINUX 
> Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg) "Experience is 
> the name everyone gives to their mistakes." -- Oscar Wilde
>
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>


------------------------------

Message: 5
Date: Tue, 15 May 2007 08:25:52 -0400
From: "Eric Marcus" <[EMAIL PROTECTED]>
Subject: [Linux-HA] Re: NewToHA2   
To: <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=US-ASCII

Hello!   Yes, it was the firewall.  Apparently my IPChains config didn't stick 
and the FW is off now.    I thought I updated the list but I don't know if I am 
replying correctly.   Am I supposed to put the Subject in like I am doing on 
this one and clean up the body a bit like this?     I am plugging along slowly 
with HA 2.  Its fun working with new stuff and luckily I am being paid to do 
this!  Now I am having some EVMS issue that is posted in the list.  I also have 
a call with Novell too.  A two pronged attack to that issue on my part I 
suppose.  
Thanks for the reply.  

   3. Re: NewToHA2 (Andrew Beekhof)


Message: 3
Date: Tue, 15 May 2007 11:44:20 +0200
From: "Andrew Beekhof" <[EMAIL PROTECTED]>
Subject: Re: [Linux- HA] NewToHA2
To: "General Linux- HA mailing list" <linux- [EMAIL PROTECTED] ha.org>
Message- ID:
        <[EMAIL PROTECTED]>
Content- Type: text/plain; charset=ISO- 8859- 1; format=flowed

its almost always a firewall.

try stopping the firewall completely and see if the problem persists.

On 5/8/07, Eric Marcus <[EMAIL PROTECTED]> wrote:
>
> Hello, I am new to HA2 and am having some configuration issues.   I installed 
> HA2  (2.0.8- 1) on two Suse 10 (SLES10) machines using Alan's Education 
> Project Screencast (http://www.linux- 
> ha.org/Education/Newbie/InstallHeartbeatScreencast)
>
> I think I have a node configuration issue even though it is in ha.cf.   I am 
> very familiar with Novell Cluster Services.   The problem I outline below 
> makes me think that both of the nodes are trying to be the "Master" but I 
> don't how to fix this.  I've spent a week on this and am feeling very stupid! 
>   Here goes.....
>
> My ha.cf file for the 2 servers shows
>
> use_logd yes
> bcast eth1
> node it- mgatedom it- mgatedomc
> crm on
>
>
> The logd.cf shows
>
> logfacility     daemon
>
>
> The authkeys show
>
> auth 1
> 1 sha1 cluster1
>
>
> Now, when I start it up on IT- MGATEDOM,  it shows "done"
>
> crm_mon shows only 1 node configured and after a couple minutes the "Current 
> DC: NONE" becomes "Current DC: it- mgatedom" with 0 resources configured.  It 
> still shows 1 node, not 2.
>
> Then I go to IT- MGATEDOMC to start it up......   It says "done" and when I 
> do a tail /var/log/message I see this
>
>
>
> it- mgatedomc:~ # /etc/init.d/heartbeat start Starting High- 
> Availability services:
>                                                                      
> done
>
> it- mgatedomc:~ # tail /var/log/messages May  8 12:06:16 it- mgatedomc 
> heartbeat: [4514]: info: G_main_add_TriggerHandler:  Added signal 
> manual handler May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> G_main_add_TriggerHandler:  Added signal manual handler May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp 
> failed, recreating.
> May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP 
> Broadcast heartb eat started on port 694 (694) interface eth1 May  8 
> 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast 
> heartb eat closed on port 694 interface eth1 -  Status: 1 May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal 
> handler for signal 17 May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> Local status now set to: ' up'
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Link it- mgatedom:eth1 
> up.
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Status update 
> for node it-  mgatedom: status active May  8 12:06:17 it- mgatedomc 
> heartbeat: [4514]: info: Link it- mgatedomc:eth1 up.
> it- mgatedomc:~ # tail /var/log/messages May  8 12:06:16 it- mgatedomc 
> heartbeat: [4514]: info: G_main_add_TriggerHandler:  Added signal 
> manual handler May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> G_main_add_TriggerHandler:  Added signal manual handler May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp 
> failed, recreating.
> May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP 
> Broadcast heartb eat started on port 694 (694) interface eth1 May  8 
> 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast 
> heartb eat closed on port 694 interface eth1 -  Status: 1 May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal 
> handler for signal 17 May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> Local status now set to: ' up'
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Link it- mgatedom:eth1 
> up.
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Status update 
> for node it-  mgatedom: status active May  8 12:06:17 it- mgatedomc 
> heartbeat: [4514]: info: Link it- mgatedomc:eth1 up.
> it- mgatedomc:~ # tail /var/log/messages May  8 12:06:16 it- mgatedomc 
> heartbeat: [4514]: info: G_main_add_TriggerHandler:  Added signal 
> manual handler May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> G_main_add_TriggerHandler:  Added signal manual handler May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp 
> failed, recreating.
> May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP 
> Broadcast heartb eat started on port 694 (694) interface eth1 May  8 
> 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast 
> heartb eat closed on port 694 interface eth1 -  Status: 1 May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal 
> handler for signal 17 May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> Local status now set to: ' up'
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Link it- mgatedom:eth1 
> up.
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Status update 
> for node it-  mgatedom: status active May  8 12:06:17 it- mgatedomc 
> heartbeat: [4514]: info: Link it- mgatedomc:eth1 up.
> it- mgatedomc:~ # tail /var/log/messages May  8 12:06:16 it- mgatedomc 
> heartbeat: [4514]: info: G_main_add_TriggerHandler:  Added signal 
> manual handler May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> G_main_add_TriggerHandler:  Added signal manual handler May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp 
> failed, recreating.
> May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP 
> Broadcast heartb eat started on port 694 (694) interface eth1 May  8 
> 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast 
> heartb eat closed on port 694 interface eth1 -  Status: 1 May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal 
> handler for signal 17 May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> Local status now set to: ' up'
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Link it- mgatedom:eth1 
> up.
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Status update 
> for node it-  mgatedom: status active May  8 12:06:17 it- mgatedomc 
> heartbeat: [4514]: info: Link it- mgatedomc:eth1 up.
> it- mgatedomc:~ # tail /var/log/messages May  8 12:06:16 it- mgatedomc 
> heartbeat: [4514]: info: G_main_add_TriggerHandler:  Added signal 
> manual handler May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> G_main_add_TriggerHandler:  Added signal manual handler May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbea t/rsctmp 
> failed, recreating.
> May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP 
> Broadcast heartb eat started on port 694 (694) interface eth1 May  8 
> 12:06:16 it- mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast 
> heartb eat closed on port 694 interface eth1 -  Status: 1 May  8 12:06:16 it- 
> mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal 
> handler for signal 17 May  8 12:06:16 it- mgatedomc heartbeat: [4514]: info: 
> Local status now set to: ' up'
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Link it- mgatedom:eth1 
> up.
> May  8 12:06:17 it- mgatedomc heartbeat: [4514]: info: Status update 
> for node it-  mgatedom: status active May  8 12:06:17 it- mgatedomc 
> heartbeat: [4514]: info: Link it- mgatedomc:eth1 up.
> it- mgatedomc:~ # tail /var/log/messages May  8 12:07:06 it- mgatedomc 
> heartbeat: [4514]: debug: hist- >ackseq =0 May  8 12:07:06 it- 
> mgatedomc heartbeat: [4514]: debug: hist- >lowseq =0, hist- >hi 
> seq=103 May  8 12:07:06 it- mgatedomc heartbeat: [4514]: debug: 
> expecting from it- mgatedo m May  8 12:07:06 it- mgatedomc heartbeat: 
> [4514]: debug: it's ackseq=0 May  8 12:07:06 it- mgatedomc heartbeat: [4514]: 
> debug:
> May  8 12:07:06 it- mgatedomc heartbeat: [4514]: debug: hist- >ackseq 
> =0 May  8 12:07:06 it- mgatedomc heartbeat: [4514]: debug: hist- 
> >lowseq =0, hist- >hi seq=104 May  8 12:07:06 it- mgatedomc heartbeat: 
> [4514]: debug: expecting from it- mgatedo m May  8 12:07:06 it- 
> mgatedomc heartbeat: [4514]: debug: it's ackseq=0 May  8 12:07:06 it- 
> mgatedomc heartbeat: [4514]: debug:
>
>
>
> The line that says "expecting from it- mgatedom" confuses me.
>
> crm_mon shows "Not Connected".
>
> netstat - n - l | grep 694 shows that udp 694 is there.
>
> The strange thing is if I stop both of them and start it on IT- MGATEDOMC 
> first, then it will come up just fine and then when I start it on IT- 
> MGATEDOM, it has the above issue.
>
> Any ideas?
>
> Thank you,
> Eric...
>
> _______________________________________________
> Linux- HA mailing list
> Linux- [EMAIL PROTECTED] ha.org
> http://lists.linux- ha.org/mailman/listinfo/linux- ha See also: 
> http://linux- ha.org/ReportingProblems
>



------------------------------

Message: 6
Date: Tue, 15 May 2007 14:55:29 +0200
From: Benjamin Watine <[EMAIL PROTECTED]>
Subject: [Linux-HA] What heartbeat version to install now ?
To: General Linux-HA mailing list <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi

I'm about to install Heartbeat on a fresh Debian 4 system, and I was wondering 
wich version of heartbeat I should install ?
Some bugs have been corrected since v2.0.8, and I would like to install these 
corrections too. So should I install v2.0.8 and apply the patches that I want, 
or install the latest development version ?

If you think there's no major problem installing latest snapshot, where can I 
find it ? here ? : http://hg.linux-ha.org/dev/archive/tip.tar.bz2
Any other version to suggest ?

Thanks in advance.

Benjamin


------------------------------

Message: 7
Date: Tue, 15 May 2007 09:06:53 -0400
From: "Benjamin Lawetz" <[EMAIL PROTECTED]>
Subject: RE: [Linux-HA] MySQL Master Master
To: "'General Linux-HA mailing list'" <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain;       charset="US-ASCII"

Hi Markus,

        If I recall you need to reset the failed resource after you manually 
clean it up. Your monitor failed, couldn't restart it, heartbeat determines 
there is a problem with this resource on this node that it cannot do anything 
to fix. You have to manually fix it and then reset the failed resource.

>From the http://www.linux-ha.org/v2/AdminTools/crm_resource page

12. Resetting a failed resource after having been manually cleaned up
        crm_resource -C -H c001n02 -r my_frist_ip


> -----Original Message-----
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED] On Behalf Of Markus W.
> Sent: May 15, 2007 6:39 AM
> To: General Linux-HA mailing list
> Subject: Re: [Linux-HA] MySQL Master Master
> 
> Starting from scratch:
> 
> OS: Linux, RHEL 4, Kernel 2.6.9-55.EL
> HA: RPM Installation from
> http://dev.centos.org/centos/4/testing/i386/RPMS/heartbeat-2.0
> .8-2.el4.centos
> 
> Configuration, Log, Cibadmin Files: see Attachements
>  - xx.xx.xx.xx: Ping IP
>  - yy.yy.yy.yy: Cluster IP
>  - zz.zz.zz.zz: Cluster Broadcast
> 
> Problem:
> Edit /etc/init.d/httpd on node 1 to return 1 on startup (simulate 
> error) Stop httpd on node1 => heartbeat try to restart httpd on node 1 
> (fail)  => httpd or the group Web failover to node2 Reset 
> /etc/init.d/httpd on node 1 to normal behaviour Start httpd on node1 
> Set node2 in standby mode => heartbeat wont failback to node1
> 
> Best regards,
> 
> Markus
> 
> Dejan Muhamedagic schrieb:
> > On Mon, May 14, 2007 at 11:01:16AM +0200, Markus W. wrote:
> >   
> >> Ok, I understand nothing. I have the same problem with apache like 
> >> mysql. If apache run on the first node and something goes wrong on 
> >> that node apache would switch to the second node - great.
> Apache wont
> >> failback to the first node if I repair the first node and
> switch the
> >> second node into standby mode - bad.
> >>     
> >
> > With default settings, resources should move back to their
> preferred
> > node, once that one is live again. You'd probably want to post the 
> > logs and the configuration. See
> http://linux-ha.org/ReportingProblems
> >
> >   
> >> Anywhere on the ha "universe" page there was an
> information about this. 
> >> But I don't understand why heartbeat dont try the first node just 
> >> once again. If the first node is ok why heartbeat shouln'd
> move the
> >> resource back to the first node? Ok, if the first node
> isn't ok again
> >> I would understand heartbeat is given up to run the
> resource anywhere.
> >>
> >> Thanks
> >>
> >> Benjamin Lawetz schrieb:
> >>     
> >>> I have a vague impression that you might run into
> problems with the
> >>> dummy mysql script. From memory (and one of the gurus here will 
> >>> correct me if I'm
> >>> wrong) heartbeat can call "status" on startup or certain
> occasions.
> >>>
> >>> So having status return an "All OK" when the ressource
> agent should
> >>> not be running might cause unexpected behaviour. You
> might need to
> >>> implement a dummy start and stop and status of just touching or 
> >>> deleting a fake pid file and returning the status
> according to this. 
> >>> You could then implement your function in the monitor part of the 
> >>> script.
> >>>
> >>> But I may be wrong.
> >>>
> >>>  
> >>>       
> >>>> -----Original Message-----
> >>>> From: [EMAIL PROTECTED]
> >>>> [mailto:[EMAIL PROTECTED] On Behalf
> Of Markus W.
> >>>> Sent: May 11, 2007 7:17 AM
> >>>> To: General Linux-HA mailing list
> >>>> Subject: Re: [Linux-HA] MySQL Master Master
> >>>>
> >>>> Hi Benjamin,
> >>>>
> >>>> Wow! It rocks!! Thanks!!!
> >>>>
> >>>> As information the lsb dummy mysql ha script:
> >>>>
> >>>> case "$1" in
> >>>>  start)
> >>>>    exit 0
> >>>>    ;;
> >>>>  stop)
> >>>>    exit 0
> >>>>    ;;
> >>>>  status)
> >>>>    status mysqld
> >>>>    if [ $? -eq "0" ]; then
> >>>>        /usr/sbin/attrd_updater -n mysql_running -d 3s -v 1
> >>>>        exit 0
> >>>>    else
> >>>>        /usr/sbin/attrd_updater -n mysql_running -d 3s -v 0
> >>>>        exit 3
> >>>>    fi
> >>>>    ;;
> >>>>  *)
> >>>>    echo $"Usage: $0 {start|stop|status} (start|stop faked)"
> >>>>    exit 1
> >>>> esac
> >>>>
> >>>> --
> >>>> Markus
> >>>>
> >>>>
> >>>> Benjamin Lawetz schrieb:
> >>>>    
> >>>>         
> >>>>> Hi Markus,
> >>>>>
> >>>>>         I ran into the same problem. Didn't find any
> better way than to
> >>>>> modify the monitoring script of mysql and add in the case
> >>>>>      
> >>>>>           
> >>>> of a failure:
> >>>>    
> >>>>         
> >>>>> /usr/sbin/attrd_updater -n mysql_running -d 5s -v 0
> >>>>>
> >>>>> And in the case of a success:
> >>>>>
> >>>>> /usr/sbin/attrd_updater -n mysql-mod_running -d 5s -v 1
> >>>>>
> >>>>> The running the monitor script as a clone:
> >>>>>
> >>>>>       <clone id="mysql">
> >>>>>         <instance_attributes id="mysql">
> >>>>>           <attributes>
> >>>>>             <nvpair id="mysql-clone_node_max" 
> name="clone_node_max"
> >>>>> value="1"/>
> >>>>>           </attributes>
> >>>>>         </instance_attributes>
> >>>>>         <primitive id="mysql-child" provider="heartbeat" 
> >>>>>      
> >>>>>           
> >>>> class="ocf"
> >>>>    
> >>>>         
> >>>>> type="mysql">
> >>>>>           <operations>
> >>>>>             <op id="mysql-child-monitor" name="monitor" 
> >>>>>      
> >>>>>           
> >>>> interval="20s"
> >>>>    
> >>>>         
> >>>>> timeout="40s" prereq="nothing">
> >>>>>               <instance_attributes
> id="mysql-child-monitor-attr">
> >>>>>               </instance_attributes>
> >>>>>             </op>
> >>>>>             <op id="mysql-child-start" name="start" 
> >>>>>      
> >>>>>           
> >>>> prereq="nothing"/>
> >>>>    
> >>>>         
> >>>>>           </operations>
> >>>>>         </primitive>
> >>>>>       </clone>
> >>>>>
> >>>>> And then had a constraint:
> >>>>>
> >>>>>       <rsc_location rsc="group_1" id="cli-stop2-group_1">
> >>>>>         <rule score="-INFINITY" id="cli-stop2-rule-group_1">
> >>>>>           <expression operation="lte" value="0" 
> >>>>>      
> >>>>>           
> >>>> id="cli-stop2-expr-group_1"
> >>>>    
> >>>>         
> >>>>> attribute="mysql_running"/>
> >>>>>         </rule>
> >>>>>       </rsc_location>
> >>>>>
> >>>>> This will run the monitor on every node and set the score
> >>>>>      
> >>>>>           
> >>>> to -INFINITY
> >>>>    
> >>>>         
> >>>>> for the node where mysql fails.
> >>>>>
> >>>>> If mysql comes back online though, the "mysql_running" will
> >>>>>      
> >>>>>           
> >>>> be set to "1"
> >>>>    
> >>>>         
> >>>>> but I don't think it will trigger a recalculation of
> the scores. 
> >>>>> Haven't figured out yet how to cause this.
> >>>>>
> >>>>>
> >>>>> Hope this helps
> >>>>>  
> >>>>>      
> >>>>>           
> >>>> _______________________________________________
> >>>> Linux-HA mailing list
> >>>> [email protected]
> >>>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> >>>> See also: http://linux-ha.org/ReportingProblems
> >>>>    
> >>>>         
> >>> _______________________________________________
> >>> Linux-HA mailing list
> >>> [email protected]
> >>> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> >>> See also: http://linux-ha.org/ReportingProblems
> >>>
> >>>  
> >>>       
> >> _______________________________________________
> >> Linux-HA mailing list
> >> [email protected]
> >> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> >> See also: http://linux-ha.org/ReportingProblems
> >>     
> >
> >   
> 
> 



------------------------------

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

End of Linux-HA Digest, Vol 42, Issue 54
****************************************
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

[Linux-HA] unsubscribe

Reply via email to