unsubscribe
Kind regards,

Zweckverband Kommunale Datenverarbeitung Oldenburg (KDO)
The Managing Director
On behalf of
Jürgen Grabowski
UNIX Systems Engineer

Elsässer Straße 66
26121 Oldenburg
Phone    0441 9714-238
Fax      0441 9714-17238
E-Mail   [EMAIL PROTECTED]
Internet http://www.kdo.de

The content of this e-mail is confidential and intended solely for the addressee named in the address field. If you are not the intended addressee of this e-mail or their representative, please note that any form of reading, publication, duplication, or forwarding of its content is not permitted. In that case, please contact the sender of the e-mail and destroy this mail and all copies of it.

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED]
Sent: Tuesday, 15 May 2007 15:07
To: [email protected]
Subject: Linux-HA Digest, Vol 42, Issue 54

Send Linux-HA mailing list submissions to
        [email protected]

To subscribe or unsubscribe via the World Wide Web, visit
        http://lists.linux-ha.org/mailman/listinfo/linux-ha
or, via email, send a message with subject or body 'help' to
        [EMAIL PROTECTED]

You can reach the person managing the list at
        [EMAIL PROTECTED]

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Linux-HA digest..."

Today's Topics:

   1. Re: Remove node from cluster (Max Hofer)
   2. Bonding for High Availability (Jan Kalcic)
   3. Re: Bonding for High Availability (Diego Julian Remolina)
   4. Re: Re: Fwd: Linux-HA Bug report (Dan Gahlinger)
   5. Re: NewToHA2 (Eric Marcus)
   6. What heartbeat version to install now ? (Benjamin Watine)
   7. RE: MySQL Master Master (Benjamin Lawetz)

----------------------------------------------------------------------

Message: 1
Date: Tue, 15 May 2007 12:40:48 +0200
From: Max Hofer <[EMAIL PROTECTED]>
Subject: Re: [Linux-HA] Remove node from cluster
To: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="ansi_x3.4-1968"

Mariusz, could you please integrate this info in the Wiki?

On Tuesday 15 May 2007, Mariusz Mart wrote:
> Work fine:)
> Thanks
>
> >>> "Andrew Beekhof" <[EMAIL PROTECTED]> 07-05-15 10:05 >>>
> On 5/12/07, Mariusz Mart <[EMAIL PROTECTED]> wrote:
> > Hi!
> >
> > I have 3 node cluster and I would like to remove one node from CRM
> > database. Is there a simple solution for that?
> > I can just stop it and not use, but it will be better if there is no
> > extra node in hb_gui;)
>
> you need to stop the node
> then you need to run hb_delnode (i'm not sure of the options) then you
> need to delete the node from the CIB using
>    cibadmin -D -o nodes -X '<node uname="yournodename"/>'
> and
>    cibadmin -D -o status -X '<node_status uname="yournodename"/>'

------------------------------

Message: 2
Date: Tue, 15 May 2007 11:50:42 +0200
From: Jan Kalcic <[EMAIL PROTECTED]>
Subject: [Linux-HA] Bonding for High Availability
To: General Linux-HA mailing list <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1

Hi People,

I got my cluster running with heartbeat2 but using the haresources file.
The resources I've created are IPaddr, drbd and apache, but that doesn't matter.
What I need is to double the network interface eth0 on each machine,
create a bond interface on top of it, and then create a virtual IPaddr on
the bond interface. Is this possible?
What I'm looking for is to create a High Availability environment in a
network topology with multiple switches.
Regards,
Jan

------------------------------

Message: 3
Date: Tue, 15 May 2007 07:13:50 -0400
From: Diego Julian Remolina <[EMAIL PROTECTED]>
Subject: Re: [Linux-HA] Bonding for High Availability
To: General Linux-HA mailing list <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi Jan,

It is very possible. I do this on a machine running RHEL4. I actually go a
step further and do tagged VLANs (802.1q) on top of the bonded interface, so
my configuration is like this:

[EMAIL PROTECTED] network-scripts]# cat ifcfg-eth0
DEVICE=eth0
ONBOOT=yes
BOOTPROTO=none
MASTER=bond0
SLAVE=yes
USERCTL=no

[EMAIL PROTECTED] network-scripts]# cat ifcfg-eth1
DEVICE=eth1
ONBOOT=yes
BOOTPROTO=none
MASTER=bond0
SLAVE=yes
USERCTL=no

[EMAIL PROTECTED] network-scripts]# cat ifcfg-eth2
DEVICE=eth2
ONBOOT=yes
BOOTPROTO=none
MASTER=bond0
SLAVE=yes
USERCTL=no

[EMAIL PROTECTED] network-scripts]# cat ifcfg-bond0
DEVICE=bond0
ONBOOT=yes
BOOTPROTO=none
USERCTL=no

[EMAIL PROTECTED] network-scripts]# cat ifcfg-bond0.431
DEVICE=bond0.431
BOOTPROTO=static
ONBOOT=yes
IPADDR=X.Y.131.21
NETMASK=255.255.255.0
TYPE=Ethernet
VLAN=yes

[EMAIL PROTECTED] network-scripts]# cat ifcfg-bond0.432
DEVICE=bond0.432
BOOTPROTO=static
ONBOOT=yes
IPADDR=X.Y.132.21
NETMASK=255.255.255.0
TYPE=Ethernet
VLAN=yes

[EMAIL PROTECTED] network-scripts]# cat ifcfg-bond0.433
DEVICE=bond0.433
BOOTPROTO=static
ONBOOT=no
IPADDR=X.Y.133.21
NETMASK=255.255.255.0
TYPE=Ethernet
VLAN=yes

For the haresources or cib.xml file you must use the IPaddr2 script for the
virtual IPs as shown here (in my case it is important to define the
different VLANs on top of the bonded interface):

[EMAIL PROTECTED] network-scripts]# cat /etc/ha.d/haresources
node1_fqdn IPaddr2::X.Y.131.20/24/bond0.431:0 IPaddr2::X.Y.132.20/24/bond0.432:0 IPaddr2::X.Y.133.20/24/bond0.433:0 drbddisk::export Filesystem::/dev/drbd0::/export::ext3::rw,usrquota,grpquota,acl,user_xattr saslauthd cyrus-imapd clamd mimedefang sendmail
node2_fqdn IPaddr2::X.Y.131.19/24/bond0.431:0 IPaddr2::X.Y.132.19/24/bond0.432:1 IPaddr2::X.Y.145.133/24/bond0.433:2 drbddisk::web Filesystem::/dev/drbd1::/web::ext3::rw,usrquota,grpquota,acl,user_xattr mysqld httpd

If you have only bonding and use one virtual IP, then the IPaddr script
should be enough, since you will only end up having bond0 and bond0:0 as
your virtual interface. The need for the IPaddr2 script arises when you need
to control the names of the virtual interfaces so that heartbeat stops and
starts the correct virtual IPs.

You must also check your kernel and the bonding modes available for your
network cards. Some bonding modes require special switch configurations.
Some modes give you better throughput and other modes are oriented towards
availability. Check the readme files for the bonding module in your kernel
documentation.

HTH,

Diego

Jan Kalcic wrote:
> Hi People,
>
> I got my cluster running with heartbeat2 but using haresources file.
> The resources I've created are IPAddr, drbd and apache but doesn't matter.
> What I need is double the network interface eth0 on each machine,
> create a bond interface on this and then create virtual IPAddr on
> the bond interface. Is this possible?
> What I'm looking for is create a High Availability environment in a
> multiple switch topology network.
>
> Regards,
> Jan
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems

------------------------------

Message: 4
Date: Tue, 15 May 2007 08:16:30 -0400
From: "Dan Gahlinger" <[EMAIL PROTECTED]>
Subject: Re: [Linux-HA] Re: Fwd: Linux-HA Bug report
To: "General Linux-HA mailing list" <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

how about just 2.0.8.5 ?
Although I suppose it's possible this 2.0.9 that you built will "become" the
"real" 2.0.9 at some point?

Dan.

On 5/15/07, Lars Marowsky-Bree <[EMAIL PROTECTED]> wrote:
>
> On 2007-05-15T10:47:43, Andrew Beekhof <[EMAIL PROTECTED]> wrote:
>
> > In fairness to SUSE, it was solely my doing while I learnt how to
> > use the openSUSE build service. The version there isnt an official
> > SUSE release either.
> >
> > To be honest I didnt know what to call it... 2.0.8 seemed wrong
> > (since there was an extra 4-5 months development in there) but there
> > is no real 2.0.9 either... oh well.
>
> Well, the version in mercurial does identify itself as 2.0.9 when built.
> Maybe, to reflect the intermediate state, 2.0.8.999 would have been
> better.
>
>
> Sincerely,
>     Lars
>
> --
> Teamlead Kernel, SuSE Labs, Research and Development
> SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg)
> "Experience is the name everyone gives to their mistakes." -- Oscar Wilde

------------------------------

Message: 5
Date: Tue, 15 May 2007 08:25:52 -0400
From: "Eric Marcus" <[EMAIL PROTECTED]>
Subject: [Linux-HA] Re: NewToHA2
To: <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=US-ASCII

Hello!

Yes, it was the firewall. Apparently my IPChains config didn't stick and the
FW is off now.

I thought I updated the list but I don't know if I am replying correctly. Am
I supposed to put the Subject in like I am doing on this one and clean up
the body a bit like this?

I am plugging along slowly with HA 2. It's fun working with new stuff and
luckily I am being paid to do this! Now I am having some EVMS issue that is
posted in the list. I also have a call with Novell too. A two-pronged attack
on that issue on my part, I suppose.

Thanks for the reply.


   3. Re: NewToHA2 (Andrew Beekhof)

Message: 3
Date: Tue, 15 May 2007 11:44:20 +0200
From: "Andrew Beekhof" <[EMAIL PROTECTED]>
Subject: Re: [Linux-HA] NewToHA2
To: "General Linux-HA mailing list" <linux-[EMAIL PROTECTED] ha.org>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

its almost always a firewall.
try stopping the firewall completely and see if the problem persists.

On 5/8/07, Eric Marcus <[EMAIL PROTECTED]> wrote:
>
> Hello, I am new to HA2 and am having some configuration issues. I installed
> HA2 (2.0.8-1) on two Suse 10 (SLES10) machines using Alan's Education
> Project Screencast
> (http://www.linux-ha.org/Education/Newbie/InstallHeartbeatScreencast)
>
> I think I have a node configuration issue even though it is in ha.cf. I am
> very familiar with Novell Cluster Services. The problem I outline below
> makes me think that both of the nodes are trying to be the "Master" but I
> don't know how to fix this. I've spent a week on this and am feeling very
> stupid! Here goes.....
>
> My ha.cf file for the 2 servers shows
>
> use_logd yes
> bcast eth1
> node it-mgatedom it-mgatedomc
> crm on
>
>
> The logd.cf shows
>
> logfacility daemon
>
>
> The authkeys show
>
> auth 1
> 1 sha1 cluster1
>
>
> Now, when I start it up on IT-MGATEDOM, it shows "done"
>
> crm_mon shows only 1 node configured and after a couple minutes the "Current
> DC: NONE" becomes "Current DC: it-mgatedom" with 0 resources configured. It
> still shows 1 node, not 2.
>
> Then I go to IT-MGATEDOMC to start it up......
> It says "done" and when I do a tail /var/log/messages I see this
>
>
> it-mgatedomc:~ # /etc/init.d/heartbeat start
> Starting High-Availability services:
>                                                            done
>
> it-mgatedomc:~ # tail /var/log/messages
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_TriggerHandler: Added signal manual handler
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Removing /var/run/heartbeat/rsctmp failed, recreating.
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth1 - Status: 1
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: G_main_add_SignalHandler: Added signal handler for signal 17
> May 8 12:06:16 it-mgatedomc heartbeat: [4514]: info: Local status now set to: 'up'
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedom:eth1 up.
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Status update for node it-mgatedom: status active
> May 8 12:06:17 it-mgatedomc heartbeat: [4514]: info: Link it-mgatedomc:eth1 up.
>
> it-mgatedomc:~ # tail /var/log/messages
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->ackseq =0
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->lowseq =0, hist->hiseq=103
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: expecting from it-mgatedom
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: it's ackseq=0
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug:
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->ackseq =0
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: hist->lowseq =0, hist->hiseq=104
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: expecting from it-mgatedom
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug: it's ackseq=0
> May 8 12:07:06 it-mgatedomc heartbeat: [4514]: debug:
>
>
> The line that says "expecting from it-mgatedom" confuses me.
>
> crm_mon shows "Not Connected".
>
> netstat -n -l | grep 694 shows that udp 694 is there.
>
> The strange thing is if I stop both of them and start it on IT-MGATEDOMC
> first, then it will come up just fine and then when I start it on
> IT-MGATEDOM, it has the above issue.
>
> Any ideas?
>
> Thank you,
> Eric...
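
The reply quoted above blames the firewall, and the log shows heartbeat broadcasting on UDP port 694 over eth1. Below is a minimal shell sketch of what one might collect before looking at the firewall rules: the node names and the UDP port implied by ha.cf. The function names and the HACF variable are made up for illustration and are not part of heartbeat. The sketch assumes that `udpport` is the ha.cf directive for the port, that 694 applies when it is absent, and that the names on the `node` line are expected to match `uname -n` on each host.

```shell
#!/bin/sh
# Sketch only: helper names and HACF are hypothetical, not heartbeat tools.
HACF="${HACF:-/etc/ha.d/ha.cf}"

# Print every node name found on "node" lines, one per line.
hacf_nodes() {
    awk '$1 == "node" { for (i = 2; i <= NF; i++) print $i }' "${1:-$HACF}"
}

# Print the heartbeat UDP port: the "udpport" value if set, else 694
# (assumed default, consistent with the port shown in the log above).
hacf_udpport() {
    awk '$1 == "udpport" { p = $2 } END { print (p == "" ? 694 : p) }' "${1:-$HACF}"
}

# Succeed only if this host's `uname -n` is among the configured nodes.
hacf_lists_me() {
    hacf_nodes "${1:-$HACF}" | grep -qxF "$(uname -n)"
}
```

Run against the ha.cf shown in the quoted message, `hacf_nodes` should list both node names and `hacf_udpport` the port to look for in the firewall rules on each machine, using whichever firewall tool the host actually runs.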

------------------------------

Message: 6
Date: Tue, 15 May 2007 14:55:29 +0200
From: Benjamin Watine <[EMAIL PROTECTED]>
Subject: [Linux-HA] What heartbeat version to install now ?
To: General Linux-HA mailing list <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi

I'm about to install Heartbeat on a fresh Debian 4 system, and I was
wondering which version of heartbeat I should install.

Some bugs have been fixed since v2.0.8, and I would like to install these
fixes too. So should I install v2.0.8 and apply the patches that I want, or
install the latest development version?

If you think there's no major problem installing the latest snapshot, where
can I find it? Here?
http://hg.linux-ha.org/dev/archive/tip.tar.bz2

Any other version to suggest?

Thanks in advance.

Benjamin

------------------------------

Message: 7
Date: Tue, 15 May 2007 09:06:53 -0400
From: "Benjamin Lawetz" <[EMAIL PROTECTED]>
Subject: RE: [Linux-HA] MySQL Master Master
To: "'General Linux-HA mailing list'" <[email protected]>
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="US-ASCII"

Hi Markus,

If I recall correctly, you need to reset the failed resource after you
manually clean it up. Your monitor failed and heartbeat couldn't restart the
resource, so it concludes there is a problem with this resource on this node
that it cannot fix itself. You have to fix it manually and then reset the
failed resource.

From the http://www.linux-ha.org/v2/AdminTools/crm_resource page:

12. Resetting a failed resource after having been manually cleaned up

    crm_resource -C -H c001n02 -r my_frist_ip

> -----Original Message-----
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED] On Behalf Of Markus W.
> Sent: May 15, 2007 6:39 AM
> To: General Linux-HA mailing list
> Subject: Re: [Linux-HA] MySQL Master Master
>
> Starting from scratch:
>
> OS: Linux, RHEL 4, Kernel 2.6.9-55.EL
> HA: RPM Installation from
> http://dev.centos.org/centos/4/testing/i386/RPMS/heartbeat-2.0.8-2.el4.centos
>
> Configuration, Log, Cibadmin Files: see Attachements
> - xx.xx.xx.xx: Ping IP
> - yy.yy.yy.yy: Cluster IP
> - zz.zz.zz.zz: Cluster Broadcast
>
> Problem:
> Edit /etc/init.d/httpd on node 1 to return 1 on startup (simulate error)
> Stop httpd on node1 => heartbeat try to restart httpd on node 1 (fail)
>   => httpd or the group Web failover to node2
> Reset /etc/init.d/httpd on node 1 to normal behaviour
> Start httpd on node1
> Set node2 in standby mode => heartbeat wont failback to node1
>
> Best regards,
>
> Markus
>
> Dejan Muhamedagic wrote:
> > On Mon, May 14, 2007 at 11:01:16AM +0200, Markus W. wrote:
> >
> >> Ok, I understand nothing. I have the same problem with apache like
> >> mysql. If apache run on the first node and something goes wrong on
> >> that node apache would switch to the second node - great. Apache wont
> >> failback to the first node if I repair the first node and switch the
> >> second node into standby mode - bad.
> >>
> >
> > With default settings, resources should move back to their preferred
> > node, once that one is live again. You'd probably want to post the
> > logs and the configuration. See http://linux-ha.org/ReportingProblems
> >
> >
> >> Anywhere on the ha "universe" page there was an information about this.
> >> But I don't understand why heartbeat dont try the first node just
> >> once again. If the first node is ok why heartbeat shouldn't move the
> >> resource back to the first node? Ok, if the first node isn't ok again
> >> I would understand heartbeat is given up to run the resource anywhere.
> >>
> >> Thanks
> >>
> >> Benjamin Lawetz wrote:
> >>
> >>> I have a vague impression that you might run into problems with the
> >>> dummy mysql script. From memory (and one of the gurus here will
> >>> correct me if I'm wrong) heartbeat can call "status" on startup or
> >>> certain occasions.
> >>>
> >>> So having status return an "All OK" when the resource agent should
> >>> not be running might cause unexpected behaviour. You might need to
> >>> implement a dummy start and stop and status of just touching or
> >>> deleting a fake pid file and returning the status according to this.
> >>> You could then implement your function in the monitor part of the
> >>> script.
> >>>
> >>> But I may be wrong.
> >>>
> >>>
> >>>
> >>>> -----Original Message-----
> >>>> From: [EMAIL PROTECTED]
> >>>> [mailto:[EMAIL PROTECTED] On Behalf Of Markus W.
> >>>> Sent: May 11, 2007 7:17 AM
> >>>> To: General Linux-HA mailing list
> >>>> Subject: Re: [Linux-HA] MySQL Master Master
> >>>>
> >>>> Hi Benjamin,
> >>>>
> >>>> Wow! It rocks!! Thanks!!!
> >>>>
> >>>> As information the lsb dummy mysql ha script:
> >>>>
> >>>> case "$1" in
> >>>>   start)
> >>>>         exit 0
> >>>>         ;;
> >>>>   stop)
> >>>>         exit 0
> >>>>         ;;
> >>>>   status)
> >>>>         status mysqld
> >>>>         if [ $? -eq "0" ]; then
> >>>>           /usr/sbin/attrd_updater -n mysql_running -d 3s -v 1
> >>>>           exit 0
> >>>>         else
> >>>>           /usr/sbin/attrd_updater -n mysql_running -d 3s -v 0
> >>>>           exit 3
> >>>>         fi
> >>>>         ;;
> >>>>   *)
> >>>>         echo $"Usage: $0 {start|stop|status} (start|stop faked)"
> >>>>         exit 1
> >>>> esac
> >>>>
> >>>> --
> >>>> Markus
> >>>>
> >>>>
> >>>> Benjamin Lawetz wrote:
> >>>>
> >>>>> Hi Markus,
> >>>>>
> >>>>> I ran into the same problem.
> >>>>> Didn't find any better way than to modify the monitoring script
> >>>>> of mysql and add in the case of a failure:
> >>>>>
> >>>>> /usr/sbin/attrd_updater -n mysql_running -d 5s -v 0
> >>>>>
> >>>>> And in the case of a success:
> >>>>>
> >>>>> /usr/sbin/attrd_updater -n mysql-mod_running -d 5s -v 1
> >>>>>
> >>>>> Then running the monitor script as a clone:
> >>>>>
> >>>>> <clone id="mysql">
> >>>>>   <instance_attributes id="mysql">
> >>>>>     <attributes>
> >>>>>       <nvpair id="mysql-clone_node_max" name="clone_node_max" value="1"/>
> >>>>>     </attributes>
> >>>>>   </instance_attributes>
> >>>>>   <primitive id="mysql-child" provider="heartbeat" class="ocf" type="mysql">
> >>>>>     <operations>
> >>>>>       <op id="mysql-child-monitor" name="monitor" interval="20s" timeout="40s" prereq="nothing">
> >>>>>         <instance_attributes id="mysql-child-monitor-attr">
> >>>>>         </instance_attributes>
> >>>>>       </op>
> >>>>>       <op id="mysql-child-start" name="start" prereq="nothing"/>
> >>>>>     </operations>
> >>>>>   </primitive>
> >>>>> </clone>
> >>>>>
> >>>>> And then had a constraint:
> >>>>>
> >>>>> <rsc_location rsc="group_1" id="cli-stop2-group_1">
> >>>>>   <rule score="-INFINITY" id="cli-stop2-rule-group_1">
> >>>>>     <expression operation="lte" value="0" id="cli-stop2-expr-group_1" attribute="mysql_running"/>
> >>>>>   </rule>
> >>>>> </rsc_location>
> >>>>>
> >>>>> This will run the monitor on every node and set the score to
> >>>>> -INFINITY for the node where mysql fails.
> >>>>>
> >>>>> If mysql comes back online though, the "mysql_running" will be set
> >>>>> to "1" but I don't think it will trigger a recalculation of the
> >>>>> scores. Haven't figured out yet how to cause this.
> >>>>>
> >>>>> Hope this helps

------------------------------

End of Linux-HA Digest, Vol 42, Issue 54
****************************************
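
The MySQL thread above suggests a dummy start/stop that only touches or deletes a fake pid file, with status answering from that file while the attribute follows the real MySQL check. A minimal shell sketch of that idea follows, under stated assumptions: STATE_FILE, CHECK_CMD, ATTR_CMD and both function names are hypothetical, the default check command is a guess at how the host reports MySQL status, and the attrd_updater options are the ones used in the quoted script. It is not a tested resource agent.

```shell
#!/bin/sh
# Sketch only: names below are hypothetical and chosen for illustration.
STATE_FILE="${STATE_FILE:-/var/run/mysql-dummy.state}"   # the "fake pid file"
CHECK_CMD="${CHECK_CMD:-service mysqld status}"          # real MySQL check (assumed)
ATTR_CMD="${ATTR_CMD:-/usr/sbin/attrd_updater}"          # as in the quoted script

# Publish mysql_running=1 if the real check succeeds, 0 otherwise.
publish_mysql_state() {
    if $CHECK_CMD >/dev/null 2>&1; then value=1; else value=0; fi
    $ATTR_CMD -n mysql_running -d 3s -v "$value"
}

# start/stop only record state; status reports that state and refreshes
# the attribute. Return codes follow LSB: 0 = running, 3 = not running.
dummy_action() {
    case "$1" in
        start)
            touch "$STATE_FILE"
            return 0 ;;
        stop)
            rm -f "$STATE_FILE"
            return 0 ;;
        status)
            publish_mysql_state
            [ -f "$STATE_FILE" ] && return 0
            return 3 ;;
        *)
            echo "Usage: $0 {start|stop|status}" >&2
            return 1 ;;
    esac
}
```

In an init script this would be called as `dummy_action "$1"; exit $?`. The design choice is that status no longer reports "running" for a resource heartbeat never started, which is the concern raised in the thread, while the `mysql_running` attribute used by the location constraint still tracks the real daemon.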
