Hi,

On Tue, Feb 24, 2009 at 12:44:22PM -0000, Stef Morrell wrote:
> Hello,
>
> It seems I can't reboot my primary without losing all my services.
> Primary goes down, secondary picks up and starts, then for some reason
> immediately stops all resources.
>
> When the primary comes back up, both nodes are in standby mode and I
> have to issue a hb_takeover on the primary to get resources running
> again. If the secondary goes down, primary carries on servicing
> resources as normal. Whilst the primary is down, it is impossible to
> persuade heartbeat to start resources on the secondary.
>
> This is clearly a problem for kernel upgrades, hardware servicing etc. I
> need to be able to take the primary down on occasion without losing all
> resources.
>
> Using 2.1.4 with drbd 8.3.0 in a 2 node R1 style cluster.
>
> Attached: ha.cf, haresources & logfiles from primary & secondary nodes.
>
> Regards
>
> Stef
> --
> Stefan Morrell | Operations Director
> Tel: 0845 3452820 | Alpha Omega Computers Ltd
> Fax: 0845 3452830 | Incorporating Level 5 Internet
> [email protected] | [email protected]
>
> Standard Disclaimer: http://www.aoc-uk.com/16.asp
>
> Alpha Omega Computers Ltd, Unit 57, BBTC, Grange Road, Batley, WF17 6ER.
> Registered in England No. 3867142. VAT No. GB734421454
>
> Feb 24 12:29:00 fedecks-2 ResourceManager[1695]: [1844]: debug: Starting /etc/ha.d/resource.d/postgrey start
> Feb 24 12:29:00 fedecks-2 ResourceManager[1695]: [1847]: debug: /etc/ha.d/resource.d/postgrey start done. RC=9
> Feb 24 12:29:00 fedecks-2 ResourceManager[1695]: [1848]: ERROR: Return code 9 from /etc/ha.d/resource.d/postgrey
> Feb 24 12:29:00 fedecks-2 ResourceManager[1695]: [1849]: CRIT: Giving up resources due to failure of postgrey

The postgrey script returned 9 on start, and heartbeat treats that
non-zero return code as a failed start and gives up all resources.
You have to fix the postgrey script so that "start" exits with rc 0
when the service is already running.

Thanks,

Dejan
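
To illustrate the kind of fix meant above, here is a minimal sketch of
an idempotent start/stop wrapper. It is not the actual postgrey script:
the pid file path, the location of the real init script, and the
function names (is_running, safe_start, safe_stop) are all assumptions
made up for the example, so adjust them to your system.

```sh
#!/bin/sh
# Sketch only. Both paths below are assumptions, not taken from the
# real postgrey package; check where your distribution puts them.
PIDFILE="${PIDFILE:-/var/run/postgrey.pid}"
REAL_SCRIPT="${REAL_SCRIPT:-/etc/init.d/postgrey}"

# Return 0 if PIDFILE exists and names a live process, non-zero otherwise.
is_running() {
    [ -f "$PIDFILE" ] || return 1
    pid=$(cat "$PIDFILE" 2>/dev/null) || return 1
    [ -n "$pid" ] && kill -0 "$pid" 2>/dev/null
}

# Start the service, but report success (rc 0) if it is already running
# instead of passing on an "already started" error code.
safe_start() {
    if is_running; then
        echo "postgrey already running"
        return 0
    fi
    "$REAL_SCRIPT" start
}

# Stop the service, and likewise report success if it is already stopped.
safe_stop() {
    if ! is_running; then
        echo "postgrey already stopped"
        return 0
    fi
    "$REAL_SCRIPT" stop
}
```

A resource script built on this would call safe_start for "start" and
safe_stop for "stop". Note that a pid file check like this can be fooled
by a stale pid file whose pid has been reused by another process, so
treat it as a starting point rather than a finished script.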
