My two firewalls with HA worked like a charm across several versions since Fedora 9. Just after upgrading to Fedora 17, they REBOOT CONTINUALLY.
> # for a in heartbeat heartbeat-libs resource-agents; do rpm -q $a; done > heartbeat-3.0.4-1.fc17.2.i686 > heartbeat-libs-3.0.4-1.fc17.2.i686 > resource-agents-3.9.2-2.fc17.1.i686 Investigating the problem I noticed an execution of: > ifconfig eth1:0: (notice the colon at end of interface name) wich results in ha-log these lines: ---------------------------------------------------------------------------------------------------------------------------------------------------------------------- ResourceManager[2051]: 2012/06/08_08:36:15 info: Releasing resource group: gw2 10.1.1.20/24/eth1 ResourceManager[2051]: 2012/06/08_08:36:15 info: Running /etc/ha.d/resource.d/IPaddr 10.1.1.20/24/eth1 stop IPaddr[2116]: 2012/06/08_08:36:15 INFO: ifconfig eth1:0: down IPaddr[2116]: 2012/06/08_08:36:15 WARNING: IP Address 1.1.1.20 NOT released: rc=255 IPaddr[2090]: 2012/06/08_08:36:15 ERROR: Generic error ResourceManager[2051]: 2012/06/08_08:36:15 ERROR: Return code 1 from /etc/ha.d/resource.d/IPaddr ResourceManager[2051]: 2012/06/08_08:36:16 info: Retrying failed stop operation [10.1.1.20/24/eth1] ResourceManager[2051]: 2012/06/08_08:36:16 info: Running /etc/ha.d/resource.d/IPaddr 10.1.1.20/24/eth1 stop IPaddr[2208]: 2012/06/08_08:36:16 INFO: ifconfig eth1:0: down IPaddr[2208]: 2012/06/08_08:36:16 WARNING: IP Address 10.1.1.20 NOT released: rc=255 IPaddr[2182]: 2012/06/08_08:36:16 ERROR: Generic error [couple seconds after, the machine REBOOTS] ---------------------------------------------------------------------------------------------------------------------------------------------------------------------- As consequence "/sbin/reboot" is executed ALWAYS to free the ip address (function RecoverFromFailedStop). Looking thru the code (/usr/lib/ocf/resource.d/heartbeat/IPaddr) I found: > ifname=`echo "$ifname" | sed s'%:$%%'` but only in function "find_interface_solaris". As I'm using Linux, to fix the problem, just include the SAME line above in function "find_interface_generic". I added the above line in "find_interface_generic" and the command: > ifconfig eth1:0: becomes > ifconfig eth1:0 So IPaddr/ifconfig was able to release the ip address and everything worked as before. Don't forget to update the Fedora 17 Updates repository. Thanks Guilsson _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
