My two firewalls with HA worked like a charm across several versions
since Fedora 9.
Just after upgrading to Fedora 17, they REBOOT CONTINUALLY.

> # for a in heartbeat heartbeat-libs resource-agents; do rpm -q $a; done
> heartbeat-3.0.4-1.fc17.2.i686
> heartbeat-libs-3.0.4-1.fc17.2.i686
> resource-agents-3.9.2-2.fc17.1.i686

Investigating the problem I noticed an execution of:
> ifconfig eth1:0:
(notice the colon at end of interface name) wich results in ha-log these lines:
----------------------------------------------------------------------------------------------------------------------------------------------------------------------
ResourceManager[2051]:  2012/06/08_08:36:15 info: Releasing resource
group: gw2 10.1.1.20/24/eth1
ResourceManager[2051]:  2012/06/08_08:36:15 info: Running
/etc/ha.d/resource.d/IPaddr 10.1.1.20/24/eth1 stop
IPaddr[2116]:   2012/06/08_08:36:15 INFO: ifconfig eth1:0: down
IPaddr[2116]:   2012/06/08_08:36:15 WARNING: IP Address 1.1.1.20 NOT
released: rc=255
IPaddr[2090]:   2012/06/08_08:36:15 ERROR:  Generic error
ResourceManager[2051]:  2012/06/08_08:36:15 ERROR: Return code 1 from
/etc/ha.d/resource.d/IPaddr
ResourceManager[2051]:  2012/06/08_08:36:16 info: Retrying failed stop
operation [10.1.1.20/24/eth1]
ResourceManager[2051]:  2012/06/08_08:36:16 info: Running
/etc/ha.d/resource.d/IPaddr 10.1.1.20/24/eth1 stop
IPaddr[2208]:   2012/06/08_08:36:16 INFO: ifconfig eth1:0: down
IPaddr[2208]:   2012/06/08_08:36:16 WARNING: IP Address 10.1.1.20 NOT
released: rc=255
IPaddr[2182]:   2012/06/08_08:36:16 ERROR:  Generic error
[couple seconds after, the machine REBOOTS]
----------------------------------------------------------------------------------------------------------------------------------------------------------------------
As consequence "/sbin/reboot" is executed ALWAYS to free the ip
address (function RecoverFromFailedStop).

Looking thru the code (/usr/lib/ocf/resource.d/heartbeat/IPaddr) I found:
> ifname=`echo "$ifname" | sed s'%:$%%'`
but only in function "find_interface_solaris".
As I'm using Linux, to fix the problem, just include the SAME line
above in function "find_interface_generic".

I added the above line in "find_interface_generic" and the command:
> ifconfig eth1:0:
becomes
> ifconfig eth1:0

So IPaddr/ifconfig was able to release the ip address and everything
worked as before.

Don't forget to update the Fedora 17 Updates repository.

Thanks
Guilsson
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to