I am in the testing stage of my 2 node HA cluster.  I am running
heartbeat 2.1.3_3 and DRBD 8.0.8.  My highly available resources are

1 IP address
sshd ( I have a secondary admin sshd process running on a different port)
a custom java application

We are also running rsync over ssh as in rsync -av --rsh="ssh ..."

When a client is connected and rsyncing data I issue an hb_takeover
from the secondary node.  Everything swaps over to the new machine
just fine.  We rerun the client and we get a connection timeout
message.  Then I run hb_takeover from the new secondary node (initial
primary) and again all resources swap over successfully. We try the
client again and it works.

We have a Watchguard Firewall between the client and the cluster.
Behind the firewall I am able to ssh from the secondary node to the
primary node on the internal ip address that is a resource.  I have
full connectivity between the machines on all ip addresses.

I feel this is an ARP cache issue on the firewall.

My question to the masses is this.

Does/Can heartbeat do any upstream ARP management at its router?
If not how can one programatically flush the ARP cache on a firewall
from another machine?  Is this possible?

regards,

Doug



-- 
What profits a man if he gains the whole world yet loses his soul?
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to