On Mon, May 07, 2007 at 01:18:38PM -0400, Benjamin Lawetz wrote: > > > I've been running some tests on my heartbeat setup with STONITH. > > > When I go live, there will a serial connection, a crossover > > ethernet > > > and the main ethernet for heartbeat. For the purpose of my > > tests, I've > > > changed the config just to broadcast the heartbeat on the > > crossover cable. > > > Let's say I have node1 running resource1 and node2 > > running resource2 > > > with node2 being the DC. When I shutdown the interface: > > > > > > - Node1 notices the break, complains and doesn't do much else > > > - Node2 notices the break, complains and STONITHs Node1 > > > > Interesting. Do you have logs/config? > > Tried getting you some this morning, but everything works hunky-dory... Must > have been a keyboard/chair interface problem. I guess a week-ends rest does > wonders for the mind. > > > > I know a quorum server would allow a 3rd opinion in the > > decision, but > > > I feel this is added complexity (in 2-3 years the only > > failures I've > > > had on these services was because of the redundancy protection. The > > > services have been going strong). > > > > Unfortunately, that is often the case. > > Yeah, looking into that now. I guess that's the way to go. > I'm actually having a problem with the TLS. > > I've generated the certificates as indicated on > http://www.linux-ha.org/QuorumServerGuide quorum server starts up fine, but > when I start up a heartbeat node, I get in the quorumd logs: > > quorumd: [10660]: WARN: handshake failed > quorumd: [10660]: ERROR: on_listen tls handshake failed > > It seems I messed something up in the certificate generation, I re-read the > instructions and tried again without any success. Anyone run into this > problem before ? > (I did set the common name of the certificates to the name of the cluster)
You can try to debug the problem using openssl (s_client and s_server commands). > On the quorum server /etc/ha.d/quorumd.conf: > cluster mysqlcluster > version 2_0_8 > interval 1000 > timeout 5000 > takeover 3000 > giveup 2000 > nodenum 3 > weight 300 > > On each node /etc/ha.d/ha.cf : > #serial /dev/ttyS0 > keepalive 2 > warntime 10 > deadtime 30 > initdead 120 > baud 19200 > ucast eth0 10.10.10.226 > #ucast eth1 192.168.6.2 > udpport 694 > cluster mysqlcluster > quorum_server quorum.domain.com > auto_failback on > node mysql1.domain.com > node mysql2.domain.com > ping 10.10.10.3 > ping 10.10.10.1 > crm yes > > Thanks again for your help > -- > Benjamin > T?liPhone inc. > > > -------------- > N'envoy? pas de courriel ? l'adresse qui suit, sinon vous serez > automatiquement mis sur notre liste noire. > [EMAIL PROTECTED] > Do not send an email to the email above or you will automatically be > blacklisted. > > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems -- Dejan _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
