On Mon, May 07, 2007 at 01:18:38PM -0400, Benjamin Lawetz wrote:
> > >   I've been running some tests on my heartbeat setup with STONITH.
> > > When I go live, there will a serial connection, a crossover 
> > ethernet 
> > > and the main ethernet for heartbeat. For the purpose of my 
> > tests, I've 
> > > changed the config just to broadcast the heartbeat on the 
> > crossover cable.
> > >   Let's say I have node1 running resource1 and node2 
> > running resource2 
> > > with node2 being the DC. When I shutdown the interface:
> > > 
> > >   - Node1 notices the break, complains and doesn't do much else
> > >   - Node2 notices the break, complains and STONITHs Node1
> > 
> > Interesting. Do you have logs/config?
> 
> Tried getting you some this morning, but everything works hunky-dory... Must
> have been a keyboard/chair interface problem. I guess a week-ends rest does
> wonders for the mind.
> 
> > > I know a quorum server would allow a 3rd opinion in the 
> > decision, but 
> > > I feel this is added complexity (in 2-3 years the only 
> > failures I've 
> > > had on these services was because of the redundancy protection. The 
> > > services have been going strong).
> > 
> > Unfortunately, that is often the case.
> 
> Yeah, looking into that now. I guess that's the way to go.
> I'm actually having a problem with the TLS.
> 
> I've generated the certificates as indicated on
> http://www.linux-ha.org/QuorumServerGuide quorum server starts up fine, but
> when I start up a heartbeat node, I get in the quorumd logs: 
> 
> quorumd: [10660]: WARN: handshake failed
> quorumd: [10660]: ERROR: on_listen tls handshake failed
> 
> It seems I messed something up in the certificate generation, I re-read the
> instructions and tried again without any success. Anyone run into this
> problem before ?
> (I did set the common name of the certificates to the name of the cluster)

You can try to debug the problem using openssl (s_client and
s_server commands).

> On the quorum server /etc/ha.d/quorumd.conf:
> cluster         mysqlcluster
> version         2_0_8
> interval        1000
> timeout         5000
> takeover        3000
> giveup          2000
> nodenum         3
> weight          300
> 
> On each node /etc/ha.d/ha.cf :
> #serial /dev/ttyS0
> keepalive 2
> warntime 10
> deadtime 30
> initdead 120
> baud 19200
> ucast eth0 10.10.10.226
> #ucast eth1 192.168.6.2
> udpport 694
> cluster mysqlcluster
> quorum_server quorum.domain.com
> auto_failback on
> node mysql1.domain.com
> node mysql2.domain.com
> ping 10.10.10.3
> ping 10.10.10.1
> crm yes
> 
> Thanks again for your help
> -- 
> Benjamin
> T?liPhone inc.
> 
> 
> --------------
> N'envoy? pas de courriel ? l'adresse qui suit, sinon vous serez
> automatiquement mis sur notre liste noire.
> [EMAIL PROTECTED]
> Do not send an email to the email above or you will automatically be
> blacklisted.
> 
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems

-- 
Dejan
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to