> >     I've been running some tests on my heartbeat setup with STONITH.
> > When I go live, there will a serial connection, a crossover 
> ethernet 
> > and the main ethernet for heartbeat. For the purpose of my 
> tests, I've 
> > changed the config just to broadcast the heartbeat on the 
> crossover cable.
> >     Let's say I have node1 running resource1 and node2 
> running resource2 
> > with node2 being the DC. When I shutdown the interface:
> > 
> >     - Node1 notices the break, complains and doesn't do much else
> >     - Node2 notices the break, complains and STONITHs Node1
> 
> Interesting. Do you have logs/config?

Tried getting you some this morning, but everything works hunky-dory... Must
have been a keyboard/chair interface problem. I guess a week-ends rest does
wonders for the mind.

> > I know a quorum server would allow a 3rd opinion in the 
> decision, but 
> > I feel this is added complexity (in 2-3 years the only 
> failures I've 
> > had on these services was because of the redundancy protection. The 
> > services have been going strong).
> 
> Unfortunately, that is often the case.

Yeah, looking into that now. I guess that's the way to go.
I'm actually having a problem with the TLS.

I've generated the certificates as indicated on
http://www.linux-ha.org/QuorumServerGuide quorum server starts up fine, but
when I start up a heartbeat node, I get in the quorumd logs: 

quorumd: [10660]: WARN: handshake failed
quorumd: [10660]: ERROR: on_listen tls handshake failed

It seems I messed something up in the certificate generation, I re-read the
instructions and tried again without any success. Anyone run into this
problem before ?
(I did set the common name of the certificates to the name of the cluster)

On the quorum server /etc/ha.d/quorumd.conf:
cluster         mysqlcluster
version         2_0_8
interval        1000
timeout         5000
takeover        3000
giveup          2000
nodenum         3
weight          300

On each node /etc/ha.d/ha.cf :
#serial /dev/ttyS0
keepalive 2
warntime 10
deadtime 30
initdead 120
baud 19200
ucast eth0 10.10.10.226
#ucast eth1 192.168.6.2
udpport 694
cluster mysqlcluster
quorum_server quorum.domain.com
auto_failback on
node mysql1.domain.com
node mysql2.domain.com
ping 10.10.10.3
ping 10.10.10.1
crm yes

Thanks again for your help
-- 
Benjamin
TéliPhone inc.


--------------
N'envoyé pas de courriel à l'adresse qui suit, sinon vous serez
automatiquement mis sur notre liste noire.
[EMAIL PROTECTED]
Do not send an email to the email above or you will automatically be
blacklisted.

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to