Hi

I have a 2 node qdiskd cluster (OS is RHES 5.0) with
2x heartbeat cross cables between the 2 nodes

Currently we manually issue the following commands
to start the cluster services in the sequence below :
a) cd /etc/init.d
b) ./cman start
c) ./clvmd ...
d) ./qdiskd ...
e) ./rgmanager ...

& on the primary node, issue "clusvcadm ..... Oracle_Service"
to start oracle services which will also mount the SAN partition.

Occasionally, we ran into the error below & cluster breaks on
both nodes (ie SAN partition unmounted on both and Oracle
services stopped on both) :

lurgmgrd[5843]: <emerg> #1: Quorum Dissolved


What's wrong?

Usually when this happens, I could usually make the first node
rejoin the cluster + mount the SAN partition but the 2nd node
usually can't rejoin the cluster/mount SAN and has to be rebooted
and reissued with the commands a-e for it to rejoin the cluster.
Thanks for any insights
--
Linux-cluster mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/linux-cluster

Reply via email to