[please preserve the CC when replying, thanks]
Hi-
I'm trying to set up a new cluster with OpenAIS 1.1.1, Corosync 1.1.2, and
LVM 2.02.56. Whenever I try to bring up the cluster (using aisexec, which
sets the COROSYNC_DEFAULT_CONFIG_IFACE environment variable and then runs
corosync), the nodes all seem to immediately find each other based on the
debugging output, but no other programs are able to use the cluster. In
particular, clvmd startup fails with:
r...@black-mesa:/etc/corosync# clvmd -d
CLVMD[6f0d780]: Jan 4 01:33:01 CLVMD started
CLVMD[6f0d780]: Jan 4 01:33:01 Cannot join clvmd process group: 6
CLVMD[6f0d780]: Jan 4 01:33:01 Can't initialise cluster interface
Can't initialise cluster interface
and even corosync-cfgtool fails with the same error:
r...@black-mesa:/etc/corosync# corosync-cfgtool -s
Printing ring status.
Local node ID 16779018
Could not get the ring status, the error is: 6
Looking at the source, I see that error 6 is "SA_AIS_ERR_TRY_AGAIN". I'm
not sure how long I'm expected to wait for the cluster to start
functioning; I've already tried waiting 5 minutes, which seems like it
should be long enough.
I've also tried configuring pacemaker (by adding a service block to
/etc/corosync/corosync.conf), and it starts functioning within 30 seconds
of bringing up corosync, even while clvmd and corosync-cfgtool refuse to
start.
When I don't set the COROSYNC_DEFAULT_CONFIG_IFACE environment variable
and just load the openais_lck service, the cluster does eventually start
working, and I can even start clvmd, but the locking appears to be
nonfunctional - running two lvcreate operations simultaneously causes a
deadlock every time.
I've attached a copy of my configuration file below. Is there something
missing in my configuration, or is this a bug in OpenAIS/Corosync?
(The reason we're trying to get CLVM + OpenAIS working at all is that
we've experienced numerous deadlocks with clvmd on dlm, which we are
already running.)
Thanks,
--Quentin
r...@black-mesa:/etc/corosync# cat corosync.conf
compatibility: none
totem {
version: 2
secauth: off
interface {
ringnumber: 0
bindnetaddr: 10.7.0.0
mcastaddr: 226.94.1.1
mcastport: 5405
}
}
logging {
to_syslog: yes
syslog_facility: daemon
debug: on
timestamp: on
}
amf {
mode: disabled
}
service {
name: pacemaker
ver: 0
}
_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais