[please preserve the CC when replying, thanks]

Hi-

I'm trying to set up a new cluster with OpenAIS 1.1.1, Corosync 1.1.2, and
LVM 2.02.56. Whenever I bring up the cluster (using aisexec, which sets the
COROSYNC_DEFAULT_CONFIG_IFACE environment variable and then runs corosync),
the debugging output suggests that the nodes all find each other
immediately, but no other programs are able to use the cluster. In
particular, clvmd startup fails with:

r...@black-mesa:/etc/corosync# clvmd -d
CLVMD[6f0d780]: Jan  4 01:33:01 CLVMD started
CLVMD[6f0d780]: Jan  4 01:33:01 Cannot join clvmd process group: 6
CLVMD[6f0d780]: Jan  4 01:33:01 Can't initialise cluster interface
   Can't initialise cluster interface

and even corosync-cfgtool fails with the same error:

r...@black-mesa:/etc/corosync# corosync-cfgtool -s
Printing ring status.
Local node ID 16779018
Could not get the ring status, the error is: 6

Looking at the source, I see that error 6 is "SA_AIS_ERR_TRY_AGAIN". I'm 
not sure how long I'm expected to wait for the cluster to start 
functioning; I've already tried waiting 5 minutes, which seems like it 
should be long enough.
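
To make the waiting reproducible, the polling can be scripted. The following
is only a minimal sketch: wait_until_ready is a hypothetical helper, not part
of Corosync or OpenAIS, and it assumes that the "error is: 6" text shown in
the output above is a reliable sign of TRY_AGAIN (I have not checked whether
corosync-cfgtool also sets a nonzero exit status in that case, so the helper
tests both).

```shell
#!/bin/sh
# Hypothetical helper: wait_until_ready MAX_TRIES DELAY_SECONDS COMMAND [ARGS...]
# Runs COMMAND until it exits 0 and its output does not contain "error is: 6",
# or until MAX_TRIES attempts have been made.
wait_until_ready() {
    max_tries=$1
    delay=$2
    shift 2
    attempt=1
    output=
    while [ "$attempt" -le "$max_tries" ]; do
        # Capture stdout and stderr so the error text can be inspected.
        if output=$("$@" 2>&1) &&
           ! printf '%s\n' "$output" | grep -q 'error is: 6'; then
            printf '%s\n' "$output"
            return 0
        fi
        attempt=$((attempt + 1))
        # Sleep only if another attempt will follow.
        if [ "$attempt" -le "$max_tries" ]; then
            sleep "$delay"
        fi
    done
    # Give up: show the last output on stderr and report failure.
    printf '%s\n' "$output" >&2
    return 1
}
```

For example, "wait_until_ready 30 10 corosync-cfgtool -s" would poll every 10
seconds for roughly five minutes, which matches the wait I tried by hand.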

I've also tried configuring pacemaker (by adding a service block to 
/etc/corosync/corosync.conf), and it starts functioning within 30 seconds 
of bringing up corosync, even while clvmd and corosync-cfgtool refuse to 
start.

When I don't set the COROSYNC_DEFAULT_CONFIG_IFACE environment variable
and instead load only the openais_lck service, the cluster does eventually
start working, and I can even start clvmd, but the locking appears to be
nonfunctional: running two lvcreate operations simultaneously causes a
deadlock every time.

I've attached a copy of my configuration file below. Is there something 
missing in my configuration, or is this a bug in OpenAIS/Corosync?

(The reason we're trying to get CLVM + OpenAIS working at all is that 
we've experienced numerous deadlocks with clvmd on dlm, which we are 
already running.)

Thanks,
--Quentin

r...@black-mesa:/etc/corosync# cat corosync.conf
compatibility: none

totem {
         version: 2
         secauth: off
         interface {
                 ringnumber: 0
                 bindnetaddr: 10.7.0.0
                 mcastaddr: 226.94.1.1
                 mcastport: 5405
         }
}

logging {
         to_syslog: yes
         syslog_facility: daemon
         debug: on
         timestamp: on
}

amf {
         mode: disabled
}
service {
         name: pacemaker
         ver: 0
}

_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais