On 07/30/2010 02:09 PM, Ofer Inbar wrote:
> I forgot to add something useful, appended below.
>    -- Cos
>
> On Fri, Jul 30, 2010 at 05:02:04PM -0400, I wrote:
>> I've gotten further, into another roadblock:
>> Starting the cman service gets as far as successfully running ccsd,
>> but then at the next step where it checks cman status, it fails.
>>
>> | Starting cluster:
>> |    Loading modules... done
>> |    Mounting configfs... done
>> |    Starting ccsd... done
>> |    Starting cman... failed
>> |
>> |                                                            [FAILED]
>>

Is aisexec already running (ie: did you set it to start via init 
script?).  cman starts aisexec, and therefore, the initscript for 
aisexec should be disabled.

system logs would be helpful.

Regards
-steve

>> Any attempt to use cman_tool gives:
>> cman_tool: Cannot open connection to cman, is it running ?
>>
>> strace shows that it fails here:
>> connect(3, {sa_family=AF_FILE, path="/var/run/cman_client"}, 110) = -1 
>> ECONNREFUSED (Connection refused)
>>
>> strace -fp on ccsd shows the same thing, in a loop, like this:
>>
>> [pid  7670] rt_sigprocmask(SIG_BLOCK, [CHLD], [], 8) = 0
>> [pid  7670] rt_sigaction(SIGCHLD, NULL, {SIG_DFL, [], 0}, 8) = 0
>> [pid  7670] rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0
>> [pid  7670] nanosleep({1, 0}, {1, 0})   = 0
>> [pid  7670] socket(PF_FILE, SOCK_STREAM, 0) = 9
>> [pid  7670] fcntl(9, F_SETFD, FD_CLOEXEC) = 0
>> [pid  7670] connect(9, {sa_family=AF_FILE, path="/var/run/cman_client"}, 
>> 110) = -1 ECONNREFUSED (Connection refused)
>> [pid  7670] close(9)                    = 0
>>
>> Named sockets are there and seem to have the right permissions:
>> srw------- 1 root root 0 Jul 30 15:21 /var/run/cman_admin=
>> srw-rw---- 1 root root 0 Jul 30 15:21 /var/run/cman_client=
>>
>> I've been googling for ideas.  I don't really understand what's
>> supposed to using this named socket to communicate with what,
>> but I think this is a way for clients to talk to the cman plugin
>> in aisexec.  aisexec seems okay on all my cluster nodes, and they're
>> all communicating with each other, but I also don't know the right
>> ways to poke at it to check its health.
>>
>> I'm continuing to experiment, and Google for ideas, but maybe
>> someone here can point me in the right direction.
>>    -- Cos
>
> I noticed that lsof on my aisexec daemons on this cluster shows that
> they do *not* have /var/run/cman_client or /var/run/cman_admin open.
> lsof shows nothing has those open.  When I checked on a working
> cluster I saw that aisexec does have both of those open.
>
> I tried rm'ing both and letting them be recreated when I restarted the
> aisexec and cman services, but I still get ECONNREFUSED on cman_client
> when I try running any cman_tool command.
>    -- Cos
> _______________________________________________
> Openais mailing list
> [email protected]
> https://lists.linux-foundation.org/mailman/listinfo/openais

_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais

Reply via email to