Thanks for your reply.
--- On Thu, 3/3/11, Seb <[email protected]> wrote:
>
> There is no <quorumd> section in your config
> file?
No
> Have you been able to identify a quorum disk on the
> nodes?
There is no quorum disk allocated for this configuration. As mentioned,
all I know is that quorum was allocated through the command line.
>
> The host-priv.domain.org
> is in your /etc/hosts? on all nodes?
>
Yes.
> Why have they been rebooted? for
> maintenance/upgrade?
>
For maintenance. But before the reboot, the cluster service on that node was
not shut down.
> Any iptable used?
>
No.
> Could you please provide the logs showing the start
> of the cluster service?
>
Here is the log from one of the servers, from when ccsd started.
_______________________________________________________________________________________________________
Mar 1 20:20:39 host ccsd[5287]: Starting ccsd 2.0.115:
Mar 1 20:20:39 host ccsd[5287]: Built: May 25 2010 04:32:00
Mar 1 20:20:39 host ccsd[5287]: Copyright (C) Red Hat, Inc. 2004 All rights
reserved.
Mar 1 20:20:39 host ccsd[5287]: cluster.conf (cluster name = xxxxxxx, version
= 21) found.
Mar 1 20:20:40 host openais[5302]: [MAIN ] AIS Executive Service RELEASE
'subrev 1887 version 0.80.6'
Mar 1 20:20:40 host openais[5302]: [MAIN ] Copyright (C) 2002-2006 MontaVista
Software, Inc and contributors.
Mar 1 20:20:40 host openais[5302]: [MAIN ] Copyright (C) 2006 Red Hat, Inc.
Mar 1 20:20:40 host openais[5302]: [MAIN ] AIS Executive Service: started and
ready to provide service.
Mar 1 20:20:40 host openais[5302]: [MAIN ] Using default multicast address of
xxx.xxx.xxx.xx
Mar 1 20:20:40 host openais[5302]: [TOTEM] Token Timeout (10000 ms) retransmit
timeout (495 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] token hold (386 ms) retransmits
before loss (20 retrans)
Mar 1 20:20:40 host openais[5302]: [TOTEM] join (60 ms) send_join (0 ms)
consensus (20000 ms) merge (200 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] downcheck (1000 ms) fail to recv
const (50 msgs)
Mar 1 20:20:40 host openais[5302]: [TOTEM] seqno unchanged const (30
rotations) Maximum network MTU 1402
Mar 1 20:20:40 host openais[5302]: [TOTEM] window size per rotation (50
messages) maximum messages per rotation (17 messages)
Mar 1 20:20:40 host openais[5302]: [TOTEM] send threads (0 threads)
Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP token expired timeout (495 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP token problem counter (2000 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP threshold (10 problem count)
Mar 1 20:20:40 host openais[5302]: [TOTEM] RRP mode set to none.
Mar 1 20:20:40 host openais[5302]: [TOTEM] heartbeat_failures_allowed (0)
Mar 1 20:20:40 host openais[5302]: [TOTEM] max_network_delay (50 ms)
Mar 1 20:20:40 host openais[5302]: [TOTEM] HeartBeat is Disabled. To enable
set heartbeat_failures_allowed > 0
Mar 1 20:20:40 host openais[5302]: [TOTEM] Receive multicast socket recv
buffer size (262142 bytes).
Mar 1 20:20:40 host openais[5302]: [TOTEM] Transmit multicast socket send
buffer size (262142 bytes).
Mar 1 20:20:40 host openais[5302]: [TOTEM] The network interface
[192.168.xxx.x] is now up.
Mar 1 20:20:40 host openais[5302]: [TOTEM] Created or loaded sequence id
6160.192.168.xxx.x for this ring.
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering GATHER state from 15.
Mar 1 20:20:40 host openais[5302]: [CMAN ] CMAN 2.0.115 (built May 25 2010
04:32:02) started
Mar 1 20:20:40 host openais[5302]: [MAIN ] Service initialized 'openais CMAN
membership service 2.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais
extended virtual synchrony service'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais
cluster membership service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais
availability management framework B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais
checkpoint service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais event
service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais
distributed locking service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais
message service B.01.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais
configuration service'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais
cluster closed process group service v1.01'
Mar 1 20:20:40 host openais[5302]: [SERV ] Service initialized 'openais
cluster config database access v1.01'
Mar 1 20:20:40 host openais[5302]: [SYNC ] Not using a virtual synchrony
filter.
Mar 1 20:20:40 host openais[5302]: [TOTEM] Creating commit token because I am
the rep.
Mar 1 20:20:40 host openais[5302]: [TOTEM] Saving state aru 0 high seq
received 0
Mar 1 20:20:40 host openais[5302]: [TOTEM] Storing new sequence id for ring
1814
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering COMMIT state.
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering RECOVERY state.
Mar 1 20:20:40 host openais[5302]: [TOTEM] position [0] member 192.168.xxx.x:
Mar 1 20:20:40 host openais[5302]: [TOTEM] previous ring seq 6160 rep
192.168.xxx.x
Mar 1 20:20:40 host openais[5302]: [TOTEM] aru 0 high delivered 0 received
flag 1
Mar 1 20:20:40 host openais[5302]: [TOTEM] Did not need to originate any
messages in recovery.
Mar 1 20:20:40 host openais[5302]: [TOTEM] Sending initial ORF token
Mar 1 20:20:40 host openais[5302]: [CLM ] CLM CONFIGURATION CHANGE
Mar 1 20:20:40 host openais[5302]: [CLM ] New Configuration:
Mar 1 20:20:40 host openais[5302]: [CLM ] Members Left:
Mar 1 20:20:40 host openais[5302]: [CLM ] Members Joined:
Mar 1 20:20:40 host openais[5302]: [CLM ] CLM CONFIGURATION CHANGE
Mar 1 20:20:40 host openais[5302]: [CLM ] New Configuration:
Mar 1 20:20:40 host openais[5302]: [CLM ] r(0) ip(192.168.xxx.x)
Mar 1 20:20:40 host openais[5302]: [CLM ] Members Left:
Mar 1 20:20:40 host openais[5302]: [CLM ] Members Joined:
Mar 1 20:20:40 host openais[5302]: [CLM ] r(0) ip(192.168.xxx.x)
Mar 1 20:20:40 host openais[5302]: [SYNC ] This node is within the primary
component and will provide service.
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering OPERATIONAL state.
Mar 1 20:20:40 host openais[5302]: [CLM ] got nodejoin message 192.168.xxx.x
Mar 1 20:20:41 host ccsd[5287]: Initial status:: Inquorate
Mar 1 20:20:41 host ccsd[5287]: Cluster is not quorate. Refusing connection.
Mar 1 20:20:41 host ccsd[5287]: Error while processing connect: Connection
refused
Mar 1 20:20:42 host ccsd[5287]: Cluster is not quorate. Refusing connection.
Mar 1 20:20:42 host ccsd[5287]: Error while processing connect: Connection
refused
Mar 1 20:20:42 host ccsd[5287]: Cluster is not quorate. Refusing connection.
Mar 1 20:20:42 host ccsd[5287]: Error while processing connect: Connection
refused
_______________________________________________________________________________________________________
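For anyone who wants to pull the key events out of an excerpt like the one above, here is a minimal Python sketch. It is only an illustration: the `summarize` helper is hypothetical and not part of cman, ccsd, or openais, and it assumes the log text contains the same "[TOTEM] entering ... state" and "Cluster is not quorate" messages shown above.

```python
import re

# Hypothetical helper, not part of cman/ccsd/openais.
# Patterns are taken from the messages visible in the excerpt above.
TOTEM_STATE = re.compile(r"\[TOTEM\] entering (\w+) state")
NOT_QUORATE = "Cluster is not quorate"


def summarize(log_text):
    """Return (totem_states, refused_count) for a syslog excerpt."""
    # Mail clients wrap long log lines, so collapse all whitespace and
    # search the flattened text instead of going line by line.
    flat = " ".join(log_text.split())
    states = TOTEM_STATE.findall(flat)
    refused = flat.count(NOT_QUORATE)
    return states, refused


# A few lines copied from the excerpt above, used as sample input.
sample = """\
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering GATHER state from 15.
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering COMMIT state.
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering RECOVERY state.
Mar 1 20:20:40 host openais[5302]: [TOTEM] entering OPERATIONAL state.
Mar 1 20:20:41 host ccsd[5287]: Cluster is not quorate. Refusing connection.
Mar 1 20:20:42 host ccsd[5287]: Cluster is not quorate. Refusing connection.
Mar 1 20:20:42 host ccsd[5287]: Cluster is not quorate. Refusing connection.
"""

states, refused = summarize(sample)
print("totem states:", states)
print("refused connections:", refused)
```

On the sample lines this reports the totem ring reaching the OPERATIONAL state and three refused connections, which matches what the excerpt shows: the ring forms with a single member, but ccsd still considers the cluster inquorate.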
Thanks again
--
Linux-cluster mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/linux-cluster