Hello,

I am running a 3-node corosync/pacemaker cluster. One of the nodes
repeatedly fails to join the cluster. Here is a snippet from the log file:

Oct 13 12:34:03 sh2 crmd: [2292]: info: crm_timer_popped: Welcomed: 1, Integrated: 0
Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State transition S_INTEGRATION -> S_FINALIZE_JOIN [ input=I_INTEGRATED cause=C_TIMER_POPPED origin=crm_timer_popped ]
Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_state_transition: Progressed to state S_FINALIZE_JOIN after C_TIMER_POPPED
Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_state_transition: 1 cluster nodes failed to respond to the join offer.
Oct 13 12:34:03 sh2 crmd: [2292]: info: ghash_print_node:   Welcome reply not received from: sh2 6
Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_log: FSA: Input I_ELECTION_DC from do_dc_join_finalize() received in state S_FINALIZE_JOIN
Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State transition S_FINALIZE_JOIN -> S_INTEGRATION [ input=I_ELECTION_DC cause=C_FSA_INTERNAL origin=do_dc_join_finalize ]
Oct 13 12:34:03 sh2 crmd: [2292]: info: do_dc_join_offer_all: join-7: Waiting on 1 outstanding join acks
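
To make the S_INTEGRATION / S_FINALIZE_JOIN loop in the log above easier to
spot in a longer file, here is a minimal Python sketch that extracts the state
transitions and the node that did not answer the join offer. The regular
expressions are inferred only from the snippet above, and the names
TRANSITION, NO_REPLY, SAMPLE and summarize are hypothetical helpers, not part
of pacemaker or corosync.

```python
import re

# Patterns inferred from the log excerpt above (an assumption, not a
# documented pacemaker log format).
TRANSITION = re.compile(
    r"State\s+transition\s+(\S+)\s+->\s+(\S+)\s+"
    r"\[\s*input=(\S+)\s+cause=(\S+)\s+origin=(\S+)\s*\]"
)
NO_REPLY = re.compile(r"Welcome\s+reply\s+not\s+received\s+from:\s+(\S+)")

# Part of the excerpt from this message, including the mail client's wrapping.
SAMPLE = """\
Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State
transition S_INTEGRATION -> S_FINALIZE_JOIN [ input=I_INTEGRATED
cause=C_TIMER_POPPED origin=crm_timer_popped ]
Oct 13 12:34:03 sh2 crmd: [2292]: info: ghash_print_node:   Welcome
reply not received from: sh2 6
Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State
transition S_FINALIZE_JOIN -> S_INTEGRATION [ input=I_ELECTION_DC
cause=C_FSA_INTERNAL origin=do_dc_join_finalize ]
"""


def summarize(log_text):
    """Return (transitions, silent_nodes) found in a crmd log excerpt."""
    # Collapse all whitespace so entries wrapped across lines still match.
    flat = " ".join(log_text.split())
    transitions = [m.groups() for m in TRANSITION.finditer(flat)]
    silent_nodes = sorted({m.group(1) for m in NO_REPLY.finditer(flat)})
    return transitions, silent_nodes


if __name__ == "__main__":
    transitions, silent_nodes = summarize(SAMPLE)
    for old, new, inp, cause, origin in transitions:
        print(f"{old} -> {new} (input={inp}, cause={cause}, origin={origin})")
    print("No welcome reply from:", ", ".join(silent_nodes))
```

Run against the full log, a repeating pair of transitions together with the
same node name in the second list would indicate that the join keeps timing
out for that one node.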

Any idea what I should look for?

Networking (both rings) seems to work fine. The versions in use are
corosync 1.2.1-4 and pacemaker 1.0.9.1+hg15626-1, from the current
Debian squeeze.

Any hint would be appreciated,

    Thomas
