On 2011-10-13 12:38, Thomas wrote:
> Hello,
> 
> I am using 3-node corosync/pacemaker cluster setup. Repeatedly one of 
> the nodes refuses to join the cluster. Here is a snippet from the log file:
> 
> Oct 13 12:34:03 sh2 crmd: [2292]: info: crm_timer_popped: Welcomed: 1, 
> Integrated: 0
> Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State 
> transition S_INTEGRATION -> S_FINALIZE_JOIN [ input=I_INTEGRATED 
> cause=C_TIMER_POPPED origin=crm_timer_popped ]
> Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_state_transition: Progressed 
> to state S_FINALIZE_JOIN after C_TIMER_POPPED
> Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_state_transition: 1 cluster 
> nodes failed to respond to the join offer.
> Oct 13 12:34:03 sh2 crmd: [2292]: info: ghash_print_node:   Welcome 
> reply not received from: sh2 6
> Oct 13 12:34:03 sh2 crmd: [2292]: WARN: do_log: FSA: Input I_ELECTION_DC 
> from do_dc_join_finalize() received in state S_FINALIZE_JOIN
> Oct 13 12:34:03 sh2 crmd: [2292]: info: do_state_transition: State 
> transition S_FINALIZE_JOIN -> S_INTEGRATION [ input=I_ELECTION_DC 
> cause=C_FSA_INTERNAL origin=do_dc_join_finalize ]
> Oct 13 12:34:03 sh2 crmd: [2292]: info: do_dc_join_offer_all: join-7: 
> Waiting on 1 outstanding join acks
> 
> Any idea what I should look after?
> 
> Networking (both rings) seems to work just fine.

"Seems to"? Have you confirmed with "corosync-cfgtool -s" on all nodes?

> Versions used are: 
> corosync 1.2.1-4 & pacemaker 1.0.9.1+hg15626-1 from current version of 
> debian squeeze.

Please upgrade to the versions in squeeze-backports at the earliest
convenience.

Cheers,
Florian

-- 
Need help with Corosync?
http://www.hastexo.com/now
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to