On 07/18/2011 12:17 PM, Jed Smith wrote:
> Good morning,
> 
> I am not subscribed to the list (yet, waiting on confirmation) so
> please CC me on all replies.
> 
> My employer has several deployments of Pacemaker on top of Corosync
> and we have recently been hitting this:
> 
> Jul 18 12:01:05 xxxx corosync[6065]:   [TOTEM ] FAILED TO RECEIVE
> Jul 18 12:01:15 xxxx corosync[6065]: last message repeated 15 times
> Jul 18 12:01:15 xxxx corosync[6065]:   [pcmk  ] notice:
> pcmk_peer_update: Transitional membership event on ring 268: memb=1,
> new=0, lost=4

Is it possible that the switch dropped the multicast group and didn't
re-form it fast enough to prevent the cluster from partitioning?
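
One rough way to check is to run a small multicast sender on one node and
a listener on another, then see whether the listener goes quiet around the
time corosync logs FAILED TO RECEIVE. The script below is only a sketch of
that idea, not a corosync tool. The group address, port, interval, and
function names are all made up for illustration. Pick a group and port
that do not collide with the mcastaddr/mcastport in your corosync.conf,
and run only one sender at a time.

```python
import socket
import struct
import sys
import time

# Hypothetical test values, NOT taken from the original post.
GROUP = "239.255.42.42"
PORT = 15405


def membership_request(group, iface="0.0.0.0"):
    """Pack the ip_mreq struct: group address followed by local interface."""
    return socket.inet_aton(group) + socket.inet_aton(iface)


def lost_between(prev_seq, seq):
    """Number of sequence numbers skipped between two received packets."""
    return max(0, seq - prev_seq - 1)


def send(group=GROUP, port=PORT, interval=1.0):
    """Send an increasing 64-bit sequence number to the group forever."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    # TTL 1 keeps the test traffic on the local network segment.
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_MULTICAST_TTL, 1)
    seq = 0
    while True:
        sock.sendto(struct.pack("!Q", seq), (group, port))
        seq += 1
        time.sleep(interval)


def listen(group=GROUP, port=PORT, max_gap=5.0):
    """Join the group and report silences and skipped sequence numbers."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind(("", port))
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP,
                    membership_request(group))
    sock.settimeout(max_gap)
    prev = None
    while True:
        try:
            data, sender = sock.recvfrom(64)
        except socket.timeout:
            print(time.ctime(), "nothing received for %.0f s" % max_gap)
            continue
        if len(data) != 8:
            continue  # not one of our test packets
        (seq,) = struct.unpack("!Q", data)
        if prev is not None and lost_between(prev, seq):
            print(time.ctime(), "lost %d packet(s) from %s"
                  % (lost_between(prev, seq), sender[0]))
        prev = seq


if __name__ == "__main__":
    # Usage: script.py send   (on one node)
    #        script.py listen (on another node)
    if len(sys.argv) > 1 and sys.argv[1] == "send":
        send()
    else:
        listen()
```

If the listener reports silences that line up with the timestamps in your
corosync log, that would point at the network rather than at corosync.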

-- 
Digimer
E-Mail:              [email protected]
Freenode handle:     digimer
Papers and Projects: http://alteeve.com
Node Assassin:       http://nodeassassin.org
"At what point did we forget that the Space Shuttle was, essentially,
a program that strapped human beings to an explosion and tried to stab
through the sky with fire and math?"
_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais
