On 07/18/2011 12:17 PM, Jed Smith wrote: > Good morning, > > I am not subscribed to the list (yet, waiting on confirmation) so > please CC me on all replies. > > My employer has several deployments of Pacemaker on top of Corosync > and we have recently been hitting this: > > Jul 18 12:01:05 xxxx corosync[6065]: [TOTEM ] FAILED TO RECEIVE > Jul 18 12:01:15 xxxx corosync[6065]: last message repeated 15 times > Jul 18 12:01:15 xxxx corosync[6065]: [pcmk ] notice: > pcmk_peer_update: Transitional membership event on ring 268: memb=1, > new=0, lost=4
Is it possible that the switch dropped the multicast group, and didn't reform it fast enough to prevent the cluster from partitioning? -- Digimer E-Mail: [email protected] Freenode handle: digimer Papers and Projects: http://alteeve.com Node Assassin: http://nodeassassin.org "At what point did we forget that the Space Shuttle was, essentially, a program that strapped human beings to an explosion and tried to stab through the sky with fire and math?" _______________________________________________ Openais mailing list [email protected] https://lists.linux-foundation.org/mailman/listinfo/openais
