On Fri, Dec 10, 2010 at 03:36:05AM +0000, Preeti Jain wrote: > Hello list, > I am testing network failure case by removing nic cable on one node and > getting > unwanted outcomes as whole cluster gets disturbed and resource appears to > move > on different nodes until it gets stabled on one node and it is also resulting > in > failback. > Like if i remove nic cable from node 1 then failover happens it takes some > time > to move to node 2 but when once again i plugin cable on node 1 a kind of > split > brain happens and resource take sometime to get stabled on node 1 resulting > failback which is again not desired as it should stay on node 2... > Every node says like other cluster nodes coming after partition
> part of log file on node 1 after nic plugin > heartbeat[2521]: 2010/12/08_16:50:02 CRIT: Cluster node Node2 returning after > partition. > heartbeat[2521]: 2010/12/08_16:50:02 info: For information on cluster > partitions, See URL: http://linux-ha.org/SplitBrain > heartbeat[2521]: 2010/12/08_16:50:02 WARN: Deadtime value may be too small. > heartbeat[2521]: 2010/12/08_16:50:02 info: See FAQ for information on tuning > deadtime. > heartbeat[2521]: 2010/12/08_16:50:02 info: URL: > http://linux-ha.org/FAQ#heavy_load > heartbeat[2521]: 2010/12/08_16:50:02 info: Link Node2:eth0 up. > heartbeat[2521]: 2010/12/08_16:50:02 WARN: Late heartbeat: Node Node2: > interval 781870 ms > heartbeat[2521]: 2010/12/08_16:50:02 info: Status update for node Node2: > status active > Any solution for this problem... Good idea: follow the links given in the log messages above. As stated there, Simple solution: multiple independent communication paths. Thorough solution: multiple independent communication paths, with redundancy within each path, plus some _independent_ stonith method. -- : Lars Ellenberg : LINBIT | Your Way to High Availability : DRBD/HA support and consulting http://www.linbit.com DRBD® and LINBIT® are registered trademarks of LINBIT, Austria. _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
