Hi,

On Tue, Feb 19, 2008 at 05:33:39PM +0100, Schmidt, Florian wrote:
> Hi readers,
>
> I caused a split brain on my testing machine, to see how it would react.
> I disabled on both machines the eth1-interface, over which the heartbeat
> happened.
>
> So the DRBD still was connected (over the eth0-interface) but heartbeat
> was split-brained.
>
> After I saw what I expected (heartbeat failed to mount drbd on the
> secondary node, because the primary was still alive) I enabled the
> interfaces again and expected the nodes to recover the situation
> somehow, but this failed.
>
> I can restart one or both heartbeat-instances now, but they aren't able
> to connect to each other :(

Please provide the logs?

Thanks,

Dejan

> Following crm_mon -1 on the nodes:
>
>
> First node (nodekrz)
>
> ============
> Last updated: Tue Feb 19 17:31:03 2008
> Current DC: noderz (91d062c3-ad0a-4c24-b759-acada7f19101)
> 2 Nodes configured.
> 2 Resources configured.
> ============
>
> Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): online
> Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): OFFLINE
>
> Master/Slave Set: drbd_master_slave
>     drbd_r0:0 (heartbeat::ocf:drbd): Master noderz
>     drbd_r0:1 (heartbeat::ocf:drbd): Stopped
> Resource Group: Filesystem_and_IP
>     Filesystem (heartbeat::ocf:Filesystem): Started noderz
>     Cluster_IP (heartbeat::ocf:IPaddr): Started noderz
>
>
> Second node: (noderz)
>
> ============
> Last updated: Tue Feb 19 17:30:17 2008
> Current DC: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d)
> 2 Nodes configured.
> 2 Resources configured.
> ============
>
> Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): OFFLINE
> Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): online
>
> Master/Slave Set: drbd_master_slave
>     drbd_r0:0 (heartbeat::ocf:drbd): Master nodekrz
>     drbd_r0:1 (heartbeat::ocf:drbd): Stopped
> Resource Group: Filesystem_and_IP
>     Filesystem (heartbeat::ocf:Filesystem): Started nodekrz
>     Cluster_IP (heartbeat::ocf:IPaddr): Started nodekrz
>
> They are able to ping each other over the heartbeat-link.
>
> Like I said, restarting heartbeat on one or both nodes at the same time
> doesn't change anything.
>
> So what to do to solve this situation?
>
> Thanks for replies
>
> Florian
>
>
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
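
For anyone comparing the two crm_mon -1 outputs quoted above, here is a
minimal sketch of how the disagreement could be checked mechanically. It is
not part of heartbeat; the function names (parse_crm_mon, looks_split) and
the regular expressions are hypothetical and assume the output format shown
in this thread. It only reports that each node elects a different DC and
marks the other's DC as OFFLINE. It does not explain why they fail to rejoin,
which is what the logs are needed for.

```python
import re

# Patterns assume the crm_mon -1 layout quoted in this thread (an assumption).
DC_RE = re.compile(r"Current DC:\s+(\S+)")
NODE_RE = re.compile(r"Node:\s+(\S+)\s+\([0-9a-f-]+\):\s+(\S+)")


def parse_crm_mon(text):
    """Return (dc_name, {node_name: status}) parsed from crm_mon -1 text."""
    dc = DC_RE.search(text)
    nodes = {name: status.lower() for name, status in NODE_RE.findall(text)}
    return (dc.group(1) if dc else None), nodes


def looks_split(view_a, view_b):
    """True if the two views name different DCs and each sees the other's DC offline."""
    dc_a, nodes_a = parse_crm_mon(view_a)
    dc_b, nodes_b = parse_crm_mon(view_b)
    if dc_a is None or dc_b is None:
        return False
    return (
        dc_a != dc_b
        and nodes_a.get(dc_b) == "offline"
        and nodes_b.get(dc_a) == "offline"
    )


# The relevant lines from the two outputs in the original post.
VIEW_A = """\
Current DC: noderz (91d062c3-ad0a-4c24-b759-acada7f19101)
Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): online
Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): OFFLINE
"""

VIEW_B = """\
Current DC: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d)
Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): OFFLINE
Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): online
"""

if __name__ == "__main__":
    print("split view detected:", looks_split(VIEW_A, VIEW_B))
```

In practice the two texts would come from running crm_mon -1 on each node
and saving the output; the embedded strings are only there to keep the
sketch self-contained.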
