Hi readers,
I deliberately caused a split brain on my test setup to see how the cluster
would react.
On both machines I disabled the eth1 interface, which carries the heartbeat
traffic.
DRBD therefore stayed connected (over the eth0 interface), but heartbeat
was split-brained.
After I saw what I expected (heartbeat failed to mount the DRBD device on the
secondary node, because the primary was still alive), I enabled the
interfaces again and expected the nodes to recover from the situation
somehow, but they did not.
I can restart one or both heartbeat instances now, but they are still not able
to connect to each other :(
Here is the output of crm_mon -1 on each node:
First node (nodekrz):
============
Last updated: Tue Feb 19 17:31:03 2008
Current DC: noderz (91d062c3-ad0a-4c24-b759-acada7f19101)
2 Nodes configured.
2 Resources configured.
============
Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): online
Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): OFFLINE
Master/Slave Set: drbd_master_slave
    drbd_r0:0 (heartbeat::ocf:drbd): Master noderz
    drbd_r0:1 (heartbeat::ocf:drbd): Stopped
Resource Group: Filesystem_and_IP
    Filesystem (heartbeat::ocf:Filesystem): Started noderz
    Cluster_IP (heartbeat::ocf:IPaddr): Started noderz
Second node (noderz):
============
Last updated: Tue Feb 19 17:30:17 2008
Current DC: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d)
2 Nodes configured.
2 Resources configured.
============
Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): OFFLINE
Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): online
Master/Slave Set: drbd_master_slave
    drbd_r0:0 (heartbeat::ocf:drbd): Master nodekrz
    drbd_r0:1 (heartbeat::ocf:drbd): Stopped
Resource Group: Filesystem_and_IP
    Filesystem (heartbeat::ocf:Filesystem): Started nodekrz
    Cluster_IP (heartbeat::ocf:IPaddr): Started nodekrz
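
To make the disagreement between the two outputs above explicit, here is a small Python sketch that compares them. It is only an illustration: the function names parse_crm_mon and views_disagree are made up for this post, and the parsing assumes the "Current DC:" and "Node:" lines look exactly like the ones pasted above (other heartbeat versions may print them differently).

```python
import re

# The header and node lines from the two crm_mon -1 outputs in this post.
FIRST = """\
Current DC: noderz (91d062c3-ad0a-4c24-b759-acada7f19101)
Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): online
Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): OFFLINE
"""
SECOND = """\
Current DC: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d)
Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): OFFLINE
Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): online
"""

def parse_crm_mon(text):
    """Return (dc_name, {node_name: status}) from crm_mon -1 text.

    Hypothetical helper: assumes the line layout shown in this post.
    """
    dc = None
    nodes = {}
    for raw in text.splitlines():
        line = raw.strip()
        m = re.match(r"Current DC:\s+(\S+)", line)
        if m:
            dc = m.group(1)
            continue
        m = re.match(r"Node:\s+(\S+)\s+\([^)]*\):\s+(\S+)", line)
        if m:
            # Normalize case, since the output mixes "online" and "OFFLINE".
            nodes[m.group(1)] = m.group(2).lower()
    return dc, nodes

def views_disagree(view_a, view_b):
    """True if the views name different DCs or differ on any node's status."""
    dc_a, nodes_a = view_a
    dc_b, nodes_b = view_b
    if dc_a != dc_b:
        return True
    all_nodes = set(nodes_a) | set(nodes_b)
    return any(nodes_a.get(n) != nodes_b.get(n) for n in all_nodes)

view_first = parse_crm_mon(FIRST)
view_second = parse_crm_mon(SECOND)
split = views_disagree(view_first, view_second)
```

With the two outputs above, each side reports a different DC and considers the other node offline, so split ends up True. That is the state I cannot get out of.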
They are able to ping each other over the heartbeat link.
As I said, restarting heartbeat on one node, or on both at the same time,
doesn't change anything.
So what can I do to resolve this situation?
Thanks for any replies,
Florian
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems