Hi readers,

i caused a split brain on my testing machine, to see how it would react.
I disabled on both machines the eth1-interface, over which the heartbeat
happened.

So the DRBD still was connected (over the eth0-interface) but, hearbeat
was split-brained.

After I saw, what I expected (heartbeat failed to mount drbd on the
secondary node, because the primary was still alive) I enabled the
interfaces again and expected the nodes to recover the situation
somehow..but this failed

I can restart one or both heartbeat-instances now, but they aren't able
to connect to each other :(

Following crm_mon -1 on the nodes:


First node (nodekrz)

============
Last updated: Tue Feb 19 17:31:03 2008
Current DC: noderz (91d062c3-ad0a-4c24-b759-acada7f19101)
2 Nodes configured.
2 Resources configured.
============

Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): online
Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): OFFLINE

Master/Slave Set: drbd_master_slave
    drbd_r0:0   (heartbeat::ocf:drbd):  Master noderz
    drbd_r0:1   (heartbeat::ocf:drbd):  Stopped
Resource Group: Filesystem_and_IP
    Filesystem  (heartbeat::ocf:Filesystem):    Started noderz
    Cluster_IP  (heartbeat::ocf:IPaddr):        Started noderz


Second node: (noderz)

============
Last updated: Tue Feb 19 17:30:17 2008
Current DC: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d)
2 Nodes configured.
2 Resources configured.
============

Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): OFFLINE
Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): online

Master/Slave Set: drbd_master_slave
    drbd_r0:0   (heartbeat::ocf:drbd):  Master nodekrz
    drbd_r0:1   (heartbeat::ocf:drbd):  Stopped
Resource Group: Filesystem_and_IP
    Filesystem  (heartbeat::ocf:Filesystem):    Started nodekrz
    Cluster_IP  (heartbeat::ocf:IPaddr):        Started nodekrz

They are able to ping each other over the heartbeat-link.

Like I said, restarting heartbeat on one or both nodes at the same time
doesn't change anything.

So what to do to solve this situation?

Thanks for replies

Florian


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to