Hi,

On Tue, Feb 19, 2008 at 05:33:39PM +0100, Schmidt, Florian wrote:
> Hi readers,
> 
> I caused a split brain on my test machines to see how the cluster would
> react. On both machines I disabled the eth1 interface, which carries the
> heartbeat traffic.
> 
> So DRBD was still connected (over the eth0 interface), but heartbeat
> was split-brained.
> 
> After I saw what I expected (heartbeat failed to mount DRBD on the
> secondary node, because the primary was still alive), I enabled the
> interfaces again and expected the nodes to recover from the situation
> somehow, but this failed.
> 
> I can restart one or both heartbeat-instances now, but they aren't able
> to connect to each other :(

Please provide the logs.

Thanks,

Dejan

> Here is the output of crm_mon -1 on the nodes:
> 
> 
> First node (nodekrz)
> 
> ============
> Last updated: Tue Feb 19 17:31:03 2008
> Current DC: noderz (91d062c3-ad0a-4c24-b759-acada7f19101)
> 2 Nodes configured.
> 2 Resources configured.
> ============
> 
> Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): online
> Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): OFFLINE
> 
> Master/Slave Set: drbd_master_slave
>     drbd_r0:0   (heartbeat::ocf:drbd):  Master noderz
>     drbd_r0:1   (heartbeat::ocf:drbd):  Stopped
> Resource Group: Filesystem_and_IP
>     Filesystem  (heartbeat::ocf:Filesystem):    Started noderz
>     Cluster_IP  (heartbeat::ocf:IPaddr):        Started noderz
> 
> 
> Second node: (noderz)
> 
> ============
> Last updated: Tue Feb 19 17:30:17 2008
> Current DC: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d)
> 2 Nodes configured.
> 2 Resources configured.
> ============
> 
> Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): OFFLINE
> Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): online
> 
> Master/Slave Set: drbd_master_slave
>     drbd_r0:0   (heartbeat::ocf:drbd):  Master nodekrz
>     drbd_r0:1   (heartbeat::ocf:drbd):  Stopped
> Resource Group: Filesystem_and_IP
>     Filesystem  (heartbeat::ocf:Filesystem):    Started nodekrz
>     Cluster_IP  (heartbeat::ocf:IPaddr):        Started nodekrz
> 
> They are able to ping each other over the heartbeat-link.
> 
> Like I said, restarting heartbeat on one or both nodes at the same time
> doesn't change anything.
> 
> So what can I do to resolve this situation?
> 
> Thanks for any replies
> 
> Florian
> 
> 
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
