Re: [DRBD-user] Drbd/pacemaker active/passive san failover

2016-09-20 Thread Marco Marino
Furthermore there are logs from the secondary node: http://pastebin.com/A2ySXDCB Please compare time. It seems that also on the secondary node drbd goes to diskless mode. Why? 2016-09-20 8:44 GMT+02:00 Marco Marino : > Hi, logs can be found here: http://pastebin.com/BGR33jN6 > > @digimer: >

Re: [DRBD-user] Drbd/pacemaker active/passive san failover

2016-09-20 Thread Igor Cicimov
On 20 Sep 2016 5:00 pm, "Marco Marino" wrote: > > Furthermore there are logs from the secondary node: > > http://pastebin.com/A2ySXDCB > > > Please compare time. It seems that also on the secondary node drbd goes to diskless mode. Why? > In the secondary log you can see I/O errors too: Sep 7 19:

Re: [DRBD-user] Drbd/pacemaker active/passive san failover

2016-09-20 Thread Marco Marino
mmm... This means that I did not understand this policy. I thought that I/O error happens only on the primary node, but it seems that all nodes become diskless in this case. Why? Basically I have an I/O error on the primary node because I wrongly removed the ssd (cachecade) disk. Why also the second

Re: [DRBD-user] Drbd/pacemaker active/passive san failover

2016-09-20 Thread Lars Ellenberg
On Tue, Sep 20, 2016 at 11:13:38AM +0200, Marco Marino wrote: > mmm... This means that I did not understand this policy. I thought that I/O > error happens only on the primary node, but it seems that all nodes become > diskless in this case. Why? Basically I have an I/O error on the primary > node b

Re: [DRBD-user] Drbd/pacemaker active/passive san failover

2016-09-20 Thread Igor Cicimov
On Tue, Sep 20, 2016 at 7:13 PM, Marco Marino wrote: > mmm... This means that I did not understand this policy. I thought that I/O > error happens only on the primary node, but it seems that all nodes become > diskless in this case. Why? Basically I have an I/O error on the primary > node because

[DRBD-user] DRBD fencing prevents resource promotion in active/passive cluster

2016-09-20 Thread Auer, Jens
Hi, I am using a drbd device in an active/passive cluster setup with pacemaker. We have dedicated connections for corosync heartbeats, drbd and a 10GB data connection: - A bonded 10GB network card for data traffic that will be accessed via a virtual ip managed by pacemaker in 192.168.120.1/24.

[DRBD-user] drbd fencing stops promotion to master even when network connection is up

2016-09-20 Thread Auer, Jens
Hi, I am using a drbd device in an active/passive cluster setup with pacemaker. We have dedicated connections for corosync heartbeats, drbd and a 10GB data connection: - A bonded 10GB network card for data traffic that will be accessed via a virtual ip managed by pacemaker in 192.168.120.1/24.

Re: [DRBD-user] DRBD fencing prevents resource promotion in active/passive cluster

2016-09-20 Thread Digimer
On 20/09/16 07:07 AM, Auer, Jens wrote: > Hi, > > I am using a drbd device in an active/passive cluster setup with pacemaker. > We have dedicated connections for corosync heartbeats, drbd and a 10GB data > connection: > - A bonded 10GB network card for data traffic that will be accessed via a >

Re: [DRBD-user] drbd fencing stops promotion to master even when network connection is up

2016-09-20 Thread Auer, Jens
Hi, I've updated all drbd packages to the latest versions: MDA1PFP-S01 11:52:35 2551 0 ~ # yum list "*drbd*" Loaded plugins: langpacks, product-id, search-disabled-repos, subscription-manager Installed Packages drbd.x86_64

Re: [DRBD-user] DRBD fencing prevents resource promotion in active/passive cluster

2016-09-20 Thread Auer, Jens
Hi, > Don't disable fencing! > You need to configure and test stonith in pacemaker. Once that's > working, then you set DRBD's fencing to 'resource-and-stonith;' and > configure the 'crm-{un,}fence-handler.sh' un/fence handlers. > With this, if a node fails (and no, redundant network links is no

Re: [DRBD-user] DRBD fencing prevents resource promotion in active/passive cluster

2016-09-20 Thread Lars Ellenberg
On Tue, Sep 20, 2016 at 12:25:55PM +0000, Auer, Jens wrote: > Hi, > > > Don't disable fencing! > > > You need to configure and test stonith in pacemaker. Once that's > > working, then you set DRBD's fencing to 'resource-and-stonith;' and > > configure the 'crm-{un,}fence-handler.sh' un/fence hand

Re: [DRBD-user] Drbd/pacemaker active/passive san failover

2016-09-20 Thread Marco Marino
As told by Lars Ellenberg, one first problem with the configuration http://pastebin.com/r3N1gzwx is that on-io-error should be on-io-error call-local-io-error; and not detach. Furthermore, in the configuration there is also another error: fencing should be fencing resource-and-stonith; and not reso
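For readers following the thread, the two settings named in the message above could be sketched as a resource fragment like the one below. This is only an illustration under assumptions, not the poster's actual configuration: the resource name r0 is hypothetical, DRBD 8.4 syntax is assumed (there `fencing` lives in the `disk` section; in DRBD 9 it moved to `net`), and the handler paths are the ones typically shipped with the drbd-utils packages (`crm-fence-peer.sh` / `crm-unfence-peer.sh`), which may differ on your distribution.

```
# Hypothetical resource "r0"; DRBD 8.4 syntax assumed.
resource r0 {
    disk {
        # On a lower-level I/O error, invoke the local-io-error handler
        # instead of silently detaching the backing device.
        on-io-error call-local-io-error;

        # Freeze I/O and call the fence-peer handler when the peer is lost.
        # Only meaningful once stonith is configured and tested in pacemaker.
        fencing resource-and-stonith;
    }
    handlers {
        # Stock example from the packaged global_common.conf (assumed here):
        # notify, then take the node down hard.
        local-io-error "/usr/lib/drbd/notify-io-error.sh; /usr/lib/drbd/notify-emergency-shutdown.sh; echo o > /proc/sysrq-trigger ; halt -f";

        # Pacemaker integration scripts (paths are an assumption).
        fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
        after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
    }
}
```

Note that with a local-io-error handler like the stock one, a node that hits an I/O error powers itself off, so the handler should be chosen with the failure scenario in mind.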

Re: [DRBD-user] Drbd/pacemaker active/passive san failover

2016-09-20 Thread Lars Ellenberg
On Tue, Sep 20, 2016 at 05:17:53PM +0200, Marco Marino wrote: > As told by Lars Ellenberg, one first problem with the configuration > http://pastebin.com/r3N1gzwx > is that on-io-error should be > on-io-error call-local-io-error; > and not detach. Furthermore, in the configuration there is also ano

Re: [DRBD-user] Drbd/pacemaker active/passive san failover

2016-09-20 Thread Igor Cicimov
On Wed, Sep 21, 2016 at 1:17 AM, Marco Marino wrote: > As told by Lars Ellenberg, one first problem with the configuration > http://pastebin.com/r3N1gzwx > is that on-io-error should be > on-io-error call-local-io-error; > And in your specific case that would have shut down both servers since bo