Re: [DRBD-user] DRBD 9: Bug or feature

Robert Altnoeder Mon, 05 Oct 2015 07:46:11 -0700

Taking a resource down on a primary node does not promote any othernodes, this has to be done either manually or by a cluster resource manager.

I was unable to reproduce the problem regarding the failed state changewhile taking down a resource.Is this a persistent problem (reproducible on your cluster every time,with every three-node resource)?

All the various drbdadm primary / up / down combinations I tried workedwith my setup (same DRBD9 commit on a custom built kernel 3.17-rc5).

If the problem persists, some more information on the exact steps youused to reproduce the problem and the details of the setup (kernelversion, distribution, etc.) would be interesting.


Best regards,
--
: Robert Altnöder
: Developer / Trainer
: LINBIT | Your Way to High Availability
:
: http://www.linbit.com

DRBD® and LINBIT® are registered trademarks of LINBIT.

On 09/16/2015 08:08 PM, [email protected] wrote:

While testing drbd 9.0,in a three node mesh, executing drbdadm down r0on the primary node does not promote other nodes to primary (no bigdeal as I'll do that with pacemaker) but drbdadm reports it cannotdetach the block device since it was primary.


/[root@mom3 chucks]# drbdadm status r0/
/This command will ignore resource names!/
/r0 role:Primary/
/  disk:UpToDate/
/  mom1 role:Secondary/
/    peer-disk:UpToDate/
/  mom2 role:Secondary/
/    peer-disk:UpToDate/

/[root@mom3 chucks]# drbdadm down r0/
/r0: State change failed: (-2) Need access to UpToDate data/
/additional info from kernel:/
/failed to detach/
/Command 'drbdsetup down r0' terminated with exit code 17/

even though now from another node the resource IS down
/[root@mom1 chucks]# drbdadm status/
/r0 role:Secondary/
/  disk:UpToDate/
/  mom2 role:Secondary/
/    peer-disk:UpToDate/
/  mom3 connection:Connecting/

and it seems to be unable to restore that it was primary, have topromote another node to primary and the old primary has to be takendown to secondary.



So if not a "bug" / "feature" what would be the appropriate sequence,
1) demote current primary to secondary
2) promote new node to primary
3) bring down old primary

I see some trouble brewing with this as I progress to pacemaker andbegin hard failover testing. (aka power / communication loss on 1 or 2nodes at a time)

-------------------------
particulars...............
---------------------------
drbd
version: 9.0.0 (api:2/proto:86-110)
GIT-hash: 360c65a035fc2dec2b93e839b5c7fae1201fa7d9
drbd-utils
Version: 8.9.3 (api:2)
GIT-hash: c11ba026bbbbc647b8112543df142f2185cb4b4b

[root@mom1 chucks]# drbdadm dump
# /etc/drbd.conf
global {
    usage-count yes;
}

common {
    net {
        protocol           C;
    }
}

# resource r0 on mom1: not ignored, not stacked
# defined at /etc/drbd.d/r0.res:1
resource r0 {
    on mom1 {
        node-id 0;
        device           /dev/drbd1 minor 1;
        disk             /dev/vg_submother1/local_storage;
        meta-disk        internal;
        address          ipv4 192.168.110.10:7789;
    }
    on mom2 {
        node-id 1;
        device           /dev/drbd1 minor 1;
        disk             /dev/vg_supermother2/local_storage;
        meta-disk        internal;
        address          ipv4 192.168.110.20:7789;
    }
    on mom3 {
        node-id 2;
        device           /dev/drbd1 minor 1;
        disk             /dev/vg_mom3/local_storage;
        meta-disk        internal;
        address          ipv4 192.168.110.30:7789;
    }
    connection-mesh {
        hosts mom1 mom2 mom3;
        net {
            use-rle       no;
        }
    }
}



_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user

_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user

Re: [DRBD-user] DRBD 9: Bug or feature

Reply via email to