Hi

I solved it myself when I found this site
http://www.hastexo.com/resources/hints-and-kinks/solve-drbd-split-brain-4-steps

It was a split brain issue.

Regards

Dave

On 25/09/14 18:19, Dave Bamford wrote:
Hi
I am new to these lists but I have set up 2 simple DRBD systems before. This one is giving me grief!

I set up 2 servers in a simple raid, running Debian Jessie and linked via a network cable. Both servers can ping each other and DRBD was working perfectly. So I shut down both servers and now I cannot get it to work again.

Here is the error I get
drbd.service - Distributed Replicated Block Device
   Loaded: loaded (/lib/systemd/system/drbd.service; enabled)
Active: failed (Result: exit-code) since Thu 2014-09-25 18:11:49 BST; 2min 21s ago Process: 2304 ExecStop=/sbin/drbdadm down all (code=exited, status=0/SUCCESS) Process: 2250 ExecStart=/sbin/drbdadm sh-b-pri all (code=exited, status=0/SUCCESS) Process: 2339 ExecStart=/sbin/drbdadm wait-connect all (code=exited, status=5) Process: 2314 ExecStart=/sbin/drbdadm adjust-with-progress all (code=exited, status=0/SUCCESS) Process: 2310 ExecStartPre=/sbin/drbdadm sh-nop (code=exited, status=0/SUCCESS)
 Main PID: 2339 (code=exited, status=5)

Sep 25 18:11:32 patty systemd[1]: Starting Distributed Replicated Block Device...
Sep 25 18:11:32 patty drbdadm[2314]: [
Sep 25 18:11:32 patty drbdadm[2314]: create res: r0
Sep 25 18:11:32 patty drbdadm[2314]: prepare disk: r0
Sep 25 18:11:34 patty drbdadm[2314]: adjust disk: r0
Sep 25 18:11:34 patty drbdadm[2314]: adjust net: r0
Sep 25 18:11:34 patty drbdadm[2314]: ]
Sep 25 18:11:34 patty drbdadm[2339]: outdated-wfc-timeout has to be shorter than degr-wfc-timeout Sep 25 18:11:34 patty drbdadm[2339]: outdated-wfc-timeout implicitly set to degr-wfc-timeout (15s) Sep 25 18:11:49 patty systemd[1]: drbd.service: main process exited, code=exited, status=5/NOTINSSTALLED Sep 25 18:11:49 patty systemd[1]: Failed to start Distributed Replicated Block Device. Sep 25 18:11:49 patty systemd[1]: Unit drbd.service entered failed state.

and here is the resource file
resource r0 {
        protocol C;
        startup {
                wfc-timeout 15;
                degr-wfc-timeout 15;
                become-primary-on patty;
        }
        net {
                cram-hmac-alg sha1;
                shared-secret "secret";
        }
        on patty {
                disk /dev/vg0/store;
                device /dev/drbd0;
                address 10.0.0.1:7788;
                meta-disk internal;
        }
        on selma {
                disk /dev/vg0/store;
                device /dev/drbd0;
                address 10.0.0.2:7788;
                meta-disk internal;
        }

Any help would be much appreciated
And this is the result of cat /proc/drbd from the primary
version: 8.4.3 (api:1/proto:86-101)
srcversion: 88927CDF07AEA4F7F2580B2
 0: cs:WFConnection ro:Secondary/Unknown ds:UpToDate/DUnknown C r-----
ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:56887168
Thanks

Dave




_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user


_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user

Reply via email to