Hello

I have a problem when i'm launching a drbdadm verify on my ressources

This is my config :

root@ifprdstor6b:~# cat /etc/drbd.d/DSA601.res
resource DSA601 {
  protocol C;

  startup {
    wfc-timeout 0;
  }

  disk {
    on-io-error detach;
  }

  syncer {
    rate 400M;
    verify-alg md5;
  }

  on ifprdstor6a {
    device    /dev/drbd1;
    disk      /dev/sda;
    address   10.13.1.1:7788;
    meta-disk internal;
  }

  on ifprdstor6b {
    device    /dev/drbd1;
    disk      /dev/sda;
    address   10.13.1.2:7788;
    meta-disk internal;
  }
}

I have 8 ressources with the same configuration

I'm using Drbd over IB network and this is my network config :

root@ifprdstor6b:~# cat /etc/network/interfaces
# This file describes the network interfaces available on your system
# and how to activate them. For more information, see interfaces(5).

# The loopback network interface
auto lo
iface lo inet loopback

# The primary network interface
allow-hotplug eth0
iface eth0 inet static
    address 10.12.20.2
    netmask 255.255.255.0
    network 10.12.20.0
    broadcast 10.1.2.255
    gateway 10.12.20.254
    # dns-* options are implemented by the resolvconf package, if installed
    dns-nameservers 10.1.2.33 10.1.2.34
    dns-search lampiris.local

allow-hotplug ib0
iface ib0 inet static
  address 10.13.1.2
  netmask 255.255.255.0


Now when I start a verify with : drbdadm verify DSA601, it's OK and the verify complete.

But when I start the verify on the second ressouce with : drbdadm verify DSA602 it's blocking :

This is the : /proc/drbd

root@ifprdstor6a:~# cat /proc/drbd
version: 8.3.11 (api:88/proto:86-96)
srcversion: F937DCB2E5D83C6CCE4A6C9

 1: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:0 nr:0 dw:0 dr:88 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
 2: cs:VerifyS ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
ns:0 nr:0 dw:0 dr:82296 al:0 bm:0 lo:1 pe:8706 ua:2048 ap:0 ep:1 wo:f oos:0 [>....................] verified: 0.1% (1713280/1713352)Mfinish: 386277:34:31 speed: 0 (0) want: 409,600 K/sec (stalled)
 3: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:0 nr:0 dw:0 dr:88 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
 4: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:0 nr:0 dw:0 dr:88 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
 5: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:0 nr:0 dw:0 dr:88 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
 6: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:0 nr:0 dw:0 dr:88 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
 7: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:0 nr:0 dw:0 dr:88 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0
 8: cs:Connected ro:Primary/Secondary ds:UpToDate/UpToDate C r-----
    ns:0 nr:0 dw:0 dr:88 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:f oos:0


The ressource 2 is freezed, I can't access to the volume and make an LVM LV or something else. But I can use the other ressources.

I have this entry on the syslog :

Dec 3 09:39:23 ifprdstor6a kernel: [61078.583849] block drbd2: [drbd2_worker/3080] sock_sendmsg time expired, ko = 4294957143 Dec 3 09:39:29 ifprdstor6a kernel: [61084.574556] block drbd2: [drbd2_worker/3080] sock_sendmsg time expired, ko = 4294957142
Dec  3 09:39:31 ifprdstor6a OpenSM[2668]: SM port is down#012

Someone had this problem ? Any solutions

Thank's in advance

Matthieu Lejeune







_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user

Reply via email to