Hello,

i'm using ocfs2 on top of lvm on top of drbd in a KVM virtual server and fighing with kernel bugs (ocfs, virtio) for weeks.

Now that seems start to work, i have found something more that is weird.

Kernel: 2.6.31.2
Drbd: 8.3.4

When starting drbd, i see thousands of those errors:

[ 4962.247701] end_request: I/O error, dev vda, sector 0 [ 4962.263811] end_request: I/O error, dev vda, sector 0 [ 4962.276360] end_request: I/O error, dev vda, sector 0
Drbd doesn't make use of such device.

I found a bug regarding kernel and barriers here:
https://bugzilla.redhat.com/show_bug.cgi?id=514901

which seems to state that virtio_blk (the one i'm using) had some problems like this and it seems related to barriers that are not supported by virtio_blk.

The machine was working well, so i exclude errors on the disk (it would probably be another device and another sector, not vda which is not used and not sector zeo which contains the MBR which just works).

Playing with DRBD, i tried both with and without "no-disk-barrier" in the disk section but this has no effect.
Assuming that
- the patches fixing that error should have gone into 2.6.31-RC1 (i'm using 2.6.31.2)
- that with 2.6.30.5 + drbd 8.3.3 was working well

what i'm asking is: should i submit a bug report to LKML or is this something related to DRBD ?

Attachment shows an excerpt of the log (drbdadm up resource && drbdadm down all).

DRBD configuration is probably not relevant. I tried all sort of combinations of the disk options in order to try to go on.

Max

P.S.: i should mention that the same configurations on a NON LVM partition does work ok.



Oct 14 15:44:28 sheet3-2 kernel: [ 5763.046856] block drbd4: Starting worker 
thread (from cqueue [3570])
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.063299] block drbd4: disk( Diskless -> 
Attaching ) 
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.079767] block drbd4: Found 4 
transactions (8 active extents) in activity log.
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.092220] block drbd4: Method to ensure 
write ordering: flush
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.114228] block drbd4: Backing device's 
merge_bvec_fn() = ffffffffa0104d48
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.127910] block drbd4: max_segment_size ( 
= BIO size ) = 4096
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.143246] block drbd4: drbd_bm_resize 
called with capacity == 10485368
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.169516] block drbd4: resync bitmap: 
bits=1310671 words=20480
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.185229] block drbd4: size = 5120 MB 
(5242684 KB)
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.200066] block drbd4: recounting of set 
bits took additional 0 jiffies
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.216092] block drbd4: 0 KB (0 bits) 
marked out-of-sync by on disk bit-map.
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.232319] block drbd4: disk( Attaching -> 
UpToDate ) 
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.257360] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.272226] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.287225] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.302133] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.329368] block drbd4: conn( StandAlone 
-> Unconnected ) 
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.344785] block drbd4: Starting receiver 
thread (from drbd4_worker [8117])
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.361265] block drbd4: receiver 
(re)started
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.376679] block drbd4: conn( Unconnected 
-> WFConnection ) 
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.470330] block drbd4: Handshake 
successful: Agreed network protocol version 90
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.483049] block drbd4: conn( WFConnection 
-> WFReportParams ) 
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.505229] block drbd4: Starting asender 
thread (from drbd4_receiver [8134])
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.517786] block drbd4: 
data-integrity-alg: <not-used>
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.529886] block drbd4: 
drbd_sync_handshake:
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.541555] block drbd4: self 
77CDC2649F2A3E10:0000000000000000:2530EDF22AB3C2C6:BED99F1ED652F29B bits:0 
flags:0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.564899] block drbd4: peer 
921B33BCD7F28939:77CDC2649F2A3E11:2530EDF22AB3C2C7:BED99F1ED652F29B bits:25 
flags:0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.588346] block drbd4: uuid_compare()=-1 
by rule 50
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.600191] block drbd4: peer( Unknown -> 
Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate ) 
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.623715] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.638991] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.670439] block drbd4: conn( WFBitMapT -> 
WFSyncUUID ) 
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.684012] block drbd4: helper command: 
/sbin/drbdadm before-resync-target minor-4
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.714731] block drbd4: helper command: 
/sbin/drbdadm before-resync-target minor-4 exit code 0 (0x0)
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.738585] block drbd4: conn( WFSyncUUID 
-> SyncTarget ) disk( UpToDate -> Inconsistent ) 
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.757108] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.757135] block drbd4: Began resync as 
SyncTarget (will sync 100 KB [25 bits set]).
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.799570] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.818428] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.834335] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.849540] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.864574] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.879470] block drbd4: Resync done (total 
1 sec; paused 0 sec; 100 K/sec)
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.883415] block drbd4: conn( SyncTarget 
-> Connected ) disk( Inconsistent -> UpToDate ) 
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.923349] block drbd4: helper command: 
/sbin/drbdadm after-resync-target minor-4
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.942685] block drbd4: helper command: 
/sbin/drbdadm after-resync-target minor-4 exit code 0 (0x0)
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.970463] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.985524] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.000564] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.015447] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.030425] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.045331] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.060252] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.075116] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.089965] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.104896] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.561861] block drbd4: peer( Primary -> 
Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown ) 
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.592290] block drbd4: short read 
expecting header on sock: r=-512
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.614874] block drbd4: meta connection 
shut down by peer.
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.630305] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.645234] block drbd4: asender terminated
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.649329] end_request: I/O error, dev 
vda, sector 0
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.664166] block drbd4: Terminating 
asender thread
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.678800] block drbd4: Connection closed
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.690406] block drbd4: conn( 
Disconnecting -> StandAlone ) 
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.702764] block drbd4: receiver terminated
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.702806] block drbd4: disk( UpToDate -> 
Diskless ) 
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.726513] block drbd4: Terminating 
receiver thread
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.738077] block drbd4: drbd_bm_resize 
called with capacity == 0
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.750090] block drbd4: worker terminated
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.761640] block drbd4: Terminating worker 
thread
_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user

Reply via email to