Hello,
i'm using ocfs2 on top of lvm on top of drbd in a KVM virtual server and
fighing with kernel bugs (ocfs, virtio) for weeks.
Now that seems start to work, i have found something more that is weird.
Kernel: 2.6.31.2
Drbd: 8.3.4
When starting drbd, i see thousands of those errors:
[ 4962.247701] end_request: I/O error, dev vda, sector
0
[ 4962.263811] end_request: I/O error, dev vda, sector
0
[ 4962.276360] end_request: I/O error, dev vda, sector
0
Drbd doesn't make use of such device.
I found a bug regarding kernel and barriers here:
https://bugzilla.redhat.com/show_bug.cgi?id=514901
which seems to state that virtio_blk (the one i'm using) had some
problems like this and it seems related to barriers that are not
supported by virtio_blk.
The machine was working well, so i exclude errors on the disk (it would
probably be another device and another sector, not vda which is not used
and not sector zeo which contains the MBR which just works).
Playing with DRBD, i tried both with and without "no-disk-barrier" in
the disk section but this has no effect.
Assuming that
- the patches fixing that error should have gone into 2.6.31-RC1 (i'm
using 2.6.31.2)
- that with 2.6.30.5 + drbd 8.3.3 was working well
what i'm asking is: should i submit a bug report to LKML or is this
something related to DRBD ?
Attachment shows an excerpt of the log (drbdadm up resource && drbdadm
down all).
DRBD configuration is probably not relevant. I tried all sort of
combinations of the disk options in order to try to go on.
Max
P.S.: i should mention that the same configurations on a NON LVM
partition does work ok.
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.046856] block drbd4: Starting worker
thread (from cqueue [3570])
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.063299] block drbd4: disk( Diskless ->
Attaching )
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.079767] block drbd4: Found 4
transactions (8 active extents) in activity log.
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.092220] block drbd4: Method to ensure
write ordering: flush
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.114228] block drbd4: Backing device's
merge_bvec_fn() = ffffffffa0104d48
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.127910] block drbd4: max_segment_size (
= BIO size ) = 4096
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.143246] block drbd4: drbd_bm_resize
called with capacity == 10485368
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.169516] block drbd4: resync bitmap:
bits=1310671 words=20480
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.185229] block drbd4: size = 5120 MB
(5242684 KB)
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.200066] block drbd4: recounting of set
bits took additional 0 jiffies
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.216092] block drbd4: 0 KB (0 bits)
marked out-of-sync by on disk bit-map.
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.232319] block drbd4: disk( Attaching ->
UpToDate )
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.257360] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.272226] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.287225] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.302133] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.329368] block drbd4: conn( StandAlone
-> Unconnected )
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.344785] block drbd4: Starting receiver
thread (from drbd4_worker [8117])
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.361265] block drbd4: receiver
(re)started
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.376679] block drbd4: conn( Unconnected
-> WFConnection )
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.470330] block drbd4: Handshake
successful: Agreed network protocol version 90
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.483049] block drbd4: conn( WFConnection
-> WFReportParams )
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.505229] block drbd4: Starting asender
thread (from drbd4_receiver [8134])
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.517786] block drbd4:
data-integrity-alg: <not-used>
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.529886] block drbd4:
drbd_sync_handshake:
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.541555] block drbd4: self
77CDC2649F2A3E10:0000000000000000:2530EDF22AB3C2C6:BED99F1ED652F29B bits:0
flags:0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.564899] block drbd4: peer
921B33BCD7F28939:77CDC2649F2A3E11:2530EDF22AB3C2C7:BED99F1ED652F29B bits:25
flags:0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.588346] block drbd4: uuid_compare()=-1
by rule 50
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.600191] block drbd4: peer( Unknown ->
Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.623715] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.638991] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.670439] block drbd4: conn( WFBitMapT ->
WFSyncUUID )
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.684012] block drbd4: helper command:
/sbin/drbdadm before-resync-target minor-4
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.714731] block drbd4: helper command:
/sbin/drbdadm before-resync-target minor-4 exit code 0 (0x0)
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.738585] block drbd4: conn( WFSyncUUID
-> SyncTarget ) disk( UpToDate -> Inconsistent )
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.757108] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.757135] block drbd4: Began resync as
SyncTarget (will sync 100 KB [25 bits set]).
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.799570] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:28 sheet3-2 kernel: [ 5763.818428] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.834335] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.849540] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.864574] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.879470] block drbd4: Resync done (total
1 sec; paused 0 sec; 100 K/sec)
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.883415] block drbd4: conn( SyncTarget
-> Connected ) disk( Inconsistent -> UpToDate )
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.923349] block drbd4: helper command:
/sbin/drbdadm after-resync-target minor-4
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.942685] block drbd4: helper command:
/sbin/drbdadm after-resync-target minor-4 exit code 0 (0x0)
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.970463] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5763.985524] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.000564] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.015447] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.030425] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.045331] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.060252] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.075116] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.089965] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:29 sheet3-2 kernel: [ 5764.104896] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.561861] block drbd4: peer( Primary ->
Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.592290] block drbd4: short read
expecting header on sock: r=-512
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.614874] block drbd4: meta connection
shut down by peer.
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.630305] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.645234] block drbd4: asender terminated
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.649329] end_request: I/O error, dev
vda, sector 0
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.664166] block drbd4: Terminating
asender thread
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.678800] block drbd4: Connection closed
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.690406] block drbd4: conn(
Disconnecting -> StandAlone )
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.702764] block drbd4: receiver terminated
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.702806] block drbd4: disk( UpToDate ->
Diskless )
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.726513] block drbd4: Terminating
receiver thread
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.738077] block drbd4: drbd_bm_resize
called with capacity == 0
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.750090] block drbd4: worker terminated
Oct 14 15:44:34 sheet3-2 kernel: [ 5769.761640] block drbd4: Terminating worker
thread
_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user