Hi again,
So I intentionally broke this small raid6 fs on a VM to learn recovery
strategies for another much bigger raid6 I have running (which also
suffered a drive failure).
Basically I zeroed out one of the drives (vdd) from under the running
vm. Then ran an md5sum on a file on the fs to trigger some detection of
data inconsistency. I ran a scrub, which completed "ok". Then rebooted.
Now trying to mount the filesystem in degraded mode leads to a kernel crash.
I'm using kernel 4.3-rc6 and btrfs-progs 4.2.3
Linux ubuntu 4.3.0-040300rc6-generic #201510182030 SMP Mon Oct 19
00:31:41 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
Label: none uuid: aee28657-3ce0-4efc-9cd3-cc7c58782af3
Total devices 1 FS bytes used 1.87GiB
devid 1 size 9.52GiB used 2.89GiB path /dev/vda2
warning devid 3 not found already
Label: 'boxofkittens' uuid: 4957afbe-e2cb-410c-8d45-3850840898f2
Total devices 9 FS bytes used 3.56GiB
devid 1 size 1022.00MiB used 716.19MiB path /dev/vdb1
devid 2 size 1022.00MiB used 716.19MiB path /dev/vdc1
devid 4 size 1022.00MiB used 716.19MiB path /dev/vde1
devid 5 size 1022.00MiB used 716.19MiB path /dev/vdf1
devid 6 size 1022.00MiB used 716.19MiB path /dev/vdg1
devid 7 size 2.00GiB used 1.70GiB path /dev/vdh1
devid 8 size 3.00GiB used 1.70GiB path /dev/vdi1
devid 9 size 3.00GiB used 1.70GiB path /dev/vdj1
*** Some devices missing
btrfs-progs v4.2.3
mount -o degraded /dev/vdb1 /mnt/boxofkittens
[ 36.426731] ------------[ cut here ]------------
[ 36.427547] kernel BUG at
/home/kernel/COD/linux/fs/btrfs/extent_io.c:2044!
[ 36.428686] invalid opcode: 0000 [#1] SMP
[ 36.429438] Modules linked in: snd_hda_codec_generic iosf_mbi
crct10dif_pclmul crc32_pclmul ppdev aesni_intel aes_x86_64 lrw gf128mul
glue_helper ablk_helper cryptd input_leds joydev snd_hda_intel serio_raw
snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd soundcore
i2c_piix4 parport_pc parport 8250_fintek mac_hid autofs4 btrfs xor
raid6_pq cirrus ttm psmouse drm_kms_helper syscopyarea sysfillrect
sysimgblt fb_sys_fops drm floppy pata_acpi
[ 36.436782] CPU: 0 PID: 86 Comm: kworker/u2:2 Not tainted
4.3.0-040300rc6-generic #201510182030
[ 36.438138] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996),
BIOS 1.8.1-20150318_183358- 04/01/2014
[ 36.439648] Workqueue: btrfs-endio btrfs_endio_helper [btrfs]
[ 36.440617] task: ffff880035b4e200 ti: ffff880035564000 task.ti:
ffff880035564000
[ 36.441778] RIP: 0010:[<ffffffffc0169d39>] [<ffffffffc0169d39>]
repair_io_failure+0x1a9/0x1f0 [btrfs]
[ 36.443287] RSP: 0018:ffff880035567c20 EFLAGS: 00010246
[ 36.444128] RAX: ffff88003c7ad000 RBX: ffff8800363dc7d0 RCX:
0000000000000000
[ 36.445227] RDX: 0000000000001000 RSI: 0000000000027000 RDI:
ffff8800388ce100
[ 36.446315] RBP: ffff880035567c78 R08: ffffea0000ddb640 R09:
0000000000000000
[ 36.447397] R10: ffff8800363dc980 R11: ffff88003bd49b00 R12:
0000000000027000
[ 36.448479] R13: ffff8800388ce000 R14: ffff8800363dc980 R15:
ffff8800363dc838
[ 36.449553] FS: 0000000000000000(0000) GS:ffff88003fc00000(0000)
knlGS:0000000000000000
[ 36.450766] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 36.451641] CR2: 0000000002015008 CR3: 000000003c1be000 CR4:
00000000000406f0
[ 36.452709] Stack:
[ 36.453026] 0000000000027000 0000000035567c48 ffffea0000ddb640
00000002b1047000
[ 36.454211] 0000000000000000 0000000000000000 ffff8800363dc7d0
0000000000027000
[ 36.455513] ffff8800388ce000 ffff8800363dc980 ffff8800363dc838
ffff880035567ce8
[ 36.456663] Call Trace:
[ 36.457043] [<ffffffffc0169fed>] clean_io_failure+0x18d/0x1a0 [btrfs]
[ 36.458002] [<ffffffffc016a3ea>] end_bio_extent_readpage+0x30a/0x560
[btrfs]
[ 36.459662] [<ffffffffc016a0e0>] ? btrfs_create_repair_bio+0xe0/0xe0
[btrfs]
[ 36.460715] [<ffffffff81371f40>] bio_endio+0x40/0x60
[ 36.461459] [<ffffffffc013fbbc>] end_workqueue_fn+0x3c/0x40 [btrfs]
[ 36.462387] [<ffffffffc0178ff0>] normal_work_helper+0xc0/0x270 [btrfs]
[ 36.463360] [<ffffffffc0179272>] btrfs_endio_helper+0x12/0x20 [btrfs]
[ 36.464314] [<ffffffff8109251e>] process_one_work+0x14e/0x3d0
[ 36.465158] [<ffffffff81092bca>] worker_thread+0x11a/0x470
[ 36.466264] [<ffffffff81092ab0>] ? rescuer_thread+0x310/0x310
[ 36.467154] [<ffffffff81098319>] kthread+0xc9/0xe0
[ 36.467863] [<ffffffff81098250>] ? kthread_park+0x60/0x60
[ 36.468791] [<ffffffff817b3fcf>] ret_from_fork+0x3f/0x70
[ 36.470022] [<ffffffff81098250>] ? kthread_park+0x60/0x60
[ 36.471334] Code: fe ff ff 48 89 df 41 bf fb ff ff ff e8 21 70 20 c1
31 f6 4c 89 ef e8 07 eb 00 00 e9 d1 fe ff ff 41 bf fb ff ff ff e9 c6 fe
ff ff <0f> 0b 0f 0b 49 8b 4d 30 49 8b b6 58 fe ff ff 48 83 c1 10 48 85
[ 36.475278] RIP [<ffffffffc0169d39>] repair_io_failure+0x1a9/0x1f0
[btrfs]
[ 36.476256] RSP <ffff880035567c20>
[ 36.476783] ---[ end trace a06ea60748bbedae ]---
[ 36.481369] BUG: unable to handle kernel paging request at
ffffffffffffffd8
[ 36.484441] IP: [<ffffffff81098a90>] kthread_data+0x10/0x20
[ 36.486710] PGD 1c13067 PUD 1c15067 PMD 0
[ 36.488690] Oops: 0000 [#2] SMP
[ 36.490516] Modules linked in: snd_hda_codec_generic iosf_mbi
crct10dif_pclmul crc32_pclmul ppdev aesni_intel aes_x86_64 lrw gf128mul
glue_helper ablk_helper cryptd input_leds joydev snd_hda_intel serio_raw
snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd soundcore
i2c_piix4 parport_pc parport 8250_fintek mac_hid autofs4 btrfs xor
raid6_pq cirrus ttm psmouse drm_kms_helper syscopyarea sysfillrect
sysimgblt fb_sys_fops drm floppy pata_acpi
[ 36.503559] CPU: 0 PID: 86 Comm: kworker/u2:2 Tainted: G D
4.3.0-040300rc6-generic #201510182030
[ 36.507074] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996),
BIOS 1.8.1-20150318_183358- 04/01/2014
[ 36.510483] task: ffff880035b4e200 ti: ffff880035564000 task.ti:
ffff880035564000
[ 36.513383] RIP: 0010:[<ffffffff81098a90>] [<ffffffff81098a90>]
kthread_data+0x10/0x20
[ 36.516544] RSP: 0018:ffff880035567918 EFLAGS: 00010002
[ 36.519266] RAX: 0000000000000000 RBX: 0000000000000000 RCX:
0000000000000000
[ 36.522101] RDX: ffff88003e002090 RSI: 0000000000000000 RDI:
ffff880035b4e200
[ 36.524889] RBP: ffff880035567918 R08: 000000087e7534f3 R09:
ffff88003fc1b3b0
[ 36.527653] R10: ffffffff8137d322 R11: 0000000000000000 R12:
ffff880035b4e200
[ 36.531102] R13: 0000000000016980 R14: 0000000000000000 R15:
0000000000000000
[ 36.533401] FS: 0000000000000000(0000) GS:ffff88003fc00000(0000)
knlGS:0000000000000000
[ 36.535451] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 36.538332] CR2: 0000000000000028 CR3: 000000003dbd0000 CR4:
00000000000406f0
[ 36.540780] Stack:
[ 36.542302] ffff880035567930 ffffffff810933f1 ffff88003fc16980
ffff880035567978
[ 36.544435] ffffffff817b0154 ffff880035567970 ffff880035b4e200
ffff880035568000
[ 36.546827] ffff880035b4e648 0000000000000246 ffff880035b4e200
ffff88003e2e8000
[ 36.548918] Call Trace:
[ 36.550402] [<ffffffff810933f1>] wq_worker_sleeping+0x11/0x90
[ 36.552397] [<ffffffff817b0154>] __schedule+0x674/0x920
[ 36.554194] [<ffffffff817b0433>] schedule+0x33/0x80
[ 36.555970] [<ffffffff8107d35b>] do_exit+0x67b/0xaf0
[ 36.557724] [<ffffffff81019cf1>] oops_end+0xa1/0xd0
[ 36.559459] [<ffffffff8101a1fb>] die+0x4b/0x70
[ 36.561117] [<ffffffff810171cd>] do_trap+0x13d/0x150
[ 36.562875] [<ffffffff81017767>] do_error_trap+0x77/0xe0
[ 36.564992] [<ffffffffc0169d39>] ? repair_io_failure+0x1a9/0x1f0 [btrfs]
[ 36.567131] [<ffffffff811d02e9>] ? __slab_free+0x59/0x280
[ 36.568919] [<ffffffff81371f40>] ? bio_endio+0x40/0x60
[ 36.570720] [<ffffffff81017a40>] do_invalid_op+0x20/0x30
[ 36.573180] [<ffffffff817b574e>] invalid_op+0x1e/0x30
[ 36.575082] [<ffffffffc0169d39>] ? repair_io_failure+0x1a9/0x1f0 [btrfs]
[ 36.577043] [<ffffffffc0169fed>] clean_io_failure+0x18d/0x1a0 [btrfs]
[ 36.578948] [<ffffffffc016a3ea>] end_bio_extent_readpage+0x30a/0x560
[btrfs]
[ 36.580921] [<ffffffffc016a0e0>] ? btrfs_create_repair_bio+0xe0/0xe0
[btrfs]
[ 36.582870] [<ffffffff81371f40>] bio_endio+0x40/0x60
[ 36.584589] [<ffffffffc013fbbc>] end_workqueue_fn+0x3c/0x40 [btrfs]
[ 36.586553] [<ffffffffc0178ff0>] normal_work_helper+0xc0/0x270 [btrfs]
[ 36.588404] [<ffffffffc0179272>] btrfs_endio_helper+0x12/0x20 [btrfs]
[ 36.590237] [<ffffffff8109251e>] process_one_work+0x14e/0x3d0
[ 36.591996] [<ffffffff81092bca>] worker_thread+0x11a/0x470
[ 36.593727] [<ffffffff81092ab0>] ? rescuer_thread+0x310/0x310
[ 36.595663] [<ffffffff81098319>] kthread+0xc9/0xe0
[ 36.597613] [<ffffffff81098250>] ? kthread_park+0x60/0x60
[ 36.599431] [<ffffffff817b3fcf>] ret_from_fork+0x3f/0x70
[ 36.601067] [<ffffffff81098250>] ? kthread_park+0x60/0x60
[ 36.602667] Code: ba 01 00 00 00 75 c1 48 8b 43 08 eb 9b 48 89 df e8
b6 f8 ff ff eb af 0f 1f 40 00 66 66 66 66 90 55 48 8b 87 f8 04 00 00 48
89 e5 <48> 8b 40 d8 5d c3 66 2e 0f 1f 84 00 00 00 00 00 66 66 66 66 90
[ 36.607519] RIP [<ffffffff81098a90>] kthread_data+0x10/0x20
[ 36.609104] RSP <ffff880035567918>
[ 36.610810] CR2: ffffffffffffffd8
[ 36.612460] ---[ end trace a06ea60748bbedaf ]---
[ 36.614385] Fixing recursive fault but reboot is needed!
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html