Re: [PATCH v3 1/2] btrfs: scrub: fix circular locking dependency warning
On 12/04/2018 07:16 PM, David Sterba wrote: On Fri, Nov 30, 2018 at 01:15:23PM +0800, Anand Jain wrote: @@ -3757,10 +3757,13 @@ static noinline_for_stack int scrub_workers_get(struct btrfs_fs_info *fs_info, static noinline_for_stack void scrub_workers_put(struct btrfs_fs_info *fs_info) { + lockdep_assert_held(&fs_info->scrub_lock); [1] if (--fs_info->scrub_workers_refcnt == 0) { + mutex_unlock(&fs_info->scrub_lock); btrfs_destroy_workqueue(fs_info->scrub_workers); btrfs_destroy_workqueue(fs_info->scrub_wr_completion_workers); btrfs_destroy_workqueue(fs_info->scrub_parity_workers); + mutex_lock(&fs_info->scrub_lock); } WARN_ON(fs_info->scrub_workers_refcnt < 0); } btrfs/011 lockdep warning is gone, but now there's a list corruption reported by btrfs/073. I'm testing the 2 patches on top of current master to avoid interference with misc-next patches. Sorry for the delay due to my vacation. Thanks for the report, btrfs/073 exposed this on your test setup, it didn't on mine. I think I know what is happening.. now as scrub_worker_refcnt is zero [1] (above) and scrub_lock is released, the scrub_workers_get() can create a new worker even before the btrfs_destroy_workqueue() is completed. I am trying to prove this theory with traps. scrub threads has week synchronization, and assumes btrfs-progs is the only process which can call scrub-ioctl, apparently its not true. To fix it needs a bit of complete revival which I was trying to avoid. Let me try if there is a better fix for this. Thanks, Anand btrfs/073 [11:07:19] [ 3580.466293] run fstests btrfs/073at 2018-12-04 11:07:19 [ 3580.610367] BTRFS info (device vda): disk space caching is enabled [ 3580.612809] BTRFS info (device vda): has skinny extents [ 3580.876639] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 1 transid 5 /dev/vdb [ 3580.880569] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 2 transid 5 /dev/vdc [ 3580.882947] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 3 transid 5 /dev/vdd [ 3580.885499] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 4 transid 5 /dev/vde [ 3580.887972] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 5 transid 5 /dev/vdf [ 3580.890394] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 6 transid 5 /dev/vdg [ 3580.893729] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 7 transid 5 /dev/vdh [ 3580.903180] BTRFS info (device vdb): disk space caching is enabled [ 3580.904538] BTRFS info (device vdb): has skinny extents [ 3580.905322] BTRFS info (device vdb): flagging fs with big metadatafeature [ 3580.908555] BTRFS info (device vdb): checking UUID tree [ 3580.951440] [ cut here ] [ 3580.954122] list_add corruption. prev->next should be next(a189faa9acc8), but was a189faa9adc0. (prev=a189faa9a480). [ 3580.960061] WARNING: CPU: 0 PID: 24578 at lib/list_debug.c:28__list_add_valid+0x4d/0x70 [ 3580.962346] BTRFS info (device vdb): use no compression, level 0 [ 3580.963694] Modules linked in: dm_flakey dm_mod btrfs libcrc32c xorzstd_decompress zstd_compress xxhash raid6_pq loop [ 3580.963702] CPU: 0 PID: 24578 Comm: btrfs Not tainted4.20.0-rc5-default+ #366 [ 3580.963703] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996),BIOS rel-1.11.2-0-gf9626cc-prebuilt.qemu-project.org 04/01/2014 [ 3580.963706] RIP: 0010:__list_add_valid+0x4d/0x70 [ 3580.963708] RSP: 0018:b72d88817b90 EFLAGS: 00010286 [ 3580.963709] RAX: RBX: b72d88817c10 RCX: [ 3580.963710] RDX: 0002 RSI: 0001 RDI:860c4c1d [ 3580.963710] RBP: a189faa9acc8 R08: 0001 R09: [ 3580.963714] R10: R11: 882a0a2d R12:a189faa9ac70 [ 3580.966020] BTRFS info (device vdb): disk space caching is enabled [ 3580.967721] R13: a189faa9a480 R14: a189faa9ac70 R15:a189faa9ac78 [ 3580.967724] FS: 7f04289d1700() GS:a189fd40()knlGS: [ 3580.967725] CS: 0010 DS: ES: CR0: 80050033 [ 3580.967725] CR2: 7f7b58b034ac CR3: 699d9000 CR4:06f0 [ 3580.967729] Call Trace: [ 3580.967736] __mutex_add_waiter+0x34/0x70 [ 3580.967743] ? drain_workqueue+0x1e/0x180 [ 3580.994465] __mutex_lock+0x134/0x9d0 [ 3580.995526] ? __schedule+0x2eb/0xb20 [ 3580.996584] ? drain_workqueue+0x1e/0x180 [ 3580.997727] drain_workqueue+0x1e/0x180 [ 3580.998793] destroy_workqueue+0x17/0x240 [ 3580.999879] btrfs_destroy_workqueue+0x57/0x200 [btrfs] [ 3581.001148] scrub_workers_put+0x6c/0x90 [btrfs] [ 3581.002257] btrfs_scrub_dev+0x2f6/0x590 [btrfs] [ 3581.003370] ? __sb_start_write+0x12c/0x1d0 [ 3581.004450] ? mnt_want_write_file+0x24/0x60 [ 3581.005613] btrfs_ioctl+0xfc7/0x2f00 [btrfs] [ 3581.006361] ? get_task_
Re: [PATCH v3 1/2] btrfs: scrub: fix circular locking dependency warning
On Fri, Nov 30, 2018 at 01:15:23PM +0800, Anand Jain wrote: > @@ -3757,10 +3757,13 @@ static noinline_for_stack int > scrub_workers_get(struct btrfs_fs_info *fs_info, > > static noinline_for_stack void scrub_workers_put(struct btrfs_fs_info > *fs_info) > { > + lockdep_assert_held(&fs_info->scrub_lock); > if (--fs_info->scrub_workers_refcnt == 0) { > + mutex_unlock(&fs_info->scrub_lock); > btrfs_destroy_workqueue(fs_info->scrub_workers); > btrfs_destroy_workqueue(fs_info->scrub_wr_completion_workers); > btrfs_destroy_workqueue(fs_info->scrub_parity_workers); > + mutex_lock(&fs_info->scrub_lock); > } > WARN_ON(fs_info->scrub_workers_refcnt < 0); > } btrfs/011 lockdep warning is gone, but now there's a list corruption reported by btrfs/073. I'm testing the 2 patches on top of current master to avoid interference with misc-next patches. btrfs/073 [11:07:19] [ 3580.466293] run fstests btrfs/073at 2018-12-04 11:07:19 [ 3580.610367] BTRFS info (device vda): disk space caching is enabled [ 3580.612809] BTRFS info (device vda): has skinny extents [ 3580.876639] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 1 transid 5 /dev/vdb [ 3580.880569] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 2 transid 5 /dev/vdc [ 3580.882947] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 3 transid 5 /dev/vdd [ 3580.885499] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 4 transid 5 /dev/vde [ 3580.887972] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 5 transid 5 /dev/vdf [ 3580.890394] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 6 transid 5 /dev/vdg [ 3580.893729] BTRFS: device fsid d452261d-c956-4b54-aab9-8318c3c211fcdevid 7 transid 5 /dev/vdh [ 3580.903180] BTRFS info (device vdb): disk space caching is enabled [ 3580.904538] BTRFS info (device vdb): has skinny extents [ 3580.905322] BTRFS info (device vdb): flagging fs with big metadatafeature [ 3580.908555] BTRFS info (device vdb): checking UUID tree [ 3580.951440] [ cut here ] [ 3580.954122] list_add corruption. prev->next should be next(a189faa9acc8), but was a189faa9adc0. (prev=a189faa9a480). [ 3580.960061] WARNING: CPU: 0 PID: 24578 at lib/list_debug.c:28__list_add_valid+0x4d/0x70 [ 3580.962346] BTRFS info (device vdb): use no compression, level 0 [ 3580.963694] Modules linked in: dm_flakey dm_mod btrfs libcrc32c xorzstd_decompress zstd_compress xxhash raid6_pq loop [ 3580.963702] CPU: 0 PID: 24578 Comm: btrfs Not tainted4.20.0-rc5-default+ #366 [ 3580.963703] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996),BIOS rel-1.11.2-0-gf9626cc-prebuilt.qemu-project.org 04/01/2014 [ 3580.963706] RIP: 0010:__list_add_valid+0x4d/0x70 [ 3580.963708] RSP: 0018:b72d88817b90 EFLAGS: 00010286 [ 3580.963709] RAX: RBX: b72d88817c10 RCX: [ 3580.963710] RDX: 0002 RSI: 0001 RDI:860c4c1d [ 3580.963710] RBP: a189faa9acc8 R08: 0001 R09: [ 3580.963714] R10: R11: 882a0a2d R12:a189faa9ac70 [ 3580.966020] BTRFS info (device vdb): disk space caching is enabled [ 3580.967721] R13: a189faa9a480 R14: a189faa9ac70 R15:a189faa9ac78 [ 3580.967724] FS: 7f04289d1700() GS:a189fd40()knlGS: [ 3580.967725] CS: 0010 DS: ES: CR0: 80050033 [ 3580.967725] CR2: 7f7b58b034ac CR3: 699d9000 CR4:06f0 [ 3580.967729] Call Trace: [ 3580.967736] __mutex_add_waiter+0x34/0x70 [ 3580.967743] ? drain_workqueue+0x1e/0x180 [ 3580.994465] __mutex_lock+0x134/0x9d0 [ 3580.995526] ? __schedule+0x2eb/0xb20 [ 3580.996584] ? drain_workqueue+0x1e/0x180 [ 3580.997727] drain_workqueue+0x1e/0x180 [ 3580.998793] destroy_workqueue+0x17/0x240 [ 3580.999879] btrfs_destroy_workqueue+0x57/0x200 [btrfs] [ 3581.001148] scrub_workers_put+0x6c/0x90 [btrfs] [ 3581.002257] btrfs_scrub_dev+0x2f6/0x590 [btrfs] [ 3581.003370] ? __sb_start_write+0x12c/0x1d0 [ 3581.004450] ? mnt_want_write_file+0x24/0x60 [ 3581.005613] btrfs_ioctl+0xfc7/0x2f00 [btrfs] [ 3581.006361] ? get_task_io_context+0x2d/0x90 [ 3581.007025] ? do_vfs_ioctl+0xa2/0x6d0 [ 3581.007699] do_vfs_ioctl+0xa2/0x6d0 [ 3581.008603] ? __fget+0x109/0x1e0 [ 3581.009413] ksys_ioctl+0x3a/0x70 [ 3581.010326] __x64_sys_ioctl+0x16/0x20 [ 3581.011234] do_syscall_64+0x54/0x180 [ 3581.012143] entry_SYSCALL_64_after_hwframe+0x49/0xbe [ 3581.013213] RIP: 0033:0x7f042bac9aa7 [ 3581.014133] Code: 00 00 90 48 8b 05 f1 83 2c 00 64 c7 00 26 00 00 0048 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d c1 83 2c 00 f7 d8 64 89 01 48 [ 3581.018486] RSP: 002b:7f04289d0d38 EFLAGS: 0246 ORIG_RAX:0010 [ 3581.019703] RAX: ffda RBX: 55c05475de10