subject:"\[PATCH\] ext4\: move buffer_mapped\(\) to proper position"

Re: [PATCH] ext4: move buffer_mapped() to proper position

2020-08-07 Thread Andreas Dilger


> On Aug 7, 2020, at 2:02 PM, ty...@mit.edu wrote:
> 
> Thanks, applied, although I rewrote the commit description to make it
> be a bit more clearer:
> 
>fs: prevent BUG_ON in submit_bh_wbc()
> 
>If a device is hot-removed --- for example, when a physical device is
>unplugged from pcie slot or a nbd device's network is shutdown ---
>this can result in a BUG_ON() crash in submit_bh_wbc().  This is
>because the when the block device dies, the buffer heads will have
>their Buffer_Mapped flag get cleared, leading to the crash in
>submit_bh_wbc.
> 
>We had attempted to work around this problem in commit a17712c8

Should this get a "Fixes:" label with this info, rather than embedding
it in the commit message, so that it could be picked up by stable?

Cheers, Andreas

>("ext4: check superblock mapped prior to committing").  Unfortunately,
>it's still possible to hit the BUG_ON(!buffer_mapped(bh)) if the
>device dies between when the work-around check in ext4_commit_super()
>and when submit_bh_wbh() is finally called:
> 
>Code path:
>ext4_commit_super
>judge if 'buffer_mapped(sbh)' is false, return <== commit a17712c8
>  lock_buffer(sbh)
>  ...
>  unlock_buffer(sbh)
>   __sync_dirty_buffer(sbh,...
>lock_buffer(sbh)
>judge if 'buffer_mapped(sbh))' is false, return 
> <== added by this patch
>submit_bh(...,sbh)
>submit_bh_wbc(...,sbh,...)
> 
>[100722.966497] kernel BUG at fs/buffer.c:3095! <== 
> BUG_ON(!buffer_mapped(bh))' in submit_bh_wbc()
>[100722.966503] invalid opcode:  [#1] SMP
>[100722.966566] task: 8817e15a9e40 task.stack: c90024744000
>[100722.966574] RIP: 0010:submit_bh_wbc+0x180/0x190
>[100722.966575] RSP: 0018:c90024747a90 EFLAGS: 00010246
>[100722.966576] RAX: 00620005 RBX: 8818a80603a8 RCX: 
> 
>[100722.966576] RDX: 8818a80603a8 RSI: 00020800 RDI: 
> 0001
>[100722.966577] RBP: c90024747ac0 R08:  R09: 
> 88207f94170d
>[100722.966578] R10: 000437c8 R11: 0001 R12: 
> 00020800
>[100722.966578] R13: 0001 R14: 0bf9a438 R15: 
> 88195f333000
>[100722.966580] FS:  7fa2eee27700() GS:88203d84() 
> knlGS:
>[100722.966580] CS:  0010 DS:  ES:  CR0: 80050033
>[100722.966581] CR2: 00f0b008 CR3: 00201a622003 CR4: 
> 007606e0
>[100722.966582] DR0:  DR1:  DR2: 
> 
>[100722.966583] DR3:  DR6: fffe0ff0 DR7: 
> 0400
>[100722.966583] PKRU: 5554
>[100722.966583] Call Trace:
>[100722.966588]  __sync_dirty_buffer+0x6e/0xd0
>[100722.966614]  ext4_commit_super+0x1d8/0x290 [ext4]
>[100722.966626]  __ext4_std_error+0x78/0x100 [ext4]
>[100722.966635]  ? __ext4_journal_get_write_access+0xca/0x120 [ext4]
>[100722.966646]  ext4_reserve_inode_write+0x58/0xb0 [ext4]
>[100722.966655]  ? ext4_dirty_inode+0x48/0x70 [ext4]
>[100722.93]  ext4_mark_inode_dirty+0x53/0x1e0 [ext4]
>[100722.966671]  ? __ext4_journal_start_sb+0x6d/0xf0 [ext4]
>[100722.966679]  ext4_dirty_inode+0x48/0x70 [ext4]
>[100722.966682]  __mark_inode_dirty+0x17f/0x350
>[100722.966686]  generic_update_time+0x87/0xd0
>[100722.966687]  touch_atime+0xa9/0xd0
>[100722.966690]  generic_file_read_iter+0xa09/0xcd0
>[100722.966694]  ? page_cache_tree_insert+0xb0/0xb0
>[100722.966704]  ext4_file_read_iter+0x4a/0x100 [ext4]
>[100722.966707]  ? __inode_security_revalidate+0x4f/0x60
>[100722.966709]  __vfs_read+0xec/0x160
>[100722.966711]  vfs_read+0x8c/0x130
>[100722.966712]  SyS_pread64+0x87/0xb0
>[100722.966716]  do_syscall_64+0x67/0x1b0
>[100722.966719]  entry_SYSCALL64_slow_path+0x25/0x25
> 
>To address this, add the check of 'buffer_mapped(bh)' to
>__sync_dirty_buffer().  This also has the benefit of fixing this for
>other file systems.
> 
>With this addition, we can drop the workaround in ext4_commit_supper().
> 
>[ Commit description rewritten by tytso. ]
> 
>Signed-off-by: Xianting Tian 
>Link: 
> https://lore.kernel.org/r/1596211825-8750-1-git-send-email-xianting_t...@126.com
>Signed-off-by: Theodore Ts'o 
> 
>   - Ted


Cheers, Andreas







signature.asc
Description: Message signed with OpenPGP

Re: [PATCH] ext4: move buffer_mapped() to proper position

2020-08-07 Thread tytso

Thanks, applied, although I rewrote the commit description to make it
be a bit more clearer:

fs: prevent BUG_ON in submit_bh_wbc()

If a device is hot-removed --- for example, when a physical device is
unplugged from pcie slot or a nbd device's network is shutdown ---
this can result in a BUG_ON() crash in submit_bh_wbc().  This is
because the when the block device dies, the buffer heads will have
their Buffer_Mapped flag get cleared, leading to the crash in
submit_bh_wbc.

We had attempted to work around this problem in commit a17712c8
("ext4: check superblock mapped prior to committing").  Unfortunately,
it's still possible to hit the BUG_ON(!buffer_mapped(bh)) if the
device dies between when the work-around check in ext4_commit_super()
and when submit_bh_wbh() is finally called:

Code path:
ext4_commit_super
judge if 'buffer_mapped(sbh)' is false, return <== commit a17712c8
  lock_buffer(sbh)
  ...
  unlock_buffer(sbh)
   __sync_dirty_buffer(sbh,...
lock_buffer(sbh)
judge if 'buffer_mapped(sbh))' is false, return <== 
added by this patch
submit_bh(...,sbh)
submit_bh_wbc(...,sbh,...)

[100722.966497] kernel BUG at fs/buffer.c:3095! <== 
BUG_ON(!buffer_mapped(bh))' in submit_bh_wbc()
[100722.966503] invalid opcode:  [#1] SMP
[100722.966566] task: 8817e15a9e40 task.stack: c90024744000
[100722.966574] RIP: 0010:submit_bh_wbc+0x180/0x190
[100722.966575] RSP: 0018:c90024747a90 EFLAGS: 00010246
[100722.966576] RAX: 00620005 RBX: 8818a80603a8 RCX: 

[100722.966576] RDX: 8818a80603a8 RSI: 00020800 RDI: 
0001
[100722.966577] RBP: c90024747ac0 R08:  R09: 
88207f94170d
[100722.966578] R10: 000437c8 R11: 0001 R12: 
00020800
[100722.966578] R13: 0001 R14: 0bf9a438 R15: 
88195f333000
[100722.966580] FS:  7fa2eee27700() GS:88203d84() 
knlGS:
[100722.966580] CS:  0010 DS:  ES:  CR0: 80050033
[100722.966581] CR2: 00f0b008 CR3: 00201a622003 CR4: 
007606e0
[100722.966582] DR0:  DR1:  DR2: 

[100722.966583] DR3:  DR6: fffe0ff0 DR7: 
0400
[100722.966583] PKRU: 5554
[100722.966583] Call Trace:
[100722.966588]  __sync_dirty_buffer+0x6e/0xd0
[100722.966614]  ext4_commit_super+0x1d8/0x290 [ext4]
[100722.966626]  __ext4_std_error+0x78/0x100 [ext4]
[100722.966635]  ? __ext4_journal_get_write_access+0xca/0x120 [ext4]
[100722.966646]  ext4_reserve_inode_write+0x58/0xb0 [ext4]
[100722.966655]  ? ext4_dirty_inode+0x48/0x70 [ext4]
[100722.93]  ext4_mark_inode_dirty+0x53/0x1e0 [ext4]
[100722.966671]  ? __ext4_journal_start_sb+0x6d/0xf0 [ext4]
[100722.966679]  ext4_dirty_inode+0x48/0x70 [ext4]
[100722.966682]  __mark_inode_dirty+0x17f/0x350
[100722.966686]  generic_update_time+0x87/0xd0
[100722.966687]  touch_atime+0xa9/0xd0
[100722.966690]  generic_file_read_iter+0xa09/0xcd0
[100722.966694]  ? page_cache_tree_insert+0xb0/0xb0
[100722.966704]  ext4_file_read_iter+0x4a/0x100 [ext4]
[100722.966707]  ? __inode_security_revalidate+0x4f/0x60
[100722.966709]  __vfs_read+0xec/0x160
[100722.966711]  vfs_read+0x8c/0x130
[100722.966712]  SyS_pread64+0x87/0xb0
[100722.966716]  do_syscall_64+0x67/0x1b0
[100722.966719]  entry_SYSCALL64_slow_path+0x25/0x25

To address this, add the check of 'buffer_mapped(bh)' to
__sync_dirty_buffer().  This also has the benefit of fixing this for
other file systems.

With this addition, we can drop the workaround in ext4_commit_supper().

[ Commit description rewritten by tytso. ]

Signed-off-by: Xianting Tian 
Link: 
https://lore.kernel.org/r/1596211825-8750-1-git-send-email-xianting_t...@126.com
Signed-off-by: Theodore Ts'o 

- Ted

[PATCH] ext4: move buffer_mapped() to proper position

2020-07-31 Thread Xianting Tian

As you know, commit a17712c8 has added below code to aviod a
crash( 'BUG_ON(!buffer_mapped(bh))' in submit_bh_wbc) when
device hot-removed(a physical device is unpluged from pcie slot
or a nbd device's network is shutdown).
static int ext4_commit_super():
if (!sbh || block_device_ejected(sb))
return error;
+
+   /*
+* The superblock bh should be mapped, but it might not be if the
+* device was hot-removed. Not much we can do but fail the I/O.
+*/
+   if (!buffer_mapped(sbh))
+   return error;

And the call trace, which leads to the crash, as below:
ext4_commit_super()
  __sync_dirty_buffer()
submit_bh()
  submit_bh_wbc()
BUG_ON(!buffer_mapped(bh));

But recently we met the same crash(with very low probability) when
device hot-removed even though the kernel already contained
above exception protection code. Still, the crash is caused by
'BUG_ON(!buffer_mapped(bh))' in submit_bh_wbc(), and the same
call trace as below.

As my understanding and below code，there are still some more
codes needs to run between 'buffer_mapped(sbh)'(which is added
by commit a17712c8) and 'BUG_ON(!buffer_mapped(bh))' in
submit_bh_wbc(), especially lock_buffer is called two times(sometimes,
it may take more times to get the lock). So when do the test of
device hot-remove, there is low probability that the sbh is mapped
when executing 'buffer_mapped(sbh)'(which is added by commit a17712c8)
but sbh is not mapped when executing 'BUG_ON(!buffer_mapped(bh))'
in submit_bh_wbc().
Code path:
ext4_commit_super
judge if 'buffer_mapped(sbh)' is false, return <== commit a17712c8
  lock_buffer(sbh)
  ...
  unlock_buffer(sbh)
   __sync_dirty_buffer(sbh,...
lock_buffer(sbh)
judge if 'buffer_mapped(sbh))' is false, return <== 
added by this patch
submit_bh(...,sbh)
submit_bh_wbc(...,sbh,...)

This patch is to move the check of 'buffer_mapped(sbh)' to the place just
before calling 'BUG_ON(!buffer_mapped(bh))' in submit_bh_wbc().

[100722.966497] kernel BUG at fs/buffer.c:3095! <== BUG_ON(!buffer_mapped(bh))' 
in submit_bh_wbc()
[100722.966503] invalid opcode:  [#1] SMP
[100722.966566] task: 8817e15a9e40 task.stack: c90024744000
[100722.966574] RIP: 0010:submit_bh_wbc+0x180/0x190
[100722.966575] RSP: 0018:c90024747a90 EFLAGS: 00010246
[100722.966576] RAX: 00620005 RBX: 8818a80603a8 RCX: 

[100722.966576] RDX: 8818a80603a8 RSI: 00020800 RDI: 
0001
[100722.966577] RBP: c90024747ac0 R08:  R09: 
88207f94170d
[100722.966578] R10: 000437c8 R11: 0001 R12: 
00020800
[100722.966578] R13: 0001 R14: 0bf9a438 R15: 
88195f333000
[100722.966580] FS:  7fa2eee27700() GS:88203d84() 
knlGS:
[100722.966580] CS:  0010 DS:  ES:  CR0: 80050033
[100722.966581] CR2: 00f0b008 CR3: 00201a622003 CR4: 
007606e0
[100722.966582] DR0:  DR1:  DR2: 

[100722.966583] DR3:  DR6: fffe0ff0 DR7: 
0400
[100722.966583] PKRU: 5554
[100722.966583] Call Trace:
[100722.966588]  __sync_dirty_buffer+0x6e/0xd0
[100722.966614]  ext4_commit_super+0x1d8/0x290 [ext4]
[100722.966626]  __ext4_std_error+0x78/0x100 [ext4]
[100722.966635]  ? __ext4_journal_get_write_access+0xca/0x120 [ext4]
[100722.966646]  ext4_reserve_inode_write+0x58/0xb0 [ext4]
[100722.966655]  ? ext4_dirty_inode+0x48/0x70 [ext4]
[100722.93]  ext4_mark_inode_dirty+0x53/0x1e0 [ext4]
[100722.966671]  ? __ext4_journal_start_sb+0x6d/0xf0 [ext4]
[100722.966679]  ext4_dirty_inode+0x48/0x70 [ext4]
[100722.966682]  __mark_inode_dirty+0x17f/0x350
[100722.966686]  generic_update_time+0x87/0xd0
[100722.966687]  touch_atime+0xa9/0xd0
[100722.966690]  generic_file_read_iter+0xa09/0xcd0
[100722.966694]  ? page_cache_tree_insert+0xb0/0xb0
[100722.966704]  ext4_file_read_iter+0x4a/0x100 [ext4]
[100722.966707]  ? __inode_security_revalidate+0x4f/0x60
[100722.966709]  __vfs_read+0xec/0x160
[100722.966711]  vfs_read+0x8c/0x130
[100722.966712]  SyS_pread64+0x87/0xb0
[100722.966716]  do_syscall_64+0x67/0x1b0
[100722.966719]  entry_SYSCALL64_slow_path+0x25/0x25

Signed-off-by: Xianting Tian 
---
 fs/buffer.c | 9 +
 fs/ext4/super.c | 7 ---
 2 files changed, 9 insertions(+), 7 deletions(-)

diff --git a/fs/buffer.c b/fs/buffer.c
index 64fe82e..75a8849 100644
--- a/fs/buffer.c
+++ b/fs/buffer.c
@@ -3160,6 +3160,15 @@ int __sync_dirty_buffer(struct buffer_head *bh, int 
op_flags)
WARN_ON(atomic_read(&bh->b_count) < 1);
lock_buffer(bh);
if (test_clear_buffer_dirty(bh)) {
+   /*
+* The bh should be mapped, but it might not be if the
+* device was hot-rem

Re: [PATCH] ext4: move buffer_mapped() to proper position

Re: [PATCH] ext4: move buffer_mapped() to proper position

[PATCH] ext4: move buffer_mapped() to proper position

3 matches

Site Navigation

Mail list logo

Footer information