On 2021/2/27 9:01 AM, Qu Wenruo wrote:


On 2021/2/27 1:51 AM, fdman...@kernel.org wrote:
From: Filipe Manana <fdman...@suse.com>

When we have smack enabled, during the creation of a directory smack may
attempt to add a "smack transmute" xattr on the inode, which results in
the following warning and trace:

[  220.732359] ------------[ cut here ]------------
[  220.732398] WARNING: CPU: 3 PID: 2548 at fs/btrfs/transaction.c:537 start_transaction+0x489/0x4f0
[  220.732400] Modules linked in: nft_objref nf_conntrack_netbios_ns
(...)
[  220.732439] CPU: 3 PID: 2548 Comm: mkdir Not tainted 5.9.0-rc2smack+ #81
[  220.732441] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.13.0-2.fc32 04/01/2014
[  220.732444] RIP: 0010:start_transaction+0x489/0x4f0
[  220.732447] Code: e9 be fc ff ff (...)
[  220.732449] RSP: 0018:ffffc90001887d10 EFLAGS: 00010202
[  220.732452] RAX: ffff88816f1e0000 RBX: 0000000000000201 RCX: 0000000000000003
[  220.732454] RDX: 0000000000000201 RSI: 0000000000000002 RDI: ffff888177849000
[  220.732456] RBP: ffff888177849000 R08: 0000000000000001 R09: 0000000000000004
[  220.732458] R10: ffffffff825e8f7a R11: 0000000000000003 R12: ffffffffffffffe2
[  220.732460] R13: 0000000000000000 R14: ffff88803d884270 R15: ffff8881680d8000
[  220.732463] FS:  00007f67317b8440(0000) GS:ffff88817bcc0000(0000) knlGS:0000000000000000
[  220.732465] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  220.732467] CR2: 00007f67247a22a8 CR3: 000000004bfbc002 CR4: 0000000000370ee0
[  220.732472] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  220.732474] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  220.732475] Call Trace:
[  220.732480]  ? slab_free_freelist_hook+0xea/0x1b0
[  220.732483]  ? trace_hardirqs_on+0x1c/0xe0
[  220.732490]  btrfs_setxattr_trans+0x3c/0xf0
[  220.732496]  __vfs_setxattr+0x63/0x80
[  220.732502]  smack_d_instantiate+0x2d3/0x360
[  220.732507]  security_d_instantiate+0x29/0x40
[  220.732511]  d_instantiate_new+0x38/0x90
[  220.732515]  btrfs_mkdir+0x1cf/0x1e0
[  220.732521]  vfs_mkdir+0x14f/0x200
[  220.732525]  do_mkdirat+0x6d/0x110
[  220.732531]  do_syscall_64+0x2d/0x40
[  220.732534]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  220.732537] RIP: 0033:0x7f673196ae6b
[  220.732540] Code: 8b 05 11 (...)
[  220.732542] RSP: 002b:00007ffc3c679b18 EFLAGS: 00000246 ORIG_RAX: 0000000000000053
[  220.732545] RAX: ffffffffffffffda RBX: 00000000000001ff RCX: 00007f673196ae6b
[  220.732547] RDX: 0000000000000000 RSI: 00000000000001ff RDI: 00007ffc3c67a30d
[  220.732549] RBP: 00007ffc3c67a30d R08: 00000000000001ff R09: 0000000000000000
[  220.732551] R10: 000055d3e39fe930 R11: 0000000000000246 R12: 0000000000000000
[  220.732553] R13: 00007ffc3c679cd8 R14: 00007ffc3c67a30d R15: 00007ffc3c679ce0
[  220.732563] irq event stamp: 11029
[  220.732566] hardirqs last  enabled at (11037): [<ffffffff81153fe6>] console_unlock+0x486/0x670
[  220.732569] hardirqs last disabled at (11044): [<ffffffff81153c01>] console_unlock+0xa1/0x670
[  220.732572] softirqs last  enabled at (8864): [<ffffffff81e0102f>] asm_call_on_stack+0xf/0x20
[  220.732575] softirqs last disabled at (8851): [<ffffffff81e0102f>] asm_call_on_stack+0xf/0x20
[  220.732577] ---[ end trace 8f958916039daced ]---

This happens because at btrfs_mkdir() we call d_instantiate_new() while
holding a transaction handle, which results in the following call chain:

   btrfs_mkdir()
      trans = btrfs_start_transaction(root, 5);

      d_instantiate_new()
         smack_d_instantiate()
             __vfs_setxattr()
                 btrfs_setxattr_trans()
                    btrfs_start_transaction()
                       start_transaction()
                          WARN_ON()
                            --> a transaction start has TRANS_EXTWRITERS
                                set in its type
                          h->orig_rsv = h->block_rsv
                          h->block_rsv = NULL

      btrfs_end_transaction(trans)

Besides the warning triggered at start_transaction(), we set the handle's
block_rsv to NULL, which may cause some surprises later on.
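
The nested start in the call chain above can be reproduced with a small
userspace model. This is only a sketch: every model_* name below is
hypothetical and none of it is kernel code. It assumes a nested start returns
the handle already stored in a per-task pointer, bumps a use count, and then
performs the two assignments shown in the call chain.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for the kernel structures, not the real definitions. */
struct model_rsv {
    int units;
};

struct model_handle {
    struct model_rsv *block_rsv;
    struct model_rsv *orig_rsv;
    int use_count;
};

/* Stands in for current->journal_info of a single task. */
static struct model_handle *model_journal_info;
/* Counts how many times the modeled WARN_ON() would have fired. */
static int model_warnings;

/*
 * Models a transaction *start* (as opposed to a join). If the task already
 * holds a handle, the start "warns", reuses that handle and clears its
 * block_rsv, mirroring the assignments in the call chain.
 */
static struct model_handle *model_start(struct model_handle *storage,
                                        struct model_rsv *rsv)
{
    struct model_handle *h = model_journal_info;

    if (h) {
        model_warnings++;           /* WARN_ON() in the call chain */
        h->use_count++;             /* assumption: nested users are counted */
        h->orig_rsv = h->block_rsv;
        h->block_rsv = NULL;        /* the reservation is no longer visible */
        return h;
    }

    assert(storage != NULL);
    storage->block_rsv = rsv;
    storage->orig_rsv = NULL;
    storage->use_count = 1;
    model_journal_info = storage;
    return storage;
}

/*
 * Models mkdir starting a transaction and the security hook starting
 * another one underneath it. Returns the number of warnings seen and
 * reports whether the outer handle lost its block_rsv.
 */
static int model_mkdir_nested(int *rsv_lost)
{
    struct model_rsv rsv = { 5 };
    struct model_handle storage;
    struct model_handle *outer = model_start(&storage, &rsv);
    struct model_handle *inner = model_start(NULL, NULL);

    assert(inner == outer);
    *rsv_lost = (outer->block_rsv == NULL);
    model_journal_info = NULL;      /* drop the handle for the next caller */
    return model_warnings;
}
```

Calling model_mkdir_nested() once from a test program reports one warning and
a cleared block_rsv on the outer handle, which is the state the paragraph
above describes.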

So fix this by making btrfs_setxattr_trans() not start a transaction when
we already have a handle on one, stored in current->journal_info, and use
that handle. We are good to use the handle because at btrfs_mkdir() we did
reserve space for the xattr and the inode item.
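
The control flow of the fix (start and end a transaction only when the task
does not already hold one) can be sketched in userspace as follows. All
model_* names and the unit accounting are hypothetical and simplified; only
the start_trans test mirrors the patch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, simplified handle: only tracks reserved units. */
struct model_handle {
    int reserved_units;
};

/* Stands in for current->journal_info of a single task. */
static struct model_handle *model_journal_info;
static int model_starts;
static int model_ends;

static struct model_handle *model_start(struct model_handle *storage, int units)
{
    /* In this model a start while holding a handle is simply a bug. */
    assert(model_journal_info == NULL);
    storage->reserved_units = units;
    model_journal_info = storage;
    model_starts++;
    return storage;
}

static void model_end(struct model_handle *h)
{
    assert(model_journal_info == h);
    model_journal_info = NULL;
    model_ends++;
}

/* Mirrors the shape of the patched btrfs_setxattr_trans(). */
static int model_setxattr_trans(void)
{
    const bool start_trans = (model_journal_info == NULL);
    struct model_handle local;
    struct model_handle *trans;

    if (start_trans)
        trans = model_start(&local, 2);  /* 1 unit xattr + 1 unit inode item */
    else
        trans = model_journal_info;      /* caller already reserved space */

    trans->reserved_units -= 2;          /* "use" the xattr and inode units */

    if (start_trans)
        model_end(trans);                /* only end what we started */
    return 0;
}

/* Models mkdir holding a handle while the security hook sets an xattr. */
static int model_mkdir(void)
{
    struct model_handle storage;
    struct model_handle *trans = model_start(&storage, 5);
    int ret = model_setxattr_trans();
    int left = trans->reserved_units;

    model_end(trans);
    return ret ? ret : left;             /* units left in the mkdir handle */
}
```

In this model a standalone model_setxattr_trans() call performs one start and
one end, while the call made from inside model_mkdir() performs neither and
leaves the outer handle intact for mkdir to end.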

Reported-by: Casey Schaufler <ca...@schaufler-ca.com>
Link: https://lore.kernel.org/linux-btrfs/434d856f-bd7b-4889-a6ec-e81aaebfa...@schaufler-ca.com/

Signed-off-by: Filipe Manana <fdman...@suse.com>
---
  fs/btrfs/xattr.c | 31 +++++++++++++++++++++++++++----
  1 file changed, 27 insertions(+), 4 deletions(-)

diff --git a/fs/btrfs/xattr.c b/fs/btrfs/xattr.c
index af6246f36a9e..03135dbb318a 100644
--- a/fs/btrfs/xattr.c
+++ b/fs/btrfs/xattr.c
@@ -229,11 +229,33 @@ int btrfs_setxattr_trans(struct inode *inode, const char *name,
  {
      struct btrfs_root *root = BTRFS_I(inode)->root;
      struct btrfs_trans_handle *trans;
+    const bool start_trans = (current->journal_info == NULL);

Not sure about the call context, but shouldn't we also check
BTRFS_SEND_TRANS_STUB?

Or there is something else to prevent us getting journal_info as
BTRFS_SEND_TRANS_STUB?

Oh, never mind, it won't be possible to have journal_info ==
BTRFS_SEND_TRANS_STUB in this context, so the check should be good.

Thanks,
Qu

      int ret;

-    trans = btrfs_start_transaction(root, 2);
-    if (IS_ERR(trans))
-        return PTR_ERR(trans);
+    if (start_trans) {
+        /*
+         * 1 unit for inserting/updating/deleting the xattr
+         * 1 unit for the inode item update
+         */
+        trans = btrfs_start_transaction(root, 2);
+        if (IS_ERR(trans))
+            return PTR_ERR(trans);
+    } else {
+        /*
+         * This can happen when smack is enabled and a directory is being
+         * created. It happens through d_instantiate_new(), which calls
+         * smack_d_instantiate(), which in turn calls __vfs_setxattr() to
+         * set the transmute xattr (XATTR_NAME_SMACKTRANSMUTE) on the
+         * inode. We have already reserved space for the xattr and inode
+         * update at btrfs_mkdir(), so just use the transaction handle.
+         * We don't join or start a transaction, as that will reset the
+         * block_rsv of the handle and trigger a warning for the start
+         * case.
+         */
+        ASSERT(strncmp(name, XATTR_SECURITY_PREFIX,
+                   XATTR_SECURITY_PREFIX_LEN) == 0);
+        trans = current->journal_info;
+    }

      ret = btrfs_setxattr(trans, inode, name, value, size, flags);
      if (ret)
@@ -244,7 +266,8 @@ int btrfs_setxattr_trans(struct inode *inode, const char *name,
      ret = btrfs_update_inode(trans, root, BTRFS_I(inode));
      BUG_ON(ret);
  out:
-    btrfs_end_transaction(trans);
+    if (start_trans)
+        btrfs_end_transaction(trans);
      return ret;
  }

