Re: kernel BUG at fs/btrfs/ctree.c:1144!
> The real problem, however, is not caused by that commit but by a tree mod log > bug. I expect that fs/btrfs/ctree.c:1144 is this BUG_ON in your kernel from > __tree_mod_log_rewind (my line numbers don't match): > > 1138 switch (tm->op) { > 1139 case MOD_LOG_KEY_REMOVE_WHILE_FREEING: > 1140 BUG_ON(tm->slot < n); > > I've got a fix for that I'm currently testing, expect it on the list soon. Yea, thats the line. > For the meantime I recommend to not defrag your filesystem. Ok, already disabled and rebooted. Will come back to You if i still get that Problem. > As a general remark, please send your stack traces inline, not as attachment > if > possible. Google mail always screws longer than 80 lines in plain text mode and non plain text is not what You want. Maybe i should go back to using mutt. Thank You, Ahmet -- To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: kernel BUG at fs/btrfs/ctree.c:1144!
On Wed, April 10, 2013 at 09:58 (+0200), Ahmet Inan wrote: > I got this problem since 3.8.5 + for-linus (from that time). > Have just tried 3.8.6 + for-linus with "git merge -X theirs > btrfs/for-linus" but still same problem. > Going back to 3.7.4 + for-linus (from that time) doesn't give me the problem. The stack you attached shows a function added in the snapshot aware defrag patches (commit 38c227d8), added in 3.8. The real problem, however, is not caused by that commit but by a tree mod log bug. I expect that fs/btrfs/ctree.c:1144 is this BUG_ON in your kernel from __tree_mod_log_rewind (my line numbers don't match): 1138 switch (tm->op) { 1139 case MOD_LOG_KEY_REMOVE_WHILE_FREEING: 1140 BUG_ON(tm->slot < n); I've got a fix for that I'm currently testing, expect it on the list soon. > This is an production nfs server with 2x2TB raid1, so cant reboot it that > often. > Have seen this same problem on another system (also raid1) once, but > rebooting helped, no problems since. > Both systems use autodefrag, maybe that sometimes triggers it? > I really would like to help, so i can stay on the latest kernels. > What should i do? For the meantime I recommend to not defrag your filesystem. As a general remark, please send your stack traces inline, not as attachment if possible. Thanks, -Jan -- To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
kernel BUG at fs/btrfs/ctree.c:1144!
I got this problem since 3.8.5 + for-linus (from that time). Have just tried 3.8.6 + for-linus with "git merge -X theirs btrfs/for-linus" but still same problem. Going back to 3.7.4 + for-linus (from that time) doesn't give me the problem. This is an production nfs server with 2x2TB raid1, so cant reboot it that often. Have seen this same problem on another system (also raid1) once, but rebooting helped, no problems since. Both systems use autodefrag, maybe that sometimes triggers it? I really would like to help, so i can stay on the latest kernels. What should i do? Ahmet [35103.977604] [ cut here ] [35103.977633] kernel BUG at fs/btrfs/ctree.c:1144! [35103.977653] invalid opcode: [#1] SMP [35103.977674] Modules linked in: dm_mod fuse snd_hda_codec_analog snd_hda_intel snd_hda_codec snd_pcm snd_page_alloc snd_timer snd soundcore ppdev iTCO_wdt lpc_ich mfd_core parport_pc parport kvm [35103.91] CPU 1 [35103.977783] Pid: 2989, comm: btrfs-endio-wri Not tainted 3.8.6-ainan #119 Dell Inc. OptiPlex 755 /0GM819 [35103.977825] RIP: 0010:[] [] __tree_mod_log_rewind+0x25b/0x260 [35103.977864] RSP: 0018:880107a8f7b8 EFLAGS: 00010297 [35103.977885] RAX: RBX: 88006a9094d0 RCX: 8801126124c0 [35103.977912] RDX: 0b6398ee RSI: 066c RDI: 880182953a40 [35103.977939] RBP: 880107a8f7e8 R08: 1000 R09: 880107a8f760 [35103.977965] R10: 0016ed05 R11: R12: 880222fb76c0 [35103.977993] R13: 002f R14: 8801126124c0 R15: 0cd2 [35103.978019] FS: () GS:880237c4() knlGS: [35103.978049] CS: 0010 DS: ES: CR0: 8005003b [35103.978071] CR2: 00b89308 CR3: 0001b5b2f000 CR4: 07e0 [35103.978097] DR0: DR1: DR2: [35103.978123] DR3: DR6: 0ff0 DR7: 0400 [35103.978150] Process btrfs-endio-wri (pid: 2989, threadinfo 880107a8e000, task 88014516c0c0) [35103.978182] Stack: [35103.978192] 88006a909a44 0002 88006a9094d0 88006a909a10 [35103.978227] 8801355bd400 1600 880107a8f888 812b6266 [35103.978261] 880107a8f828 880222fb76c0 88022dea8000 00010cd2 [35103.978295] Call Trace: [35103.978309] [] btrfs_search_old_slot+0x646/0x990 [35103.978336] [] __resolve_indirect_refs+0x16f/0x610 [35103.978363] [] ? free_extent_buffer+0x58/0xb0 [35103.978387] [] ? __add_missing_keys.clone.13+0x89/0x130 [35103.978415] [] ? free_extent_buffer+0x58/0xb0 [35103.978439] [] find_parent_nodes+0x586/0x9a0 [35103.978462] [] ? btrfs_set_path_blocking+0x31/0x70 [35103.978488] [] btrfs_find_all_roots+0xa1/0x100 [35103.978513] [] ? record_extent_backrefs+0xf0/0xf0 [35103.978538] [] iterate_extent_inodes+0x183/0x2d0 [35103.978562] [] ? btrfs_get_token_64+0x5f/0xf0 [35103.978587] [] ? free_extent_buffer+0x58/0xb0 [35103.978612] [] iterate_inodes_from_logical+0xa7/0xb0 [35103.978638] [] ? record_extent_backrefs+0xf0/0xf0 [35103.978664] [] record_extent_backrefs+0x78/0xf0 [35103.978689] [] btrfs_finish_ordered_io+0x186/0xa70 [35103.978715] [] ? try_to_del_timer_sync+0x76/0xa0 [35103.978740] [] ? cascade+0xf0/0x190 [35103.978762] [] finish_ordered_fn+0x10/0x20 [35103.978785] [] worker_loop+0xb8/0x540 [35103.978807] [] ? btrfs_queue_worker+0x310/0x310 [35103.978832] [] ? btrfs_queue_worker+0x310/0x310 [35103.978857] [] kthread+0xc6/0xd0 [35103.978877] [] ? kthread_freezable_should_stop+0x70/0x70 [35103.978905] [] ret_from_fork+0x7c/0xb0 [35103.978927] [] ? kthread_freezable_should_stop+0x70/0x70 [35103.978952] Code: c1 49 63 46 58 48 89 c2 48 c1 e2 05 48 8d 54 10 65 49 63 46 2c 48 89 c6 48 c1 e6 05 48 8d 74 30 65 e8 fa 9b 04 00 e9 ad fe ff ff <0f> 0b 0f 0b 90 55 48 89 e5 41 56 49 89 d6 41 55 49 89 f5 41 54 [35103.979149] RIP [] __tree_mod_log_rewind+0x25b/0x260 [35103.979178] RSP [35103.991131] ---[ end trace 3a477bfb86e5ca4a ]---