Re: kernel BUG at fs/btrfs/ctree.c:1144!

2013-04-10 Thread Ahmet Inan
> The real problem, however, is not caused by that commit but by a tree mod log
> bug. I expect that fs/btrfs/ctree.c:1144 is this BUG_ON in your kernel from
> __tree_mod_log_rewind (my line numbers don't match):
>
> 1138 switch (tm->op) {
> 1139 case MOD_LOG_KEY_REMOVE_WHILE_FREEING:
> 1140 BUG_ON(tm->slot < n);
>
> I've got a fix for that I'm currently testing, expect it on the list soon.
Yeah, that's the line.

> In the meantime, I recommend not defragmenting your filesystem.
OK, I have already disabled it and rebooted.
I will come back to you if I still get that problem.

> As a general remark, please send your stack traces inline, not as an
> attachment, if possible.
Google mail always mangles lines longer than 80 characters in plain text mode,
and non-plain text is not what you want.
Maybe I should go back to using mutt.

Thank you,

Ahmet
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: kernel BUG at fs/btrfs/ctree.c:1144!

2013-04-10 Thread Jan Schmidt

On Wed, April 10, 2013 at 09:58 (+0200), Ahmet Inan wrote:
> I have had this problem since 3.8.5 + for-linus (from that time).
> I have just tried 3.8.6 + for-linus with "git merge -X theirs
> btrfs/for-linus" but still get the same problem.
> Going back to 3.7.4 + for-linus (from that time) doesn't give me the problem.

The stack trace you attached shows a function from the snapshot-aware defrag
patches (commit 38c227d8), which were added in 3.8.

The real problem, however, is not caused by that commit but by a tree mod log
bug. I expect that fs/btrfs/ctree.c:1144 is this BUG_ON in your kernel from
__tree_mod_log_rewind (my line numbers don't match):

1138 switch (tm->op) {
1139 case MOD_LOG_KEY_REMOVE_WHILE_FREEING:
1140 BUG_ON(tm->slot < n);

I've got a fix for that I'm currently testing, expect it on the list soon.
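
To illustrate what that BUG_ON guards against, here is a minimal user-space
sketch. It is not the kernel code: the toy_* types and toy_rewind() are
hypothetical stand-ins, and the reading of the check (a key logged as removed
while its buffer was being freed must be restored at a slot that is not below
the current item count) is an assumption drawn from the snippet above.

```c
#include <assert.h>
#include <stddef.h>

#define TOY_MAX_KEYS 16

/* Hypothetical, simplified stand-ins for the tree mod log types. */
enum toy_op { TOY_KEY_REMOVE, TOY_KEY_REMOVE_WHILE_FREEING };

struct toy_elem {
    enum toy_op op;
    int slot;   /* slot the key occupied before it was removed */
    int key;    /* saved key, so the removal can be undone */
};

struct toy_node {
    int nritems;
    int keys[TOY_MAX_KEYS];
};

/*
 * Undo logged removals, newest entry first.
 * Returns 0 on success, or -1 where the kernel would hit the BUG_ON
 * (or where the toy node would overflow).
 */
int toy_rewind(struct toy_node *node, const struct toy_elem *log, size_t len)
{
    int n = node->nritems;

    for (size_t i = len; i-- > 0; ) {
        const struct toy_elem *tm = &log[i];

        if (tm->slot < 0 || tm->slot >= TOY_MAX_KEYS || n >= TOY_MAX_KEYS)
            return -1;
        /* Mirrors BUG_ON(tm->slot < n) from the snippet above. */
        if (tm->op == TOY_KEY_REMOVE_WHILE_FREEING && tm->slot < n)
            return -1;

        node->keys[tm->slot] = tm->key;   /* restore the removed key */
        n++;
    }
    node->nritems = n;
    return 0;
}
```

In this toy, rewinding an empty node through removals logged for slots 2, 1, 0
succeeds, while a "while freeing" entry for slot 1 on a node that already holds
two items is rejected.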

> This is a production NFS server with 2x2TB raid1, so I can't reboot it that
> often.
> I have seen this same problem on another system (also raid1) once, but
> rebooting helped, and there have been no problems since.
> Both systems use autodefrag, maybe that sometimes triggers it?
> I really would like to help, so I can stay on the latest kernels.
> What should I do?

In the meantime, I recommend not defragmenting your filesystem.

As a general remark, please send your stack traces inline, not as an
attachment, if possible.

Thanks,
-Jan


kernel BUG at fs/btrfs/ctree.c:1144!

2013-04-10 Thread Ahmet Inan
I have had this problem since 3.8.5 + for-linus (from that time).
I have just tried 3.8.6 + for-linus with "git merge -X theirs
btrfs/for-linus" but still get the same problem.
Going back to 3.7.4 + for-linus (from that time) doesn't give me the problem.
This is a production NFS server with 2x2TB raid1, so I can't reboot it that
often.
I have seen this same problem on another system (also raid1) once, but
rebooting helped, and there have been no problems since.
Both systems use autodefrag, maybe that sometimes triggers it?
I really would like to help, so I can stay on the latest kernels.
What should I do?

Ahmet
[35103.977604] ------------[ cut here ]------------
[35103.977633] kernel BUG at fs/btrfs/ctree.c:1144!
[35103.977653] invalid opcode:  [#1] SMP 
[35103.977674] Modules linked in: dm_mod fuse snd_hda_codec_analog snd_hda_intel snd_hda_codec snd_pcm snd_page_alloc snd_timer snd soundcore ppdev iTCO_wdt lpc_ich mfd_core parport_pc parport kvm
[35103.91] CPU 1 
[35103.977783] Pid: 2989, comm: btrfs-endio-wri Not tainted 3.8.6-ainan #119 Dell Inc. OptiPlex 755 /0GM819
[35103.977825] RIP: 0010:[]  [] __tree_mod_log_rewind+0x25b/0x260
[35103.977864] RSP: 0018:880107a8f7b8  EFLAGS: 00010297
[35103.977885] RAX:  RBX: 88006a9094d0 RCX: 8801126124c0
[35103.977912] RDX: 0b6398ee RSI: 066c RDI: 880182953a40
[35103.977939] RBP: 880107a8f7e8 R08: 1000 R09: 880107a8f760
[35103.977965] R10: 0016ed05 R11:  R12: 880222fb76c0
[35103.977993] R13: 002f R14: 8801126124c0 R15: 0cd2
[35103.978019] FS:  () GS:880237c4() knlGS:
[35103.978049] CS:  0010 DS:  ES:  CR0: 8005003b
[35103.978071] CR2: 00b89308 CR3: 0001b5b2f000 CR4: 07e0
[35103.978097] DR0:  DR1:  DR2: 
[35103.978123] DR3:  DR6: 0ff0 DR7: 0400
[35103.978150] Process btrfs-endio-wri (pid: 2989, threadinfo 880107a8e000, task 88014516c0c0)
[35103.978182] Stack:
[35103.978192]  88006a909a44 0002 88006a9094d0 88006a909a10
[35103.978227]  8801355bd400 1600 880107a8f888 812b6266
[35103.978261]  880107a8f828 880222fb76c0 88022dea8000 00010cd2
[35103.978295] Call Trace:
[35103.978309]  [] btrfs_search_old_slot+0x646/0x990
[35103.978336]  [] __resolve_indirect_refs+0x16f/0x610
[35103.978363]  [] ? free_extent_buffer+0x58/0xb0
[35103.978387]  [] ? __add_missing_keys.clone.13+0x89/0x130
[35103.978415]  [] ? free_extent_buffer+0x58/0xb0
[35103.978439]  [] find_parent_nodes+0x586/0x9a0
[35103.978462]  [] ? btrfs_set_path_blocking+0x31/0x70
[35103.978488]  [] btrfs_find_all_roots+0xa1/0x100
[35103.978513]  [] ? record_extent_backrefs+0xf0/0xf0
[35103.978538]  [] iterate_extent_inodes+0x183/0x2d0
[35103.978562]  [] ? btrfs_get_token_64+0x5f/0xf0
[35103.978587]  [] ? free_extent_buffer+0x58/0xb0
[35103.978612]  [] iterate_inodes_from_logical+0xa7/0xb0
[35103.978638]  [] ? record_extent_backrefs+0xf0/0xf0
[35103.978664]  [] record_extent_backrefs+0x78/0xf0
[35103.978689]  [] btrfs_finish_ordered_io+0x186/0xa70
[35103.978715]  [] ? try_to_del_timer_sync+0x76/0xa0
[35103.978740]  [] ? cascade+0xf0/0x190
[35103.978762]  [] finish_ordered_fn+0x10/0x20
[35103.978785]  [] worker_loop+0xb8/0x540
[35103.978807]  [] ? btrfs_queue_worker+0x310/0x310
[35103.978832]  [] ? btrfs_queue_worker+0x310/0x310
[35103.978857]  [] kthread+0xc6/0xd0
[35103.978877]  [] ? kthread_freezable_should_stop+0x70/0x70
[35103.978905]  [] ret_from_fork+0x7c/0xb0
[35103.978927]  [] ? kthread_freezable_should_stop+0x70/0x70
[35103.978952] Code: c1 49 63 46 58 48 89 c2 48 c1 e2 05 48 8d 54 10 65 49 63 46 2c 48 89 c6 48 c1 e6 05 48 8d 74 30 65 e8 fa 9b 04 00 e9 ad fe ff ff <0f> 0b 0f 0b 90 55 48 89 e5 41 56 49 89 d6 41 55 49 89 f5 41 54
[35103.979149] RIP  [] __tree_mod_log_rewind+0x25b/0x260
[35103.979178]  RSP 
[35103.991131] ---[ end trace 3a477bfb86e5ca4a ]---