Am 07.10.2016 um 09:17 schrieb Wang Xiaoguang: > Hi, > > On 10/07/2016 03:03 PM, Stefan Priebe - Profihost AG wrote: >> Dear Wang, >> >> can't use v4.8.0 as i always get OOMs and total machine crashes. >> >> Complete traces with your patch and some more btrfs patches applied (in >> the hope in fixes the OOM but it did not): >> http://pastebin.com/raw/6vmRSDm1 > I didn't see any such OOMs... > Can you try holger's tree with my patches.
Dear wang already tried that. Doesn't help. It also happens only on two out of three servers. It starts killing low men processes after time. But I've no idea where all those memory is consumed. (Have 64gb) Greets, Stefan > Regards, > Xiaoguang Wang >> >> Greets, >> Stefan >> Am 06.10.2016 um 05:04 schrieb Wang Xiaoguang: >>> Hi, >>> >>> On 09/29/2016 03:27 PM, Stefan Priebe - Profihost AG wrote: >>>> Am 29.09.2016 um 09:13 schrieb Wang Xiaoguang: >>>>>>> I found that compress sometime report ENOSPC error even in 4.8-rc8, >>>>>>> currently >>>>>> I cannot confirm that as i do not have anough space to test this >>>>>> without >>>>>> compression ;-( But yes i've compression enabled. >>>>> I might not get you, my poor english :) >>>>> You mean that you only get ENOSPC error when compression is enabled? >>>>> >>>>> And when compression is not enabled, you do not get ENOSPC error? >>>> I can't tell you. I cannot test with compression not enabled. I do not >>>> have anough free space on this disk. >>> I had just sent two patches to fix false enospc error for compression, >>> please have a try, they fix false enospc error in my test environment. >>> btrfs: fix false enospc for compression >>> btrfs: improve inode's outstanding_extents computation >>> >>> I apply these two patchs in linux upstream tree, the latest commit >>> is 41844e36206be90cd4d962ea49b0abc3612a99d0. >>> >>> Regards, >>> Xiaoguang Wang >>> >>>>>>> I'm trying to fix it. >>>>>> That sounds good but do you also get the >>>>>> BTRFS: space_info 4 has 18446742286429913088 free, is not full >>>>>> >>>>>> kernel messages on umount? if not you might have found another >>>>>> problem. >>>>> Yes, I seem similar messages, you can paste you whole dmesg info here. >>>> ------------[ cut here ]------------ >>>> WARNING: CPU: 2 PID: 5187 at fs/btrfs/extent-tree.c:5790 >>>> btrfs_free_block_groups+0x346/0x430 [btrfs]() >>>> Modules linked in: netconsole xt_multiport iptable_filter ip_tables >>>> x_tables 8021q garp bonding x86_pkg_temp_thermal coretemp kvm_intel kvm >>>> irqbypass sb_edac crc32_pclmul edac_core i2c_i801 i40e(O) vxlan >>>> ip6_udp_tunnel udp_tunnel shpchp ipmi_si ipmi_msghandler button loop >>>> btrfs dm_mod raid10 raid0 multipath linear raid456 async_raid6_recov >>>> async_memcpy async_pq async_xor async_tx xor raid6_pq igb i2c_algo_bit >>>> i2c_core usbhid raid1 md_mod xhci_pci sg ehci_pci xhci_hcd ehci_hcd >>>> sd_mod ahci usbcore ptp libahci usb_common pps_core aacraid >>>> CPU: 2 PID: 5187 Comm: umount Tainted: G O 4.4.22+63-ph #1 >>>> Hardware name: Supermicro Super Server/X10SRi-F, BIOS 2.0 12/17/2015 >>>> 0000000000000000 ffff880fda777d00 ffffffff813b69c3 0000000000000000 >>>> ffffffffc067a099 ffff880fda777d38 ffffffff810821c6 0000000000000000 >>>> ffff880074bf0a00 ffff88103c10c088 ffff88103c10c000 ffff88103c10c098 >>>> Call Trace: >>>> [<ffffffff813b69c3>] dump_stack+0x63/0x90 >>>> [<ffffffff810821c6>] warn_slowpath_common+0x86/0xc0 >>>> [<ffffffff810822ba>] warn_slowpath_null+0x1a/0x20 >>>> [<ffffffffc05de9a6>] btrfs_free_block_groups+0x346/0x430 [btrfs] >>>> [<ffffffffc05f0bad>] close_ctree+0x15d/0x330 [btrfs] >>>> [<ffffffffc05bf409>] btrfs_put_super+0x19/0x20 [btrfs] >>>> [<ffffffff811c85af>] generic_shutdown_super+0x6f/0x100 >>>> [<ffffffff811c8652>] kill_anon_super+0x12/0x20 >>>> [<ffffffffc05c0376>] btrfs_kill_super+0x16/0xa0 [btrfs] >>>> [<ffffffff811c7ff3>] deactivate_locked_super+0x43/0x70 >>>> [<ffffffff811c807c>] deactivate_super+0x5c/0x60 >>>> [<ffffffff811e3b5f>] cleanup_mnt+0x3f/0x90 >>>> [<ffffffff811e3bf2>] __cleanup_mnt+0x12/0x20 >>>> [<ffffffff8109dad1>] task_work_run+0x81/0xa0 >>>> [<ffffffff810021c0>] exit_to_usermode_loop+0xb0/0xc0 >>>> [<ffffffff81002b44>] syscall_return_slowpath+0xd4/0x130 >>>> [<ffffffff8169dc0c>] int_ret_from_sys_call+0x25/0x8f >>>> ---[ end trace cee6ace13018e13e ]--- >>>> ------------[ cut here ]------------ >>>> WARNING: CPU: 2 PID: 5187 at fs/btrfs/extent-tree.c:5791 >>>> btrfs_free_block_groups+0x365/0x430 [btrfs]() >>>> Modules linked in: netconsole xt_multiport iptable_filter ip_tables >>>> x_tables 8021q garp bonding x86_pkg_temp_thermal coretemp kvm_intel kvm >>>> irqbypass sb_edac crc32_pclmul edac_core i2c_i801 i40e(O) vxlan >>>> ip6_udp_tunnel udp_tunnel shpchp ipmi_si ipmi_msghandler button loop >>>> btrfs dm_mod raid10 raid0 multipath linear raid456 async_raid6_recov >>>> async_memcpy async_pq async_xor async_tx xor raid6_pq igb i2c_algo_bit >>>> i2c_core usbhid raid1 md_mod xhci_pci sg ehci_pci xhci_hcd ehci_hcd >>>> sd_mod ahci usbcore ptp libahci usb_common pps_core aacraid >>>> CPU: 2 PID: 5187 Comm: umount Tainted: G W O 4.4.22+63-ph #1 >>>> Hardware name: Supermicro Super Server/X10SRi-F, BIOS 2.0 12/17/2015 >>>> 0000000000000000 ffff880fda777d00 ffffffff813b69c3 0000000000000000 >>>> ffffffffc067a099 ffff880fda777d38 ffffffff810821c6 0000000000000000 >>>> ffff880074bf0a00 ffff88103c10c088 ffff88103c10c000 ffff88103c10c098 >>>> Call Trace: >>>> [<ffffffff813b69c3>] dump_stack+0x63/0x90 >>>> [<ffffffff810821c6>] warn_slowpath_common+0x86/0xc0 >>>> [<ffffffff810822ba>] warn_slowpath_null+0x1a/0x20 >>>> [<ffffffffc05de9c5>] btrfs_free_block_groups+0x365/0x430 [btrfs] >>>> [<ffffffffc05f0bad>] close_ctree+0x15d/0x330 [btrfs] >>>> [<ffffffffc05bf409>] btrfs_put_super+0x19/0x20 [btrfs] >>>> [<ffffffff811c85af>] generic_shutdown_super+0x6f/0x100 >>>> [<ffffffff811c8652>] kill_anon_super+0x12/0x20 >>>> [<ffffffffc05c0376>] btrfs_kill_super+0x16/0xa0 [btrfs] >>>> [<ffffffff811c7ff3>] deactivate_locked_super+0x43/0x70 >>>> [<ffffffff811c807c>] deactivate_super+0x5c/0x60 >>>> [<ffffffff811e3b5f>] cleanup_mnt+0x3f/0x90 >>>> [<ffffffff811e3bf2>] __cleanup_mnt+0x12/0x20 >>>> [<ffffffff8109dad1>] task_work_run+0x81/0xa0 >>>> [<ffffffff810021c0>] exit_to_usermode_loop+0xb0/0xc0 >>>> [<ffffffff81002b44>] syscall_return_slowpath+0xd4/0x130 >>>> [<ffffffff8169dc0c>] int_ret_from_sys_call+0x25/0x8f >>>> ---[ end trace cee6ace13018e13f ]--- >>>> ------------[ cut here ]------------ >>>> WARNING: CPU: 2 PID: 5187 at fs/btrfs/extent-tree.c:10151 >>>> btrfs_free_block_groups+0x291/0x430 [btrfs]() >>>> Modules linked in: netconsole xt_multiport iptable_filter ip_tables >>>> x_tables 8021q garp bonding x86_pkg_temp_thermal coretemp kvm_intel kvm >>>> irqbypass sb_edac crc32_pclmul edac_core i2c_i801 i40e(O) vxlan >>>> ip6_udp_tunnel udp_tunnel shpchp ipmi_si ipmi_msghandler button loop >>>> btrfs dm_mod raid10 raid0 multipath linear raid456 async_raid6_recov >>>> async_memcpy async_pq async_xor async_tx xor raid6_pq igb i2c_algo_bit >>>> i2c_core usbhid raid1 md_mod xhci_pci sg ehci_pci xhci_hcd ehci_hcd >>>> sd_mod ahci usbcore ptp libahci usb_common pps_core aacraid >>>> CPU: 2 PID: 5187 Comm: umount Tainted: G W O 4.4.22+63-ph #1 >>>> Hardware name: Supermicro Super Server/X10SRi-F, BIOS 2.0 12/17/2015 >>>> 0000000000000000 ffff880fda777d00 ffffffff813b69c3 0000000000000000 >>>> ffffffffc067a099 ffff880fda777d38 ffffffff810821c6 ffff880074bf0000 >>>> ffff880074bf0088 ffff880074bf0df8 ffff88103c10c000 0000000000000000 >>>> Call Trace: >>>> [<ffffffff813b69c3>] dump_stack+0x63/0x90 >>>> [<ffffffff810821c6>] warn_slowpath_common+0x86/0xc0 >>>> [<ffffffff810822ba>] warn_slowpath_null+0x1a/0x20 >>>> [<ffffffffc05de8f1>] btrfs_free_block_groups+0x291/0x430 [btrfs] >>>> [<ffffffffc05f0bad>] close_ctree+0x15d/0x330 [btrfs] >>>> [<ffffffffc05bf409>] btrfs_put_super+0x19/0x20 [btrfs] >>>> [<ffffffff811c85af>] generic_shutdown_super+0x6f/0x100 >>>> [<ffffffff811c8652>] kill_anon_super+0x12/0x20 >>>> [<ffffffffc05c0376>] btrfs_kill_super+0x16/0xa0 [btrfs] >>>> [<ffffffff811c7ff3>] deactivate_locked_super+0x43/0x70 >>>> [<ffffffff811c807c>] deactivate_super+0x5c/0x60 >>>> [<ffffffff811e3b5f>] cleanup_mnt+0x3f/0x90 >>>> [<ffffffff811e3bf2>] __cleanup_mnt+0x12/0x20 >>>> [<ffffffff8109dad1>] task_work_run+0x81/0xa0 >>>> [<ffffffff810021c0>] exit_to_usermode_loop+0xb0/0xc0 >>>> [<ffffffff81002b44>] syscall_return_slowpath+0xd4/0x130 >>>> [<ffffffff8169dc0c>] int_ret_from_sys_call+0x25/0x8f >>>> ---[ end trace cee6ace13018e140 ]--- >>>> BTRFS: space_info 4 has 18446742182612910080 free, is not full >>>> BTRFS: space_info total=82678120448, used=64784695296, pinned=0, >>>> reserved=0, may_use=1908990066688, readonly=0 >>>> >>>> Greets, >>>> Stefan >>>> >>>>> Regards, >>>>> Xiaoguang Wang >>>>> >>>>>> Stefan >>>>>> >>>>>>> Regards, >>>>>>> Xiaoguang Wang >>>>>>> >>>>>>>> I tried hard to reproduce it but i can't get it to reproduce with a >>>>>>>> test >>>>>>>> script. Any ideas? >>>>>>>> >>>>>>>> Stefan >>>>>>>> >>>>>>>>> Reagrds, >>>>>>>>> Xiaoguang Wang >>>>>>>>> >>>>>>>>>> Greets, >>>>>>>>>> Stefan >>>>>>>>>> >>>>>>>>>>> Regards, >>>>>>>>>>> Xiaoguang Wang >>>>>>>>>>>> Greets, >>>>>>>>>>>> Stefan >>>>>>>>>>>> -- >>>>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe >>>>>>>>>>>> linux-btrfs" in >>>>>>>>>>>> the body of a message to majord...@vger.kernel.org >>>>>>>>>>>> More majordomo info at >>>>>>>>>>>> http://vger.kernel.org/majordomo-info.html >>>>>>>>>>>> >>>>>>>>>>>> >>>>> >>> >>> >> > > > -- To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html