RE: 6e543d5780e fixed a boot hang
>-Original Message- >From: Fengguang Wu [mailto:fengguang...@intel.com] >Sent: 2013年10月9日 22:12 >To: Lisa Du >Cc: KOSAKI Motohiro; linux...@kvack.org; linux-kernel@vger.kernel.org >Subject: 6e543d5780e fixed a boot hang > >Greetings, > >FYI, this commit seem to fix a boot hang problem here. > >commit 6e543d5780e36ff5ee56c44d7e2e30db3457a7ed >Author: Lisa Du >Date: Wed Sep 11 14:22:36 2013 -0700 > >mm: vmscan: fix do_try_to_free_pages() livelock > > >[1.394871] pci :00:02.0: Boot video device >[1.395883] PCI: CLS 0 bytes, default 64 > >In parent commit, it will hang right here. > >With this commit, it will continue to emit the below OOM messages (which is >not a surprise to me because the boot test runs in a small >memory KVM and the kconfig builds in lots of drivers). I think you may meet the same issue as mine. Direct reclaim loop forever with zone->all_unreclaimable = 0(as kswapd sleeps forever). And at the boot stage, no one detect and terminate it, so you see the boot hang. After apply this patch, you see there's oom-killer invoked as direct reclaim would break when zone was unreclaimable. > >[1.631892] swapper/0 invoked oom-killer: gfp_mask=0x2000d0, > order=1, oom_score_adj=0 >[1.633549] swapper/0 cpuset=/ mems_allowed=0 >[1.634443] CPU: 1 PID: 1 Comm: swapper/0 Not tainted > 3.12.0-rc4-00019-g8b5ede6 #126 >[1.635982] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011 >[1.637088] 0002 88001dd41b28 82c8d78f > 88001ef7c040 >[1.638955] 88001dd41ba8 82c8395f 83c54680 > 88001dd41b60 >[1.640830] 810f3f06 1eb4 0246 > 88001dd41b98 >[1.642687] Call Trace: >[1.643313] [] dump_stack+0x54/0x74 >[1.644331] [] dump_header.isra.10+0x7a/0x1ba >[1.645443] [] ? > lock_release_holdtime.part.27+0x4c/0x50 >[1.646685] [] ? lock_release+0x189/0x1d1 >[1.647744] [] out_of_memory+0x39e/0x3ee >[1.648882] [] __alloc_pages_nodemask+0x668/0x7de >[1.650385] [] kmem_getpages+0x75/0x16c >[1.651429] [] fallback_alloc+0x12c/0x1ea >[1.652528] [] ? trace_hardirqs_off+0xd/0xf >[1.653627] [] cache_alloc_node+0x14a/0x159 >[1.654783] [] ? dma_debug_init+0x1ef/0x29a >[1.655928] [] kmem_cache_alloc_trace+0x83/0x11a >[1.657108] [] dma_debug_init+0x1ef/0x29a >[1.658182] [] pci_iommu_init+0x16/0x52 >[1.659263] [] ? iommu_setup+0x27d/0x27d >[1.660342] [] do_one_initcall+0x93/0x137 >[1.661415] [] ? param_set_charp+0x92/0xd8 >[1.662503] [] ? parse_args+0x189/0x247 >[1.663555] [] kernel_init_freeable+0x15e/0x1df >[1.664724] [] ? do_early_param+0x88/0x88 >[1.665814] [] ? rest_init+0xdb/0xdb >[1.666824] [] kernel_init+0xe/0xdb >[1.667824] [] ret_from_fork+0x7c/0xb0 >[1.668911] [] ? rest_init+0xdb/0xdb >[1.669925] Mem-Info: >[1.670508] Node 0 DMA per-cpu: > >Thanks, >Fengguang N�Р骒r��yb�X�肚�v�^�)藓{.n�+�伐�{��赙zXФ�≤�}��财�z�:+v�����赙zZ+��+zf"�h���~i���z��wア�?�ㄨ��&�)撷f��^j谦y�m��@A�a囤� 0鹅h���i
RE: 6e543d5780e fixed a boot hang
-Original Message- From: Fengguang Wu [mailto:fengguang...@intel.com] Sent: 2013年10月9日 22:12 To: Lisa Du Cc: KOSAKI Motohiro; linux...@kvack.org; linux-kernel@vger.kernel.org Subject: 6e543d5780e fixed a boot hang Greetings, FYI, this commit seem to fix a boot hang problem here. commit 6e543d5780e36ff5ee56c44d7e2e30db3457a7ed Author: Lisa Du c...@marvell.com Date: Wed Sep 11 14:22:36 2013 -0700 mm: vmscan: fix do_try_to_free_pages() livelock [1.394871] pci :00:02.0: Boot video device [1.395883] PCI: CLS 0 bytes, default 64 In parent commit, it will hang right here. With this commit, it will continue to emit the below OOM messages (which is not a surprise to me because the boot test runs in a small memory KVM and the kconfig builds in lots of drivers). I think you may meet the same issue as mine. Direct reclaim loop forever with zone-all_unreclaimable = 0(as kswapd sleeps forever). And at the boot stage, no one detect and terminate it, so you see the boot hang. After apply this patch, you see there's oom-killer invoked as direct reclaim would break when zone was unreclaimable. [1.631892] swapper/0 invoked oom-killer: gfp_mask=0x2000d0, order=1, oom_score_adj=0 [1.633549] swapper/0 cpuset=/ mems_allowed=0 [1.634443] CPU: 1 PID: 1 Comm: swapper/0 Not tainted 3.12.0-rc4-00019-g8b5ede6 #126 [1.635982] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011 [1.637088] 0002 88001dd41b28 82c8d78f 88001ef7c040 [1.638955] 88001dd41ba8 82c8395f 83c54680 88001dd41b60 [1.640830] 810f3f06 1eb4 0246 88001dd41b98 [1.642687] Call Trace: [1.643313] [82c8d78f] dump_stack+0x54/0x74 [1.644331] [82c8395f] dump_header.isra.10+0x7a/0x1ba [1.645443] [810f3f06] ? lock_release_holdtime.part.27+0x4c/0x50 [1.646685] [810f795a] ? lock_release+0x189/0x1d1 [1.647744] [811530a8] out_of_memory+0x39e/0x3ee [1.648882] [811579f5] __alloc_pages_nodemask+0x668/0x7de [1.650385] [8118eb53] kmem_getpages+0x75/0x16c [1.651429] [81190d20] fallback_alloc+0x12c/0x1ea [1.652528] [810f38e8] ? trace_hardirqs_off+0xd/0xf [1.653627] [81190be5] cache_alloc_node+0x14a/0x159 [1.654783] [817059fb] ? dma_debug_init+0x1ef/0x29a [1.655928] [8119162c] kmem_cache_alloc_trace+0x83/0x11a [1.657108] [817059fb] dma_debug_init+0x1ef/0x29a [1.658182] [841ac38b] pci_iommu_init+0x16/0x52 [1.659263] [841ac375] ? iommu_setup+0x27d/0x27d [1.660342] [810020d2] do_one_initcall+0x93/0x137 [1.661415] [810bd300] ? param_set_charp+0x92/0xd8 [1.662503] [810bd52e] ? parse_args+0x189/0x247 [1.663555] [8419fed1] kernel_init_freeable+0x15e/0x1df [1.664724] [8419f729] ? do_early_param+0x88/0x88 [1.665814] [82c77867] ? rest_init+0xdb/0xdb [1.666824] [82c77875] kernel_init+0xe/0xdb [1.667824] [82cbc57c] ret_from_fork+0x7c/0xb0 [1.668911] [82c77867] ? rest_init+0xdb/0xdb [1.669925] Mem-Info: [1.670508] Node 0 DMA per-cpu: Thanks, Fengguang N�Р骒r��yb�X�肚�v�^�)藓{.n�+�伐�{��赙zXФ�≤�}��财�z�j:+v�����赙zZ+��+zf"�h���~i���z��wア�?�ㄨ���)撷f��^j谦y�m��@A�a囤� 0鹅h���i