Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-12-05 Thread Andy Lutomirski
On Sun, Nov 18, 2012 at 2:55 PM, David Rientjes wrote: > On Sat, 17 Nov 2012, Marc Duponcheel wrote: > >> # echo always >/sys/kernel/mm/transparent_hugepage/enabled >> # while [ 1 ] >> do >>sleep 10 >>date >>echo = vmstat >>egrep "(thp|compact)" /proc/vmstat >>echo =

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-12-05 Thread Andy Lutomirski
On Sun, Nov 18, 2012 at 2:55 PM, David Rientjes rient...@google.com wrote: On Sat, 17 Nov 2012, Marc Duponcheel wrote: # echo always >/sys/kernel/mm/transparent_hugepage/enabled # while [ 1 ] do sleep 10 date echo = vmstat egrep "(thp|compact)" /proc/vmstat echo =

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-18 Thread David Rientjes
On Sat, 17 Nov 2012, Marc Duponcheel wrote: > # echo always >/sys/kernel/mm/transparent_hugepage/enabled > # while [ 1 ] > do >sleep 10 >date >echo = vmstat >egrep "(thp|compact)" /proc/vmstat >echo = khugepaged stack >cat /proc/501/stack > done > /tmp/49361. > #
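
For anyone wanting to collect the same data, the quoted monitoring loop can be wrapped in a small function. This is only a sketch, not Marc's exact script: the function name `snapshot` and its two optional arguments are made up for illustration, and looking up khugepaged with `pgrep` (instead of hardcoding PID 501 as in the original) is my assumption.

```shell
#!/bin/sh
# snapshot: print one sample of THP/compaction counters and a kernel stack.
# Hypothetical helper; the arguments exist only to make it easy to try out.
#   $1 = vmstat-style file (default: /proc/vmstat)
#   $2 = stack file (default: stack of the first khugepaged thread found)
snapshot() {
    vmstat_file=${1:-/proc/vmstat}
    # The default is only evaluated when no second argument is given.
    stack_file=${2:-/proc/$(pgrep -x khugepaged | head -n 1)/stack}
    date
    echo "= vmstat"
    grep -E "(thp|compact)" "$vmstat_file"
    echo "= khugepaged stack"
    # Reading /proc/<pid>/stack normally requires root.
    cat "$stack_file" 2>/dev/null || echo "(stack unavailable)"
}
```

Running `while true; do sleep 10; snapshot; done` with output redirected to a log file of your choice should give roughly the same samples as the loop in the report.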

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-18 Thread David Rientjes
On Sat, 17 Nov 2012, Marc Duponcheel wrote: # echo always >/sys/kernel/mm/transparent_hugepage/enabled # while [ 1 ] do sleep 10 date echo = vmstat egrep "(thp|compact)" /proc/vmstat echo = khugepaged stack cat /proc/501/stack done > /tmp/49361. # emerge icedtea

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-16 Thread Marc Duponcheel
Hi David, others Results seem OK recap: I have 2 6core 64bit opterons and I make -j13 I do # echo always >/sys/kernel/mm/transparent_hugepage/enabled # while [ 1 ] do sleep 10 date echo = vmstat egrep "(thp|compact)" /proc/vmstat echo = khugepaged stack cat

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-16 Thread Marc Duponcheel
Hi David, others Results seem OK recap: I have 2 6core 64bit opterons and I make -j13 I do # echo always >/sys/kernel/mm/transparent_hugepage/enabled # while [ 1 ] do sleep 10 date echo = vmstat egrep "(thp|compact)" /proc/vmstat echo = khugepaged stack cat /proc/501/stack

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread Marc Duponcheel
Hi David Thanks for the changeset I will test 3.6.6 without/with this weekend. Have a nice day On 2012 Nov 14, #David Rientjes wrote: > On Wed, 14 Nov 2012, Marc Duponcheel wrote: > > > Hi all > > > > If someone can provide the patches (or learn me how to get them with > > git (I apologise to

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread David Rientjes
On Wed, 14 Nov 2012, Marc Duponcheel wrote: > Hi all > > If someone can provide the patches (or learn me how to get them with > git (I apologise to not be git savvy)) then, this weekend, I can apply > them to 3.6.6 and compare before/after to check if they fix #49361. > I've backported all

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread Marc Duponcheel
Hi all If someone can provide the patches (or learn me how to get them with git (I apologise to not be git savvy)) then, this weekend, I can apply them to 3.6.6 and compare before/after to check if they fix #49361. Thanks On 2012 Nov 14, Mel Gorman wrote: > On Tue, Nov 13, 2012 at 03:41:02PM

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread Marc Duponcheel
On 2012 Nov 13, David Rientjes wrote: > On Wed, 14 Nov 2012, Marc Duponcheel wrote: > > > Hi all, please let me know if there are patches you want me to try. > > > > FWIW time did not stand still and I run 3.6.6 now. > > Hmm, interesting since there are no core VM changes between 3.6.2,

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread Mel Gorman
On Tue, Nov 13, 2012 at 03:41:02PM -0800, David Rientjes wrote: > On Tue, 13 Nov 2012, Andy Lutomirski wrote: > > > It just happened again. > > > > $ grep -E "compact_|thp_" /proc/vmstat > > compact_blocks_moved 8332448774 > > compact_pages_moved 21831286 > > compact_pagemigrate_failed 211260 >

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread Mel Gorman
On Tue, Nov 13, 2012 at 03:41:02PM -0800, David Rientjes wrote: On Tue, 13 Nov 2012, Andy Lutomirski wrote: It just happened again. $ grep -E "compact_|thp_" /proc/vmstat compact_blocks_moved 8332448774 compact_pages_moved 21831286 compact_pagemigrate_failed 211260 compact_stall

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread Marc Duponcheel
On 2012 Nov 13, David Rientjes wrote: On Wed, 14 Nov 2012, Marc Duponcheel wrote: Hi all, please let me know if there are patches you want me to try. FWIW time did not stand still and I run 3.6.6 now. Hmm, interesting since there are no core VM changes between 3.6.2, the kernel

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread Marc Duponcheel
Hi all If someone can provide the patches (or learn me how to get them with git (I apologise to not be git savvy)) then, this weekend, I can apply them to 3.6.6 and compare before/after to check if they fix #49361. Thanks On 2012 Nov 14, Mel Gorman wrote: On Tue, Nov 13, 2012 at 03:41:02PM

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread David Rientjes
On Wed, 14 Nov 2012, Marc Duponcheel wrote: Hi all If someone can provide the patches (or learn me how to get them with git (I apologise to not be git savvy)) then, this weekend, I can apply them to 3.6.6 and compare before/after to check if they fix #49361. I've backported all the

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-14 Thread Marc Duponcheel
Hi David Thanks for the changeset I will test 3.6.6 without/with this weekend. Have a nice day On 2012 Nov 14, #David Rientjes wrote: On Wed, 14 Nov 2012, Marc Duponcheel wrote: Hi all If someone can provide the patches (or learn me how to get them with git (I apologise to not

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread David Rientjes
On Wed, 14 Nov 2012, Marc Duponcheel wrote: > Hi all, please let me know if there are patches you want me to try. > > FWIW time did not stand still and I run 3.6.6 now. > Hmm, interesting since there are no core VM changes between 3.6.2, the kernel you ran into problems with, and 3.6.6.

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread Marc Duponcheel
Hi all, please let me know if there are patches you want me to try. FWIW time did not stand still and I run 3.6.6 now. On 2012 Nov 13, #David Rientjes wrote: > On Tue, 13 Nov 2012, Andy Lutomirski wrote: > > > >> $ grep -E "compact_|thp_" /proc/vmstat > > >> compact_blocks_moved

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread David Rientjes
On Tue, 13 Nov 2012, Andy Lutomirski wrote: > >> $ grep -E "compact_|thp_" /proc/vmstat > >> compact_blocks_moved 8332448774 > >> compact_pages_moved 21831286 > >> compact_pagemigrate_failed 211260 > >> compact_stall 13484 > >> compact_fail 6717 > >> compact_success 6755 > >> thp_fault_alloc

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread Andy Lutomirski
On Tue, Nov 13, 2012 at 3:41 PM, David Rientjes wrote: > On Tue, 13 Nov 2012, Andy Lutomirski wrote: > >> It just happened again. >> >> $ grep -E "compact_|thp_" /proc/vmstat >> compact_blocks_moved 8332448774 >> compact_pages_moved 21831286 >> compact_pagemigrate_failed 211260 >> compact_stall

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread David Rientjes
On Tue, 13 Nov 2012, Andy Lutomirski wrote: > It just happened again. > > $ grep -E "compact_|thp_" /proc/vmstat > compact_blocks_moved 8332448774 > compact_pages_moved 21831286 > compact_pagemigrate_failed 211260 > compact_stall 13484 > compact_fail 6717 > compact_success 6755 > thp_fault_alloc
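
To put the quoted counters in proportion, the share of compaction stalls that ended in failure can be computed directly from the `name value` lines. A minimal sketch, assuming the counter names shown in the thread; `compaction_summary` is a hypothetical helper name, not an existing tool.

```shell
#!/bin/sh
# compaction_summary: report what fraction of compaction stalls failed.
# Reads vmstat-style "name value" lines from the given file(s), or from
# stdin when no file is given.
compaction_summary() {
    awk '
        $1 == "compact_stall"   { stall = $2 }
        $1 == "compact_fail"    { fail = $2 }
        $1 == "compact_success" { ok = $2 }
        END {
            if (stall > 0)
                printf "stalls=%d fail=%d success=%d fail_pct=%.1f\n",
                       stall, fail, ok, 100 * fail / stall
            else
                print "no compaction stalls recorded"
        }
    ' "$@"
}
```

On a live system this would be called as `compaction_summary /proc/vmstat`. With the numbers Andy posted (13484 stalls, 6717 failures, 6755 successes) it reports a failure share of 49.8%, so about half of the direct compaction attempts did not succeed.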

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread Andy Lutomirski
On Tue, Nov 13, 2012 at 3:11 PM, David Rientjes wrote: > On Tue, 13 Nov 2012, Andy Lutomirski wrote: > >> I've seen an odd problem three times in the past two weeks. I suspect >> a Linux 3.6 regression. I'm on 3.6.3-1.fc17.x86_64. I run a parallel >> compilation, and no progress is made. All

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread David Rientjes
On Tue, 13 Nov 2012, Andy Lutomirski wrote: > I've seen an odd problem three times in the past two weeks. I suspect > a Linux 3.6 regression. I'm on 3.6.3-1.fc17.x86_64. I run a parallel > compilation, and no progress is made. All cpus are pegged at 100% > system time by the respective

[3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread Andy Lutomirski
I've seen an odd problem three times in the past two weeks. I suspect a Linux 3.6 regression. I'm on 3.6.3-1.fc17.x86_64. I run a parallel compilation, and no progress is made. All cpus are pegged at 100% system time by the respective cc1plus processes. Reading /proc/<pid>/stack shows either []

[3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread Andy Lutomirski
I've seen an odd problem three times in the past two weeks. I suspect a Linux 3.6 regression. I'm on 3.6.3-1.fc17.x86_64. I run a parallel compilation, and no progress is made. All cpus are pegged at 100% system time by the respective cc1plus processes. Reading /proc/<pid>/stack shows either

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread David Rientjes
On Tue, 13 Nov 2012, Andy Lutomirski wrote: I've seen an odd problem three times in the past two weeks. I suspect a Linux 3.6 regression. I'm on 3.6.3-1.fc17.x86_64. I run a parallel compilation, and no progress is made. All cpus are pegged at 100% system time by the respective cc1plus

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread Andy Lutomirski
On Tue, Nov 13, 2012 at 3:11 PM, David Rientjes rient...@google.com wrote: On Tue, 13 Nov 2012, Andy Lutomirski wrote: I've seen an odd problem three times in the past two weeks. I suspect a Linux 3.6 regression. I'm on 3.6.3-1.fc17.x86_64. I run a parallel compilation, and no progress is

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread David Rientjes
On Tue, 13 Nov 2012, Andy Lutomirski wrote: It just happened again. $ grep -E "compact_|thp_" /proc/vmstat compact_blocks_moved 8332448774 compact_pages_moved 21831286 compact_pagemigrate_failed 211260 compact_stall 13484 compact_fail 6717 compact_success 6755 thp_fault_alloc 150665

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread Andy Lutomirski
On Tue, Nov 13, 2012 at 3:41 PM, David Rientjes rient...@google.com wrote: On Tue, 13 Nov 2012, Andy Lutomirski wrote: It just happened again. $ grep -E "compact_|thp_" /proc/vmstat compact_blocks_moved 8332448774 compact_pages_moved 21831286 compact_pagemigrate_failed 211260 compact_stall

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread David Rientjes
On Tue, 13 Nov 2012, Andy Lutomirski wrote: $ grep -E "compact_|thp_" /proc/vmstat compact_blocks_moved 8332448774 compact_pages_moved 21831286 compact_pagemigrate_failed 211260 compact_stall 13484 compact_fail 6717 compact_success 6755 thp_fault_alloc 150665 thp_fault_fallback

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread Marc Duponcheel
Hi all, please let me know if there are patches you want me to try. FWIW time did not stand still and I run 3.6.6 now. On 2012 Nov 13, #David Rientjes wrote: On Tue, 13 Nov 2012, Andy Lutomirski wrote: $ grep -E "compact_|thp_" /proc/vmstat compact_blocks_moved 8332448774

Re: [3.6 regression?] THP + migration/compaction livelock (I think)

2012-11-13 Thread David Rientjes
On Wed, 14 Nov 2012, Marc Duponcheel wrote: Hi all, please let me know if there are patches you want me to try. FWIW time did not stand still and I run 3.6.6 now. Hmm, interesting since there are no core VM changes between 3.6.2, the kernel you ran into problems with, and 3.6.6.