[Kernel-packages] [Bug 1098961] Re: PAE regression: OOM with just a few sleeps
Following a suggestion in http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=695182#513 I tested older kernels: seems that PAE was always (or at least back to karmic 2.6.31) prone to OOM under the sleep test. Curiously I found that non-PAE kernels of that vintage might also OOM, maybe depending on the value of /proc/sys/fs/file-max . Cheers, Paul Paul Szabo p...@maths.usyd.edu.au http://www.maths.usyd.edu.au/u/psz/ School of Mathematics and Statistics University of SydneyAustralia --- Machine: Dell Optiplex 760, 4GB RAM Results: Version Name kernel 4.10Warty 2.6.8 (could not install from CD) 5.04Hoary 2.6.10 5.10Breezy2.6.12 6.06Dapper2.6.15 6.10Edgy 2.6.17 7.04Feisty2.6.20(could not install from CD) 7.10Gutsy 2.6.22(could not install from CD) 8.04Hardy 2.6.24 8.10Intrepid 2.6.27-7 (no PAE) non-PAE(file-max=330344,default)=OK (about 19000 to file limit) non-PAE(file-max=555000)=OOM (after about 31000) 9.04Jaunty2.6.28 9.10Karmic2.6.31-14 PAE=OOM non-PAE(file-max=327956,default)=OK (about 18500 to file limit) non-PAE(file-max=444000)=OK (about 26000 to file limit) non-PAE(file-max=555000)=OK (to PID limit) non-PAE(file-max=888000)=OK (to PID limit) non-PAE(file-max=99)=OOM (after about 32500) 10.04 Lucid 2.6.32-65 PAE=OOM non-PAE(file-max=330827,default)=OK (about 19000 to file limit) non-PAE(file-max=377000)=OK (about 21500 to file limit) non-PAE(file-max=444000)=OK (about 25500 to file limit) non-PAE(file-max=555000)=OOM (after about 31500) 10.10 Maverick 2.6.35-22 (PAE=?) non-PAE(file-max=330677,default)=OK (to PID limit) 11.04 Natty 2.6.38-8 PAE=OOM non-PAE(file-max=330426,default)=OK (to PID limit) 11.10 Oneiric 3.0.0 PAE=OOM non-PAE=OK 12.04 Precise 3.2 12.10 Quantal 3.5 13.04 Raring3.8 13.10 Saucy 3.11 14.04 Trusty3.13 PAE=OOM (no non-PAE kernel) 14.10 Utopic3.16 References: http://en.wikipedia.org/wiki/List_of_Ubuntu_releases https://wiki.ubuntu.com/Releases http://old-releases.ubuntu.com/releases/ http://bugs.debian.org/695182 http://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098961 Commands used: sudo sh -c 'echo 99 /proc/sys/fs/file-max' bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' ** Bug watch added: Debian Bug tracker #695182 http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=695182 -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1098961 Title: PAE regression: OOM with just a few sleeps Status in “linux” package in Ubuntu: Expired Bug description: There is a spurious OOM issue with PAE kernel: it will suffer an OOM crash just by running a few processes. Please see also http://bugs.debian.org/695182 and discussion on linux...@kvack.org e.g. http://marc.info/?l=linux-mmm=135801969519193w=2 I wonder whether https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098342 is related. The issue is a regression with PAE, reproduced and verified on my home PC with 3GB RAM. My PC was running kernel linux-image-3.2.0-35-generic so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic #55-Ubuntu SMP Wed Dec 5 17:45:18 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087972 6922562395716 0 18276 427116 Low:861464 71372 790092 High: 2226508 6208841605624 -/+ buffers/cache: 2468642841108 Swap: 2920 258364 19742556 Then it handled the sleep test bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' just fine, stopped only by max user processes (default setting of ulimit -u 23964), or raising that limit stopped when the machine ran out of PID space; there was no OOM. Installing and running the PAE kernel so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic-pae #55-Ubuntu SMP Wed Dec 5 18:04:39 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087620 6811882406432 0 167332 352296
[Kernel-packages] [Bug 1098961] Re: PAE regression: OOM with just a few sleeps
Should a CVE be assigned, seeing how this issue causes a DoS? And then a USN/DSA should be issued when eventually solved. Should it be reported to SecurityFocus and obtain a BID? Could the Status be changed to something better than Expired? Would some kind soul confirm and explicitly state that this affects multiple users? -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1098961 Title: PAE regression: OOM with just a few sleeps Status in “linux” package in Ubuntu: Expired Bug description: There is a spurious OOM issue with PAE kernel: it will suffer an OOM crash just by running a few processes. Please see also http://bugs.debian.org/695182 and discussion on linux...@kvack.org e.g. http://marc.info/?l=linux-mmm=135801969519193w=2 I wonder whether https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098342 is related. The issue is a regression with PAE, reproduced and verified on my home PC with 3GB RAM. My PC was running kernel linux-image-3.2.0-35-generic so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic #55-Ubuntu SMP Wed Dec 5 17:45:18 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087972 6922562395716 0 18276 427116 Low:861464 71372 790092 High: 2226508 6208841605624 -/+ buffers/cache: 2468642841108 Swap: 2920 258364 19742556 Then it handled the sleep test bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' just fine, stopped only by max user processes (default setting of ulimit -u 23964), or raising that limit stopped when the machine ran out of PID space; there was no OOM. Installing and running the PAE kernel so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic-pae #55-Ubuntu SMP Wed Dec 5 18:04:39 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087620 6811882406432 0 167332 352296 Low:865208 214080 651128 High: 412 4671081755304 -/+ buffers/cache: 1615602926060 Swap: 2920 0 2920 and re-trying the sleep test, it ran into OOM after 18000 or so sleeps and crashed/froze so I had to press the POWER button to recover. Cheers, Paul Paul Szabo p...@maths.usyd.edu.au http://www.maths.usyd.edu.au/u/psz/ School of Mathematics and Statistics University of SydneyAustralia --- AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24. AplayDevices: List of PLAYBACK Hardware Devices card 0: Intel [HDA Intel], device 0: STAC92xx Analog [STAC92xx Analog] Subdevices: 1/1 Subdevice #0: subdevice #0 ApportVersion: 2.0.1-0ubuntu15.1 Architecture: i386 AudioDevicesInUse: USERPID ACCESS COMMAND /dev/snd/controlC0: psz2190 F pulseaudio CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found. Card0.Amixer.info: Card hw:0 'Intel'/'HDA Intel at 0xdfddc000 irq 45' Mixer name : 'SigmaTel STAC9227' Components : 'HDA:83847618,102801dd,00100201' Controls : 38 Simple ctrls : 21 CurrentDmesg: [ 28.160013] eth0: no IPv6 routers present DistroRelease: Ubuntu 12.04 HibernationDevice: RESUME=UUID=9d2bf7ac-9b0a-4082-ac45-f4d3c8e32c23 IwConfig: lono wireless extensions. eth0 no wireless extensions. MachineType: Dell Inc. Dell DM061 MarkForUpload: True Package: linux (not installed) ProcEnviron: TERM=xterm PATH=(custom, no user) LANG=C SHELL=/bin/bash ProcFB: 0 inteldrmfb ProcKernelCmdLine: root=/dev/mapper/isw_cheedcedhh_DadMirroredTB4 ro quiet splash ProcVersionSignature: Ubuntu 3.2.0-35.55-generic-pae 3.2.34 RelatedPackageVersions: linux-restricted-modules-3.2.0-35-generic-pae N/A linux-backports-modules-3.2.0-35-generic-pae N/A linux-firmware1.79.1 RfKill: Tags: precise Uname: Linux 3.2.0-35-generic-pae i686 UpgradeStatus: Upgraded to precise on 2012-04-27 (260 days ago) UserGroups: adm admin cdrom dialout lpadmin plugdev sambashare WifiSyslog: Jan 13 06:42:46 DellE520 NetworkManager[1384]: info Unmanaged Device found; state CONNECTED forced. (see http://bugs.launchpad.net/bugs/191889) WpaSupplicantLog: dmi.bios.date: 03/23/2007 dmi.bios.vendor: Dell Inc. dmi.bios.version: 2.2.1 dmi.board.name: 0WG864 dmi.board.vendor: Dell Inc. dmi.chassis.type: 6 dmi.chassis.vendor: Dell Inc. dmi.modalias:
[Kernel-packages] [Bug 1098961] Re: PAE regression: OOM with just a few sleeps
** Tags added: trusty -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1098961 Title: PAE regression: OOM with just a few sleeps Status in “linux” package in Ubuntu: Expired Bug description: There is a spurious OOM issue with PAE kernel: it will suffer an OOM crash just by running a few processes. Please see also http://bugs.debian.org/695182 and discussion on linux...@kvack.org e.g. http://marc.info/?l=linux-mmm=135801969519193w=2 I wonder whether https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098342 is related. The issue is a regression with PAE, reproduced and verified on my home PC with 3GB RAM. My PC was running kernel linux-image-3.2.0-35-generic so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic #55-Ubuntu SMP Wed Dec 5 17:45:18 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087972 6922562395716 0 18276 427116 Low:861464 71372 790092 High: 2226508 6208841605624 -/+ buffers/cache: 2468642841108 Swap: 2920 258364 19742556 Then it handled the sleep test bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' just fine, stopped only by max user processes (default setting of ulimit -u 23964), or raising that limit stopped when the machine ran out of PID space; there was no OOM. Installing and running the PAE kernel so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic-pae #55-Ubuntu SMP Wed Dec 5 18:04:39 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087620 6811882406432 0 167332 352296 Low:865208 214080 651128 High: 412 4671081755304 -/+ buffers/cache: 1615602926060 Swap: 2920 0 2920 and re-trying the sleep test, it ran into OOM after 18000 or so sleeps and crashed/froze so I had to press the POWER button to recover. Cheers, Paul Paul Szabo p...@maths.usyd.edu.au http://www.maths.usyd.edu.au/u/psz/ School of Mathematics and Statistics University of SydneyAustralia --- AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24. AplayDevices: List of PLAYBACK Hardware Devices card 0: Intel [HDA Intel], device 0: STAC92xx Analog [STAC92xx Analog] Subdevices: 1/1 Subdevice #0: subdevice #0 ApportVersion: 2.0.1-0ubuntu15.1 Architecture: i386 AudioDevicesInUse: USERPID ACCESS COMMAND /dev/snd/controlC0: psz2190 F pulseaudio CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found. Card0.Amixer.info: Card hw:0 'Intel'/'HDA Intel at 0xdfddc000 irq 45' Mixer name : 'SigmaTel STAC9227' Components : 'HDA:83847618,102801dd,00100201' Controls : 38 Simple ctrls : 21 CurrentDmesg: [ 28.160013] eth0: no IPv6 routers present DistroRelease: Ubuntu 12.04 HibernationDevice: RESUME=UUID=9d2bf7ac-9b0a-4082-ac45-f4d3c8e32c23 IwConfig: lono wireless extensions. eth0 no wireless extensions. MachineType: Dell Inc. Dell DM061 MarkForUpload: True Package: linux (not installed) ProcEnviron: TERM=xterm PATH=(custom, no user) LANG=C SHELL=/bin/bash ProcFB: 0 inteldrmfb ProcKernelCmdLine: root=/dev/mapper/isw_cheedcedhh_DadMirroredTB4 ro quiet splash ProcVersionSignature: Ubuntu 3.2.0-35.55-generic-pae 3.2.34 RelatedPackageVersions: linux-restricted-modules-3.2.0-35-generic-pae N/A linux-backports-modules-3.2.0-35-generic-pae N/A linux-firmware1.79.1 RfKill: Tags: precise Uname: Linux 3.2.0-35-generic-pae i686 UpgradeStatus: Upgraded to precise on 2012-04-27 (260 days ago) UserGroups: adm admin cdrom dialout lpadmin plugdev sambashare WifiSyslog: Jan 13 06:42:46 DellE520 NetworkManager[1384]: info Unmanaged Device found; state CONNECTED forced. (see http://bugs.launchpad.net/bugs/191889) WpaSupplicantLog: dmi.bios.date: 03/23/2007 dmi.bios.vendor: Dell Inc. dmi.bios.version: 2.2.1 dmi.board.name: 0WG864 dmi.board.vendor: Dell Inc. dmi.chassis.type: 6 dmi.chassis.vendor: Dell Inc. dmi.modalias: dmi:bvnDellInc.:bvr2.2.1:bd03/23/2007:svnDellInc.:pnDellDM061:pvr:rvnDellInc.:rn0WG864:rvr:cvnDellInc.:ct6:cvr: dmi.product.name: Dell DM061 dmi.sys.vendor: Dell Inc. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098961/+subscriptions -- Mailing list:
[Kernel-packages] [Bug 1098961] Re: PAE regression: OOM with just a few sleeps
Now at Ubuntu VERSION=14.04.1 LTS, Trusty Tahr seems that the one-and-only x86 kernel offered is with PAE, so I cannot easily demonstrate the difference between PAE and non-PAE kernels. With the PAE kernel, my machine produces an OOM crash with the test command bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' My conjecture is that all Linux machines will produce an OOM, regardless of the amount of RAM memory installed. Your challenge is to prove me wrong: find any one machine that survives the above command without an OOM condition. (Obviously the bug, and thus the challenge, is for Linux machines with x86 CPUs and PAE kernels; but is for any distros, not just Ubuntu.) Should you succeed, you will have helped to better understand this bug and so contribute to finding a solution; I also offer the prize of a carton of beer to the first finder; and having knocked me off my soapbox, you will be allowed to feel smug and superior forever. Should you fail the challenge and have your machine reproduce an OOM crash, I urge you to complain to your Linux distributor or to the kernel people. (Seems that bugs are left unfixed for years if only a few people complain, to the point where Ubuntu abandoned the working version non-PAE kernel.) --- Maybe a workaround is to upgrade to 64-bit Linux with amd64 kernel. However... that is just a crude workaround without any guarantee of correctness, until some understanding of this bug: the kernel code is common between 32- and 64-bit. --- Long and boring details below. Cheers, Paul Paul Szabo p...@maths.usyd.edu.au http://www.maths.usyd.edu.au/u/psz/ School of Mathematics and Statistics University of SydneyAustralia - psz@DellE520:~$ dpkg -l | grep linux-image ii linux-image-3.13.0-35-generic 3.13.0-35.62 i386 Linux kernel image for version 3.13.0 on 32 bit x86 SMP ii linux-image-extra-3.13.0-35-generic 3.13.0-35.62 i386 Linux kernel extra modules for version 3.13.0 on 32 bit x86 SMP ii linux-image-generic 3.13.0.35.42 i386 Generic Linux kernel image psz@DellE520:~$ psz@DellE520:~$ uname -a Linux DellE520 3.13.0-35-generic #62-Ubuntu SMP Fri Aug 15 01:58:01 UTC 2014 i686 i686 i686 GNU/Linux psz@DellE520:~$ psz@DellE520:~$ grep -H PAE /boot/conf* /boot/config-3.13.0-35-generic:CONFIG_X86_PAE=y psz@DellE520:~$ psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 309241610278322064584 142520 125124 519192 Low:870004 444336 425668 High: 412 5834961638916 -/+ buffers/cache: 3835162708900 Swap: 2920 0 2920 psz@DellE520:~$ psz@DellE520:~$ ulimit -u 23950 psz@DellE520:~$ psz@DellE520:~$ bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' The above produces an OOM as shown in /var/log/syslog : Sep 15 17:12:54 DellE520 kernel: [ 403.028298] bash invoked oom-killer: gfp_mask=0x2084d0, order=0, oom_score_adj=0 Sep 15 17:12:54 DellE520 kernel: [ 403.028309] bash cpuset=/ mems_allowed=0 Sep 15 17:12:54 DellE520 kernel: [ 403.028317] CPU: 1 PID: 19509 Comm: bash Not tainted 3.13.0-35-generic #62-Ubuntu Sep 15 17:12:54 DellE520 kernel: [ 403.028320] Hardware name: Dell Inc. Dell DM061 /0WG864, BIOS 2.4.0 05/24/2007 Sep 15 17:12:54 DellE520 kernel: [ 403.028324] eb845d68 c1652f30 eb86b400 eb845dc0 c164f4d9 c18437dc Sep 15 17:12:54 DellE520 kernel: [ 403.028336] eb86b70c 002084d0 f7b8d140 0046 Sep 15 17:12:54 DellE520 kernel: [ 403.028346] 0206 c1944560 eb845dc0 e6e90d00 002084d0 eb845dcc Sep 15 17:12:54 DellE520 kernel: [ 403.028357] Call Trace: Sep 15 17:12:54 DellE520 kernel: [ 403.028372] [c1652f30] dump_stack+0x41/0x52 Sep 15 17:12:54 DellE520 kernel: [ 403.028378] [c164f4d9] dump_header.isra.9+0x76/0x1c4 Sep 15 17:12:54 DellE520 kernel: [ 403.028386] [c11251d7] oom_kill_process+0x167/0x2b0 Sep 15 17:12:54 DellE520 kernel: [ 403.028393] [c12706fc] ? security_capable_noaudit+0x1c/0x30 Sep 15 17:12:54 DellE520 kernel: [ 403.028399] [c105fa5a] ? has_capability_noaudit+0x1a/0x30 Sep 15 17:12:54 DellE520 kernel: [ 403.028405] [c11256eb] out_of_memory+0x22b/0x260 Sep 15 17:12:54 DellE520 kernel: [ 403.028410] [c112a011] __alloc_pages_nodemask+0x861/0x980 Sep 15 17:12:54 DellE520 kernel: [ 403.028417] [c112a14c] __get_free_pages+0x1c/0x40 Sep 15 17:12:54 DellE520 kernel: [ 403.028422] [c104e3f8] pgd_alloc+0x38/0x250 Sep 15 17:12:54
[Kernel-packages] [Bug 1098961] Re: PAE regression: OOM with just a few sleeps
Expired? That is weird... how come it did not expire between comments#18 and comments#19, between 16Jan and 6Aug? Anyway the bug is alive... as anyone could easily test, with a simple bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' On any PAE machine (any amount of RAM, any version userland code, and any version kernel, I tested many), that should produce an OOM in a minute or so. Conversely: does anyone have a PAE machine where the above does not produce an OOM? Yes I know I promised to test latest development and upstream or somesuch. Sorry I did not get around to it, yet: the test would be rather disruptive on my home computer. I also note the suggestion (from kernel developers?) in the references above, to use amd64 kernel even with 32-bit userland. Maybe Ubuntu could follow suit. Cheers, Paul -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1098961 Title: PAE regression: OOM with just a few sleeps Status in “linux” package in Ubuntu: Expired Bug description: There is a spurious OOM issue with PAE kernel: it will suffer an OOM crash just by running a few processes. Please see also http://bugs.debian.org/695182 and discussion on linux...@kvack.org e.g. http://marc.info/?l=linux-mmm=135801969519193w=2 I wonder whether https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098342 is related. The issue is a regression with PAE, reproduced and verified on my home PC with 3GB RAM. My PC was running kernel linux-image-3.2.0-35-generic so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic #55-Ubuntu SMP Wed Dec 5 17:45:18 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087972 6922562395716 0 18276 427116 Low:861464 71372 790092 High: 2226508 6208841605624 -/+ buffers/cache: 2468642841108 Swap: 2920 258364 19742556 Then it handled the sleep test bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' just fine, stopped only by max user processes (default setting of ulimit -u 23964), or raising that limit stopped when the machine ran out of PID space; there was no OOM. Installing and running the PAE kernel so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic-pae #55-Ubuntu SMP Wed Dec 5 18:04:39 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087620 6811882406432 0 167332 352296 Low:865208 214080 651128 High: 412 4671081755304 -/+ buffers/cache: 1615602926060 Swap: 2920 0 2920 and re-trying the sleep test, it ran into OOM after 18000 or so sleeps and crashed/froze so I had to press the POWER button to recover. Cheers, Paul Paul Szabo p...@maths.usyd.edu.au http://www.maths.usyd.edu.au/u/psz/ School of Mathematics and Statistics University of SydneyAustralia --- AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24. AplayDevices: List of PLAYBACK Hardware Devices card 0: Intel [HDA Intel], device 0: STAC92xx Analog [STAC92xx Analog] Subdevices: 1/1 Subdevice #0: subdevice #0 ApportVersion: 2.0.1-0ubuntu15.1 Architecture: i386 AudioDevicesInUse: USERPID ACCESS COMMAND /dev/snd/controlC0: psz2190 F pulseaudio CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found. Card0.Amixer.info: Card hw:0 'Intel'/'HDA Intel at 0xdfddc000 irq 45' Mixer name : 'SigmaTel STAC9227' Components : 'HDA:83847618,102801dd,00100201' Controls : 38 Simple ctrls : 21 CurrentDmesg: [ 28.160013] eth0: no IPv6 routers present DistroRelease: Ubuntu 12.04 HibernationDevice: RESUME=UUID=9d2bf7ac-9b0a-4082-ac45-f4d3c8e32c23 IwConfig: lono wireless extensions. eth0 no wireless extensions. MachineType: Dell Inc. Dell DM061 MarkForUpload: True Package: linux (not installed) ProcEnviron: TERM=xterm PATH=(custom, no user) LANG=C SHELL=/bin/bash ProcFB: 0 inteldrmfb ProcKernelCmdLine: root=/dev/mapper/isw_cheedcedhh_DadMirroredTB4 ro quiet splash ProcVersionSignature: Ubuntu 3.2.0-35.55-generic-pae 3.2.34 RelatedPackageVersions: linux-restricted-modules-3.2.0-35-generic-pae N/A linux-backports-modules-3.2.0-35-generic-pae N/A linux-firmware1.79.1 RfKill: Tags:
[Kernel-packages] [Bug 1098961] Re: PAE regression: OOM with just a few sleeps
[Expired for linux (Ubuntu) because there has been no activity for 60 days.] ** Changed in: linux (Ubuntu) Status: Incomplete = Expired -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1098961 Title: PAE regression: OOM with just a few sleeps Status in “linux” package in Ubuntu: Expired Bug description: There is a spurious OOM issue with PAE kernel: it will suffer an OOM crash just by running a few processes. Please see also http://bugs.debian.org/695182 and discussion on linux...@kvack.org e.g. http://marc.info/?l=linux-mmm=135801969519193w=2 I wonder whether https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098342 is related. The issue is a regression with PAE, reproduced and verified on my home PC with 3GB RAM. My PC was running kernel linux-image-3.2.0-35-generic so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic #55-Ubuntu SMP Wed Dec 5 17:45:18 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087972 6922562395716 0 18276 427116 Low:861464 71372 790092 High: 2226508 6208841605624 -/+ buffers/cache: 2468642841108 Swap: 2920 258364 19742556 Then it handled the sleep test bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' just fine, stopped only by max user processes (default setting of ulimit -u 23964), or raising that limit stopped when the machine ran out of PID space; there was no OOM. Installing and running the PAE kernel so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic-pae #55-Ubuntu SMP Wed Dec 5 18:04:39 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087620 6811882406432 0 167332 352296 Low:865208 214080 651128 High: 412 4671081755304 -/+ buffers/cache: 1615602926060 Swap: 2920 0 2920 and re-trying the sleep test, it ran into OOM after 18000 or so sleeps and crashed/froze so I had to press the POWER button to recover. Cheers, Paul Paul Szabo p...@maths.usyd.edu.au http://www.maths.usyd.edu.au/u/psz/ School of Mathematics and Statistics University of SydneyAustralia --- AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24. AplayDevices: List of PLAYBACK Hardware Devices card 0: Intel [HDA Intel], device 0: STAC92xx Analog [STAC92xx Analog] Subdevices: 1/1 Subdevice #0: subdevice #0 ApportVersion: 2.0.1-0ubuntu15.1 Architecture: i386 AudioDevicesInUse: USERPID ACCESS COMMAND /dev/snd/controlC0: psz2190 F pulseaudio CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found. Card0.Amixer.info: Card hw:0 'Intel'/'HDA Intel at 0xdfddc000 irq 45' Mixer name : 'SigmaTel STAC9227' Components : 'HDA:83847618,102801dd,00100201' Controls : 38 Simple ctrls : 21 CurrentDmesg: [ 28.160013] eth0: no IPv6 routers present DistroRelease: Ubuntu 12.04 HibernationDevice: RESUME=UUID=9d2bf7ac-9b0a-4082-ac45-f4d3c8e32c23 IwConfig: lono wireless extensions. eth0 no wireless extensions. MachineType: Dell Inc. Dell DM061 MarkForUpload: True Package: linux (not installed) ProcEnviron: TERM=xterm PATH=(custom, no user) LANG=C SHELL=/bin/bash ProcFB: 0 inteldrmfb ProcKernelCmdLine: root=/dev/mapper/isw_cheedcedhh_DadMirroredTB4 ro quiet splash ProcVersionSignature: Ubuntu 3.2.0-35.55-generic-pae 3.2.34 RelatedPackageVersions: linux-restricted-modules-3.2.0-35-generic-pae N/A linux-backports-modules-3.2.0-35-generic-pae N/A linux-firmware1.79.1 RfKill: Tags: precise Uname: Linux 3.2.0-35-generic-pae i686 UpgradeStatus: Upgraded to precise on 2012-04-27 (260 days ago) UserGroups: adm admin cdrom dialout lpadmin plugdev sambashare WifiSyslog: Jan 13 06:42:46 DellE520 NetworkManager[1384]: info Unmanaged Device found; state CONNECTED forced. (see http://bugs.launchpad.net/bugs/191889) WpaSupplicantLog: dmi.bios.date: 03/23/2007 dmi.bios.vendor: Dell Inc. dmi.bios.version: 2.2.1 dmi.board.name: 0WG864 dmi.board.vendor: Dell Inc. dmi.chassis.type: 6 dmi.chassis.vendor: Dell Inc. dmi.modalias: dmi:bvnDellInc.:bvr2.2.1:bd03/23/2007:svnDellInc.:pnDellDM061:pvr:rvnDellInc.:rn0WG864:rvr:cvnDellInc.:ct6:cvr: dmi.product.name: Dell DM061 dmi.sys.vendor: Dell Inc. To manage notifications about
[Kernel-packages] [Bug 1098961] Re: PAE regression: OOM with just a few sleeps
Dear Christopher, Thanks for the hint about the BIOS. I updated it, the BIOS version went from 2.2.1 to 2.4.0. With the new BIOS and running non-PAE kernel: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-51-generic #77-Ubuntu SMP Wed Jul 24 20:21:10 UTC 2013 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 308796011285041959456 0 152800 705708 Low:861452 213880 647572 High: 2226508 9146241311884 -/+ buffers/cache: 2699962817964 Swap: 2920 0 2920 it handled the sleep test bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' just fine (as expected), running out of max user processes because of default setting of ulimit -u 23964. Installing the PAE kernel and rebooting (of course with new BIOS): psz@DellE520:~$ uname -a Linux DellE520 3.2.0-51-generic-pae #77-Ubuntu SMP Wed Jul 24 20:40:32 UTC 2013 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087592 6107402476852 0 112796 344216 Low:865180 160496 704684 High: 412 4502441772168 -/+ buffers/cache: 1537282933864 Swap: 2920 0 2920 (has default setting ulimit -u 23960) and re-trying the sleep test, things went bad after 22500 processes. The graphics console crashed, with X saying your graphics device is running in low-resolution mode or somesuch, and with no mouse or keyboard response so could not click the OK button. Hitting Alt-Ctrl-F2 worked and got a getty login. Looking with ps, there were no sleep processes left running. In /var/log/syslog I found: Aug 8 20:01:48 DellE520 kernel: [ 341.000570] oom_kill_process: 39 callbacks suppressed Aug 8 20:01:48 DellE520 kernel: [ 341.000578] bash invoked oom-killer: gfp_mask=0x84d0, order=0, oom_adj=0, oom_score_adj=0 Aug 8 20:01:48 DellE520 kernel: [ 341.000584] bash cpuset=/ mems_allowed=0 Aug 8 20:01:48 DellE520 kernel: [ 341.000589] Pid: 2375, comm: bash Not tainted 3.2.0-51-generic-pae #77-Ubuntu Aug 8 20:01:48 DellE520 kernel: [ 341.000593] Call Trace: Aug 8 20:01:48 DellE520 kernel: [ 341.000605] [c10fa6c5] dump_header.isra.6+0x85/0xc0 Aug 8 20:01:48 DellE520 kernel: [ 341.000612] [c10fa8fc] oom_kill_process+0x5c/0x80 Aug 8 20:01:48 DellE520 kernel: [ 341.000617] [c10fad15] out_of_memory+0xc5/0x1c0 Aug 8 20:01:48 DellE520 kernel: [ 341.000623] [c10febcc] __alloc_pages_nodemask+0x72c/0x740 Aug 8 20:01:48 DellE520 kernel: [ 341.000630] [c10fec5c] __get_free_pages+0x1c/0x30 Aug 8 20:01:48 DellE520 kernel: [ 341.000636] [c103af54] pgd_alloc+0x74/0x250 Aug 8 20:01:48 DellE520 kernel: [ 341.000642] [c10440ca] ? place_entity+0x9a/0x160 Aug 8 20:01:48 DellE520 kernel: [ 341.000648] [c1058487] mm_init+0xc7/0x100 Aug 8 20:01:48 DellE520 kernel: [ 341.000653] [c1058f23] dup_mm+0xa3/0x230 Aug 8 20:01:48 DellE520 kernel: [ 341.000658] [c10597fd] copy_process.part.23+0x71d/0xc10 Aug 8 20:01:48 DellE520 kernel: [ 341.000663] [c1059d6e] copy_process+0x7e/0x90 Aug 8 20:01:48 DellE520 kernel: [ 341.000668] [c1059eb4] do_fork+0xf4/0x2c0 Aug 8 20:01:48 DellE520 kernel: [ 341.000675] [c1019874] sys_clone+0x34/0x40 Aug 8 20:01:48 DellE520 kernel: [ 341.000682] [c15b21d9] ptregs_clone+0x15/0x3c Aug 8 20:01:48 DellE520 kernel: [ 341.000687] [c15b209f] ? sysenter_do_call+0x12/0x28 Aug 8 20:01:48 DellE520 kernel: [ 341.000691] Mem-Info: Aug 8 20:01:48 DellE520 kernel: [ 341.000694] DMA per-cpu: Aug 8 20:01:48 DellE520 kernel: [ 341.000699] CPU0: hi:0, btch: 1 usd: 0 Aug 8 20:01:48 DellE520 kernel: [ 341.000702] CPU1: hi:0, btch: 1 usd: 0 Aug 8 20:01:48 DellE520 kernel: [ 341.000705] Normal per-cpu: Aug 8 20:01:48 DellE520 kernel: [ 341.000708] CPU0: hi: 186, btch: 31 usd: 0 Aug 8 20:01:48 DellE520 kernel: [ 341.000711] CPU1: hi: 186, btch: 31 usd: 29 Aug 8 20:01:48 DellE520 kernel: [ 341.000714] HighMem per-cpu: Aug 8 20:01:48 DellE520 kernel: [ 341.000717] CPU0: hi: 186, btch: 31 usd: 0 Aug 8 20:01:48 DellE520 kernel: [ 341.000720] CPU1: hi: 186, btch: 31 usd: 167 Aug 8 20:01:48 DellE520 kernel: [ 341.000726] active_anon:326331 inactive_anon:81629 isolated_anon:0 Aug 8 20:01:48 DellE520 kernel: [ 341.000728] active_file:11586 inactive_file:8659 isolated_file:0 Aug 8 20:01:48 DellE520 kernel: [ 341.000729] unevictable:0 dirty:0 writeback:2 unstable:0 Aug 8 20:01:48 DellE520 kernel: [ 341.000730] free:25897 slab_reclaimable:2285 slab_unreclaimable:60037 Aug 8 20:01:48 DellE520 kernel: [ 341.000732] mapped:14875 shmem:32942 pagetables:111532 bounce:0 Aug 8 20:01:48 DellE520 kernel: [ 341.000740]
[Kernel-packages] [Bug 1098961] Re: PAE regression: OOM with just a few sleeps
Dear Christopher, ... test the latest upstream kernel ... Did not I do that already in https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098961/comments/18 ? Is there really a need to re-test? (Yes I know that the kernel now is different from what I tested... but unless you know that something related was fixed, are you hoping for an accidental fix?) But yes, OK, I love hard work... will do... Cheers, Paul -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1098961 Title: PAE regression: OOM with just a few sleeps Status in “linux” package in Ubuntu: Incomplete Bug description: There is a spurious OOM issue with PAE kernel: it will suffer an OOM crash just by running a few processes. Please see also http://bugs.debian.org/695182 and discussion on linux...@kvack.org e.g. http://marc.info/?l=linux-mmm=135801969519193w=2 I wonder whether https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098342 is related. The issue is a regression with PAE, reproduced and verified on my home PC with 3GB RAM. My PC was running kernel linux-image-3.2.0-35-generic so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic #55-Ubuntu SMP Wed Dec 5 17:45:18 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087972 6922562395716 0 18276 427116 Low:861464 71372 790092 High: 2226508 6208841605624 -/+ buffers/cache: 2468642841108 Swap: 2920 258364 19742556 Then it handled the sleep test bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' just fine, stopped only by max user processes (default setting of ulimit -u 23964), or raising that limit stopped when the machine ran out of PID space; there was no OOM. Installing and running the PAE kernel so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic-pae #55-Ubuntu SMP Wed Dec 5 18:04:39 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087620 6811882406432 0 167332 352296 Low:865208 214080 651128 High: 412 4671081755304 -/+ buffers/cache: 1615602926060 Swap: 2920 0 2920 and re-trying the sleep test, it ran into OOM after 18000 or so sleeps and crashed/froze so I had to press the POWER button to recover. Cheers, Paul Paul Szabo p...@maths.usyd.edu.au http://www.maths.usyd.edu.au/u/psz/ School of Mathematics and Statistics University of SydneyAustralia --- AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24. AplayDevices: List of PLAYBACK Hardware Devices card 0: Intel [HDA Intel], device 0: STAC92xx Analog [STAC92xx Analog] Subdevices: 1/1 Subdevice #0: subdevice #0 ApportVersion: 2.0.1-0ubuntu15.1 Architecture: i386 AudioDevicesInUse: USERPID ACCESS COMMAND /dev/snd/controlC0: psz2190 F pulseaudio CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found. Card0.Amixer.info: Card hw:0 'Intel'/'HDA Intel at 0xdfddc000 irq 45' Mixer name : 'SigmaTel STAC9227' Components : 'HDA:83847618,102801dd,00100201' Controls : 38 Simple ctrls : 21 CurrentDmesg: [ 28.160013] eth0: no IPv6 routers present DistroRelease: Ubuntu 12.04 HibernationDevice: RESUME=UUID=9d2bf7ac-9b0a-4082-ac45-f4d3c8e32c23 IwConfig: lono wireless extensions. eth0 no wireless extensions. MachineType: Dell Inc. Dell DM061 MarkForUpload: True Package: linux (not installed) ProcEnviron: TERM=xterm PATH=(custom, no user) LANG=C SHELL=/bin/bash ProcFB: 0 inteldrmfb ProcKernelCmdLine: root=/dev/mapper/isw_cheedcedhh_DadMirroredTB4 ro quiet splash ProcVersionSignature: Ubuntu 3.2.0-35.55-generic-pae 3.2.34 RelatedPackageVersions: linux-restricted-modules-3.2.0-35-generic-pae N/A linux-backports-modules-3.2.0-35-generic-pae N/A linux-firmware1.79.1 RfKill: Tags: precise Uname: Linux 3.2.0-35-generic-pae i686 UpgradeStatus: Upgraded to precise on 2012-04-27 (260 days ago) UserGroups: adm admin cdrom dialout lpadmin plugdev sambashare WifiSyslog: Jan 13 06:42:46 DellE520 NetworkManager[1384]: info Unmanaged Device found; state CONNECTED forced. (see http://bugs.launchpad.net/bugs/191889) WpaSupplicantLog: dmi.bios.date: 03/23/2007 dmi.bios.vendor: Dell Inc. dmi.bios.version: 2.2.1 dmi.board.name: 0WG864 dmi.board.vendor: Dell
[Kernel-packages] [Bug 1098961] Re: PAE regression: OOM with just a few sleeps
Paul Szabo, as per http://www.dell.com/support/drivers/us/en/19/driverdetails?driverid=R157196 an update is available for your BIOS (2.4.0). If you update to this, does it change anything? If not, could you please both specify what happened, and provide the output of the following terminal command: sudo dmidecode -s bios-version sudo dmidecode -s bios-release-date Thank you for your understanding. ** Tags added: bios-outdated-2.4.0 needs-upstream-testing precise regression-potential ** Changed in: linux (Ubuntu) Status: Confirmed = Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1098961 Title: PAE regression: OOM with just a few sleeps Status in “linux” package in Ubuntu: Incomplete Bug description: There is a spurious OOM issue with PAE kernel: it will suffer an OOM crash just by running a few processes. Please see also http://bugs.debian.org/695182 and discussion on linux...@kvack.org e.g. http://marc.info/?l=linux-mmm=135801969519193w=2 I wonder whether https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1098342 is related. The issue is a regression with PAE, reproduced and verified on my home PC with 3GB RAM. My PC was running kernel linux-image-3.2.0-35-generic so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic #55-Ubuntu SMP Wed Dec 5 17:45:18 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087972 6922562395716 0 18276 427116 Low:861464 71372 790092 High: 2226508 6208841605624 -/+ buffers/cache: 2468642841108 Swap: 2920 258364 19742556 Then it handled the sleep test bash -c 'n=0; while [ $n -lt 33000 ]; do sleep 600 ((n=n+1)); ((m=n%500)); if [ $m -lt 1 ]; then echo -n $n - ; date; free -l; sleep 1; fi; done' just fine, stopped only by max user processes (default setting of ulimit -u 23964), or raising that limit stopped when the machine ran out of PID space; there was no OOM. Installing and running the PAE kernel so it showed: psz@DellE520:~$ uname -a Linux DellE520 3.2.0-35-generic-pae #55-Ubuntu SMP Wed Dec 5 18:04:39 UTC 2012 i686 i686 i386 GNU/Linux psz@DellE520:~$ free -l total used free sharedbuffers cached Mem: 3087620 6811882406432 0 167332 352296 Low:865208 214080 651128 High: 412 4671081755304 -/+ buffers/cache: 1615602926060 Swap: 2920 0 2920 and re-trying the sleep test, it ran into OOM after 18000 or so sleeps and crashed/froze so I had to press the POWER button to recover. Cheers, Paul Paul Szabo p...@maths.usyd.edu.au http://www.maths.usyd.edu.au/u/psz/ School of Mathematics and Statistics University of SydneyAustralia --- AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24. AplayDevices: List of PLAYBACK Hardware Devices card 0: Intel [HDA Intel], device 0: STAC92xx Analog [STAC92xx Analog] Subdevices: 1/1 Subdevice #0: subdevice #0 ApportVersion: 2.0.1-0ubuntu15.1 Architecture: i386 AudioDevicesInUse: USERPID ACCESS COMMAND /dev/snd/controlC0: psz2190 F pulseaudio CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found. Card0.Amixer.info: Card hw:0 'Intel'/'HDA Intel at 0xdfddc000 irq 45' Mixer name : 'SigmaTel STAC9227' Components : 'HDA:83847618,102801dd,00100201' Controls : 38 Simple ctrls : 21 CurrentDmesg: [ 28.160013] eth0: no IPv6 routers present DistroRelease: Ubuntu 12.04 HibernationDevice: RESUME=UUID=9d2bf7ac-9b0a-4082-ac45-f4d3c8e32c23 IwConfig: lono wireless extensions. eth0 no wireless extensions. MachineType: Dell Inc. Dell DM061 MarkForUpload: True Package: linux (not installed) ProcEnviron: TERM=xterm PATH=(custom, no user) LANG=C SHELL=/bin/bash ProcFB: 0 inteldrmfb ProcKernelCmdLine: root=/dev/mapper/isw_cheedcedhh_DadMirroredTB4 ro quiet splash ProcVersionSignature: Ubuntu 3.2.0-35.55-generic-pae 3.2.34 RelatedPackageVersions: linux-restricted-modules-3.2.0-35-generic-pae N/A linux-backports-modules-3.2.0-35-generic-pae N/A linux-firmware1.79.1 RfKill: Tags: precise Uname: Linux 3.2.0-35-generic-pae i686 UpgradeStatus: Upgraded to precise on 2012-04-27 (260 days ago) UserGroups: adm admin cdrom dialout lpadmin plugdev sambashare WifiSyslog: Jan 13 06:42:46 DellE520 NetworkManager[1384]: info Unmanaged Device found; state CONNECTED forced. (see http://bugs.launchpad.net/bugs/191889)