[Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
Markng this invalid. SOMETHING resolved it, but we never figured out what. ** Changed in: linux (Ubuntu) Status: Confirmed => Invalid -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Invalid Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the bigheap test runs, every time so far (two of two runs have failed). I'm doing a third now to confirm that bigheap is where the lockup occurs. I've also run this exact same test on similar and smaller memory amounts on s390x and amd64 without problem. This is also being done to provide data for a similar bug discovered on Power (ppc64le). To test this on an arm64 system: Install Xenial $ add-apt-repository ppa:hardware-certification/public $ apt update $ apt install canonical-certification-server $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng the memory_stress_ng script is a wrapper for stress_ng that only calls certain memory tests. See script for an idea of how it's executing the tests. This could be the same issue that we're seeing on power, or it could be a different issue for ARM that looks similar. Here's the original Power bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.4.0-31-generic 4.4.0-31.50 ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 Uname: Linux 4.4.0-31-generic aarch64 AlsaDevices: total 0 crw-rw 1 root audio 116, 1 Aug 5 09:27 seq crw-rw 1 root audio 116, 33 Aug 5 09:27 timer AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.20.1-0ubuntu2.1 Architecture: arm64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Date: Fri Aug 5 16:15:51 2016 IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' Lsusb: Error: command ['lsusb'] failed with exit code 1: PciMultimedia: ProcEnviron: TERM=xterm PATH=(custom, no user) XDG_RUNTIME_DIR= LANG=en_US.UTF-8 SHELL=/bin/bash ProcFB: ProcKernelCmdLine: console=ttyS0,9600n8r ro RelatedPackageVersions: linux-restricted-modules-4.4.0-31-generic N/A linux-backports-modules-4.4.0-31-generic N/A linux-firmware1.157.2 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
Re: [Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
Not yet... I've tried 4.7.0-040700 on a 64GB arm64 system with no failures. I've also tried 4.4.0-33.52 on a 32GB ARM64 system with no failures. Currently trying 4.4.0-21.37 on the 32GB system and 4.4.0-34.53 on the 64GB system. I initially saw this on 4.4.0-31.50 on both systems, so I had expected 21.37 to fail immediately, but it's survived two passes so far. :/ On Thu, Aug 18, 2016 at 7:46 PM, Balbir Singh wrote: > Do we know what change fixed the issue? commit id? > > -- > You received this bug notification because you are subscribed to the bug > report. > https://bugs.launchpad.net/bugs/1610320 > > Title: > stress-ng memory testing causes Arm64 system to hang > > Status in linux package in Ubuntu: > Confirmed > > Bug description: > Running the certification memory test using stress-ng on an ARM64 > system with 64GB roughly of RAM (MAAS shows 63GB). > > The test runs several stress-ng memory related tests. It appears that > the system locks up when the bigheap test runs, every time so far (two > of two runs have failed). > > I'm doing a third now to confirm that bigheap is where the lockup > occurs. > > I've also run this exact same test on similar and smaller memory > amounts on s390x and amd64 without problem. > > This is also being done to provide data for a similar bug discovered > on Power (ppc64le). > > To test this on an arm64 system: > > Install Xenial > $ add-apt-repository ppa:hardware-certification/public > $ apt update > $ apt install canonical-certification-server > $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng > > the memory_stress_ng script is a wrapper for stress_ng that only calls > certain memory tests. See script for an idea of how it's executing > the tests. > > This could be the same issue that we're seeing on power, or it could > be a different issue for ARM that looks similar. Here's the original > Power bug: > > https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 > > ProblemType: Bug > DistroRelease: Ubuntu 16.04 > Package: linux-image-4.4.0-31-generic 4.4.0-31.50 > ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 > Uname: Linux 4.4.0-31-generic aarch64 > AlsaDevices: >total 0 >crw-rw 1 root audio 116, 1 Aug 5 09:27 seq >crw-rw 1 root audio 116, 33 Aug 5 09:27 timer > AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' > ApportVersion: 2.20.1-0ubuntu2.1 > Architecture: arm64 > ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' > AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', > '/dev/snd/timer'] failed with exit code 1: > CRDA: N/A > Date: Fri Aug 5 16:15:51 2016 > IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' > Lsusb: Error: command ['lsusb'] failed with exit code 1: > PciMultimedia: > > ProcEnviron: >TERM=xterm >PATH=(custom, no user) >XDG_RUNTIME_DIR= >LANG=en_US.UTF-8 >SHELL=/bin/bash > ProcFB: > > ProcKernelCmdLine: console=ttyS0,9600n8r ro > RelatedPackageVersions: >linux-restricted-modules-4.4.0-31-generic N/A >linux-backports-modules-4.4.0-31-generic N/A >linux-firmware1.157.2 > RfKill: Error: [Errno 2] No such file or directory: 'rfkill' > SourcePackage: linux > UpgradeStatus: No upgrade log present (probably fresh install) > > To manage notifications about this bug go to: > https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions > > Launchpad-Notification-Type: bug > Launchpad-Bug: distribution=ubuntu; sourcepackage=linux; component=main; > status=Confirmed; importance=High; assignee=None; > Launchpad-Bug-Tags: apport-bug arm64 blocks-hwcert-server kernel-da-key > kernel-fixed-upstream uec-images xenial > Launchpad-Bug-Information-Type: Public > Launchpad-Bug-Private: no > Launchpad-Bug-Security-Vulnerability: no > Launchpad-Bug-Commenters: bladernr brad-figg bsingharora jsalisbury mreed8855 > Launchpad-Bug-Reporter: Jeff Lane (bladernr) > Launchpad-Bug-Modifier: Balbir Singh (bsingharora) > Launchpad-Message-Rationale: Subscriber > Launchpad-Message-For: bladernr -- "Entropy isn't what it used to be." Jeff Lane - Server Certification Lead, Warrior Poet, Biker, Lover of Pie Phone: 919-442-8649 Ubuntu Ham: W4KDH Freenode IRC: bladernr or bladernr_ gpg: 1024D/3A14B2DD 8C88 B076 0DD7 B404 1417 C466 4ABD 3635 3A14 B2DD -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Confirmed Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the big
[Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
Do we know what change fixed the issue? commit id? -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Confirmed Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the bigheap test runs, every time so far (two of two runs have failed). I'm doing a third now to confirm that bigheap is where the lockup occurs. I've also run this exact same test on similar and smaller memory amounts on s390x and amd64 without problem. This is also being done to provide data for a similar bug discovered on Power (ppc64le). To test this on an arm64 system: Install Xenial $ add-apt-repository ppa:hardware-certification/public $ apt update $ apt install canonical-certification-server $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng the memory_stress_ng script is a wrapper for stress_ng that only calls certain memory tests. See script for an idea of how it's executing the tests. This could be the same issue that we're seeing on power, or it could be a different issue for ARM that looks similar. Here's the original Power bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.4.0-31-generic 4.4.0-31.50 ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 Uname: Linux 4.4.0-31-generic aarch64 AlsaDevices: total 0 crw-rw 1 root audio 116, 1 Aug 5 09:27 seq crw-rw 1 root audio 116, 33 Aug 5 09:27 timer AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.20.1-0ubuntu2.1 Architecture: arm64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Date: Fri Aug 5 16:15:51 2016 IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' Lsusb: Error: command ['lsusb'] failed with exit code 1: PciMultimedia: ProcEnviron: TERM=xterm PATH=(custom, no user) XDG_RUNTIME_DIR= LANG=en_US.UTF-8 SHELL=/bin/bash ProcFB: ProcKernelCmdLine: console=ttyS0,9600n8r ro RelatedPackageVersions: linux-restricted-modules-4.4.0-31-generic N/A linux-backports-modules-4.4.0-31-generic N/A linux-firmware1.157.2 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
Re: [Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
Any chance you can back to -31 (16.04.1 stock) and see if it fails there? That's where I was failing on my systems, I'll see if I can grab -34 and give that a try too) On Fri, Aug 12, 2016 at 10:59 AM, Michael Reed wrote: > I just ran this on 4.4.0-34-generic from the dev ppa on an arm64 system > and it passed. I am re-running this now to see if it reliably passes. > > > ubuntu@ubuntu:~$ sudo apt-cache policy canonical-certification-server > canonical-certification-server: > Installed: 0.28+201607151353~ubuntu16.04.1 > Candidate: 0.28+201607151353~ubuntu16.04.1 > Version table: > *** 0.28+201607151353~ubuntu16.04.1 500 > 500 http://ppa.launchpad.net/checkbox-dev/ppa/ubuntu xenial/main > arm64 Packages > 100 /var/lib/dpkg/status > ubuntu@ubuntu:~$ uname -a > Linux ubuntu 4.4.0-34-generic #53-Ubuntu SMP Wed Jul 27 16:07:06 UTC 2016 > aarch64 aarch64 aarch64 GNU/Linux > > -- > You received this bug notification because you are subscribed to the bug > report. > https://bugs.launchpad.net/bugs/1610320 > > Title: > stress-ng memory testing causes Arm64 system to hang > > Status in linux package in Ubuntu: > Confirmed > > Bug description: > Running the certification memory test using stress-ng on an ARM64 > system with 64GB roughly of RAM (MAAS shows 63GB). > > The test runs several stress-ng memory related tests. It appears that > the system locks up when the bigheap test runs, every time so far (two > of two runs have failed). > > I'm doing a third now to confirm that bigheap is where the lockup > occurs. > > I've also run this exact same test on similar and smaller memory > amounts on s390x and amd64 without problem. > > This is also being done to provide data for a similar bug discovered > on Power (ppc64le). > > To test this on an arm64 system: > > Install Xenial > $ add-apt-repository ppa:hardware-certification/public > $ apt update > $ apt install canonical-certification-server > $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng > > the memory_stress_ng script is a wrapper for stress_ng that only calls > certain memory tests. See script for an idea of how it's executing > the tests. > > This could be the same issue that we're seeing on power, or it could > be a different issue for ARM that looks similar. Here's the original > Power bug: > > https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 > > ProblemType: Bug > DistroRelease: Ubuntu 16.04 > Package: linux-image-4.4.0-31-generic 4.4.0-31.50 > ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 > Uname: Linux 4.4.0-31-generic aarch64 > AlsaDevices: >total 0 >crw-rw 1 root audio 116, 1 Aug 5 09:27 seq >crw-rw 1 root audio 116, 33 Aug 5 09:27 timer > AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' > ApportVersion: 2.20.1-0ubuntu2.1 > Architecture: arm64 > ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' > AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', > '/dev/snd/timer'] failed with exit code 1: > CRDA: N/A > Date: Fri Aug 5 16:15:51 2016 > IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' > Lsusb: Error: command ['lsusb'] failed with exit code 1: > PciMultimedia: > > ProcEnviron: >TERM=xterm >PATH=(custom, no user) >XDG_RUNTIME_DIR= >LANG=en_US.UTF-8 >SHELL=/bin/bash > ProcFB: > > ProcKernelCmdLine: console=ttyS0,9600n8r ro > RelatedPackageVersions: >linux-restricted-modules-4.4.0-31-generic N/A >linux-backports-modules-4.4.0-31-generic N/A >linux-firmware1.157.2 > RfKill: Error: [Errno 2] No such file or directory: 'rfkill' > SourcePackage: linux > UpgradeStatus: No upgrade log present (probably fresh install) > > To manage notifications about this bug go to: > https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions > > Launchpad-Notification-Type: bug > Launchpad-Bug: distribution=ubuntu; sourcepackage=linux; component=main; > status=Confirmed; importance=High; assignee=None; > Launchpad-Bug-Tags: apport-bug arm64 blocks-hwcert-server kernel-da-key > kernel-fixed-upstream uec-images xenial > Launchpad-Bug-Information-Type: Public > Launchpad-Bug-Private: no > Launchpad-Bug-Security-Vulnerability: no > Launchpad-Bug-Commenters: bladernr brad-figg jsalisbury mreed8855 > Launchpad-Bug-Reporter: Jeff Lane (bladernr) > Launchpad-Bug-Modifier: Michael Reed (mreed8855) > Launchpad-Message-Rationale: Subscriber > Launchpad-Message-For: bladernr -- "Entropy isn't what it used to be." Jeff Lane - Server Certification Lead, Warrior Poet, Biker, Lover of Pie Phone: 919-442-8649 Ubuntu Ham: W4KDH Freenode IRC: bladernr or bladernr_ gpg: 1024D/3A14B2DD 8C88 B076 0DD7 B404 1417 C466 4ABD 3635 3A14 B2DD -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to
[Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
I just ran this on 4.4.0-34-generic from the dev ppa on an arm64 system and it passed. I am re-running this now to see if it reliably passes. ubuntu@ubuntu:~$ sudo apt-cache policy canonical-certification-server canonical-certification-server: Installed: 0.28+201607151353~ubuntu16.04.1 Candidate: 0.28+201607151353~ubuntu16.04.1 Version table: *** 0.28+201607151353~ubuntu16.04.1 500 500 http://ppa.launchpad.net/checkbox-dev/ppa/ubuntu xenial/main arm64 Packages 100 /var/lib/dpkg/status ubuntu@ubuntu:~$ uname -a Linux ubuntu 4.4.0-34-generic #53-Ubuntu SMP Wed Jul 27 16:07:06 UTC 2016 aarch64 aarch64 aarch64 GNU/Linux -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Confirmed Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the bigheap test runs, every time so far (two of two runs have failed). I'm doing a third now to confirm that bigheap is where the lockup occurs. I've also run this exact same test on similar and smaller memory amounts on s390x and amd64 without problem. This is also being done to provide data for a similar bug discovered on Power (ppc64le). To test this on an arm64 system: Install Xenial $ add-apt-repository ppa:hardware-certification/public $ apt update $ apt install canonical-certification-server $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng the memory_stress_ng script is a wrapper for stress_ng that only calls certain memory tests. See script for an idea of how it's executing the tests. This could be the same issue that we're seeing on power, or it could be a different issue for ARM that looks similar. Here's the original Power bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.4.0-31-generic 4.4.0-31.50 ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 Uname: Linux 4.4.0-31-generic aarch64 AlsaDevices: total 0 crw-rw 1 root audio 116, 1 Aug 5 09:27 seq crw-rw 1 root audio 116, 33 Aug 5 09:27 timer AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.20.1-0ubuntu2.1 Architecture: arm64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Date: Fri Aug 5 16:15:51 2016 IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' Lsusb: Error: command ['lsusb'] failed with exit code 1: PciMultimedia: ProcEnviron: TERM=xterm PATH=(custom, no user) XDG_RUNTIME_DIR= LANG=en_US.UTF-8 SHELL=/bin/bash ProcFB: ProcKernelCmdLine: console=ttyS0,9600n8r ro RelatedPackageVersions: linux-restricted-modules-4.4.0-31-generic N/A linux-backports-modules-4.4.0-31-generic N/A linux-firmware1.157.2 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
I built a v4.4.16 kernel, which can be downloaded from: http://kernel.ubuntu.com/~jsalisbury/lp1610320/v4.4.16/ Can you give that kernel a try? -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Confirmed Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the bigheap test runs, every time so far (two of two runs have failed). I'm doing a third now to confirm that bigheap is where the lockup occurs. I've also run this exact same test on similar and smaller memory amounts on s390x and amd64 without problem. This is also being done to provide data for a similar bug discovered on Power (ppc64le). To test this on an arm64 system: Install Xenial $ add-apt-repository ppa:hardware-certification/public $ apt update $ apt install canonical-certification-server $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng the memory_stress_ng script is a wrapper for stress_ng that only calls certain memory tests. See script for an idea of how it's executing the tests. This could be the same issue that we're seeing on power, or it could be a different issue for ARM that looks similar. Here's the original Power bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.4.0-31-generic 4.4.0-31.50 ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 Uname: Linux 4.4.0-31-generic aarch64 AlsaDevices: total 0 crw-rw 1 root audio 116, 1 Aug 5 09:27 seq crw-rw 1 root audio 116, 33 Aug 5 09:27 timer AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.20.1-0ubuntu2.1 Architecture: arm64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Date: Fri Aug 5 16:15:51 2016 IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' Lsusb: Error: command ['lsusb'] failed with exit code 1: PciMultimedia: ProcEnviron: TERM=xterm PATH=(custom, no user) XDG_RUNTIME_DIR= LANG=en_US.UTF-8 SHELL=/bin/bash ProcFB: ProcKernelCmdLine: console=ttyS0,9600n8r ro RelatedPackageVersions: linux-restricted-modules-4.4.0-31-generic N/A linux-backports-modules-4.4.0-31-generic N/A linux-firmware1.157.2 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
That is good news the 4.7 kernel does not exhibit the bug. We should be able to perform a "Reverse" bisect to identify the commit that fixes the bug. We should next test the latest 4.4.16 upstream stable kernel to see if the fix was already sent to upstream stable? If 4.4.16 still has the bug, we should then see if upstream 4.6 has the bug. However, the arm64 kernels were just recently added to the automatic build process in 4.7, so I'll manually build you those kernels. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Confirmed Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the bigheap test runs, every time so far (two of two runs have failed). I'm doing a third now to confirm that bigheap is where the lockup occurs. I've also run this exact same test on similar and smaller memory amounts on s390x and amd64 without problem. This is also being done to provide data for a similar bug discovered on Power (ppc64le). To test this on an arm64 system: Install Xenial $ add-apt-repository ppa:hardware-certification/public $ apt update $ apt install canonical-certification-server $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng the memory_stress_ng script is a wrapper for stress_ng that only calls certain memory tests. See script for an idea of how it's executing the tests. This could be the same issue that we're seeing on power, or it could be a different issue for ARM that looks similar. Here's the original Power bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.4.0-31-generic 4.4.0-31.50 ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 Uname: Linux 4.4.0-31-generic aarch64 AlsaDevices: total 0 crw-rw 1 root audio 116, 1 Aug 5 09:27 seq crw-rw 1 root audio 116, 33 Aug 5 09:27 timer AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.20.1-0ubuntu2.1 Architecture: arm64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Date: Fri Aug 5 16:15:51 2016 IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' Lsusb: Error: command ['lsusb'] failed with exit code 1: PciMultimedia: ProcEnviron: TERM=xterm PATH=(custom, no user) XDG_RUNTIME_DIR= LANG=en_US.UTF-8 SHELL=/bin/bash ProcFB: ProcKernelCmdLine: console=ttyS0,9600n8r ro RelatedPackageVersions: linux-restricted-modules-4.4.0-31-generic N/A linux-backports-modules-4.4.0-31-generic N/A linux-firmware1.157.2 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
** Tags removed: kernel-key ** Tags added: kernel-da-key -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Confirmed Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the bigheap test runs, every time so far (two of two runs have failed). I'm doing a third now to confirm that bigheap is where the lockup occurs. I've also run this exact same test on similar and smaller memory amounts on s390x and amd64 without problem. This is also being done to provide data for a similar bug discovered on Power (ppc64le). To test this on an arm64 system: Install Xenial $ add-apt-repository ppa:hardware-certification/public $ apt update $ apt install canonical-certification-server $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng the memory_stress_ng script is a wrapper for stress_ng that only calls certain memory tests. See script for an idea of how it's executing the tests. This could be the same issue that we're seeing on power, or it could be a different issue for ARM that looks similar. Here's the original Power bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.4.0-31-generic 4.4.0-31.50 ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 Uname: Linux 4.4.0-31-generic aarch64 AlsaDevices: total 0 crw-rw 1 root audio 116, 1 Aug 5 09:27 seq crw-rw 1 root audio 116, 33 Aug 5 09:27 timer AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.20.1-0ubuntu2.1 Architecture: arm64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Date: Fri Aug 5 16:15:51 2016 IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' Lsusb: Error: command ['lsusb'] failed with exit code 1: PciMultimedia: ProcEnviron: TERM=xterm PATH=(custom, no user) XDG_RUNTIME_DIR= LANG=en_US.UTF-8 SHELL=/bin/bash ProcFB: ProcKernelCmdLine: console=ttyS0,9600n8r ro RelatedPackageVersions: linux-restricted-modules-4.4.0-31-generic N/A linux-backports-modules-4.4.0-31-generic N/A linux-firmware1.157.2 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
So running the upstream 4.7.0 kernel and had 0 failures out of 31 runs so it's pretty safe to assume that this isn't an affect upstream. ** Tags added: kernel-fixed-upstream -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Confirmed Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the bigheap test runs, every time so far (two of two runs have failed). I'm doing a third now to confirm that bigheap is where the lockup occurs. I've also run this exact same test on similar and smaller memory amounts on s390x and amd64 without problem. This is also being done to provide data for a similar bug discovered on Power (ppc64le). To test this on an arm64 system: Install Xenial $ add-apt-repository ppa:hardware-certification/public $ apt update $ apt install canonical-certification-server $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng the memory_stress_ng script is a wrapper for stress_ng that only calls certain memory tests. See script for an idea of how it's executing the tests. This could be the same issue that we're seeing on power, or it could be a different issue for ARM that looks similar. Here's the original Power bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.4.0-31-generic 4.4.0-31.50 ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 Uname: Linux 4.4.0-31-generic aarch64 AlsaDevices: total 0 crw-rw 1 root audio 116, 1 Aug 5 09:27 seq crw-rw 1 root audio 116, 33 Aug 5 09:27 timer AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.20.1-0ubuntu2.1 Architecture: arm64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Date: Fri Aug 5 16:15:51 2016 IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' Lsusb: Error: command ['lsusb'] failed with exit code 1: PciMultimedia: ProcEnviron: TERM=xterm PATH=(custom, no user) XDG_RUNTIME_DIR= LANG=en_US.UTF-8 SHELL=/bin/bash ProcFB: ProcKernelCmdLine: console=ttyS0,9600n8r ro RelatedPackageVersions: linux-restricted-modules-4.4.0-31-generic N/A linux-backports-modules-4.4.0-31-generic N/A linux-firmware1.157.2 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
This appears to reliably happen with the bigheap test on ARM64. I've tried two systems and both lock up shortly after the bigheap test begins. This may need some input from @cking as stress-ng is his test. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Confirmed Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the bigheap test runs, every time so far (two of two runs have failed). I'm doing a third now to confirm that bigheap is where the lockup occurs. I've also run this exact same test on similar and smaller memory amounts on s390x and amd64 without problem. This is also being done to provide data for a similar bug discovered on Power (ppc64le). To test this on an arm64 system: Install Xenial $ add-apt-repository ppa:hardware-certification/public $ apt update $ apt install canonical-certification-server $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng the memory_stress_ng script is a wrapper for stress_ng that only calls certain memory tests. See script for an idea of how it's executing the tests. This could be the same issue that we're seeing on power, or it could be a different issue for ARM that looks similar. Here's the original Power bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.4.0-31-generic 4.4.0-31.50 ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 Uname: Linux 4.4.0-31-generic aarch64 AlsaDevices: total 0 crw-rw 1 root audio 116, 1 Aug 5 09:27 seq crw-rw 1 root audio 116, 33 Aug 5 09:27 timer AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.20.1-0ubuntu2.1 Architecture: arm64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Date: Fri Aug 5 16:15:51 2016 IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' Lsusb: Error: command ['lsusb'] failed with exit code 1: PciMultimedia: ProcEnviron: TERM=xterm PATH=(custom, no user) XDG_RUNTIME_DIR= LANG=en_US.UTF-8 SHELL=/bin/bash ProcFB: ProcKernelCmdLine: console=ttyS0,9600n8r ro RelatedPackageVersions: linux-restricted-modules-4.4.0-31-generic N/A linux-backports-modules-4.4.0-31-generic N/A linux-firmware1.157.2 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1610320] Re: stress-ng memory testing causes Arm64 system to hang
Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.7 kernel[0]. If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'. If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'. Once testing of the upstream kernel is complete, please mark this bug as "Confirmed". Thanks in advance. [0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.7 ** Changed in: linux (Ubuntu) Importance: Undecided => High ** Tags added: kernel-key -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1610320 Title: stress-ng memory testing causes Arm64 system to hang Status in linux package in Ubuntu: Confirmed Bug description: Running the certification memory test using stress-ng on an ARM64 system with 64GB roughly of RAM (MAAS shows 63GB). The test runs several stress-ng memory related tests. It appears that the system locks up when the bigheap test runs, every time so far (two of two runs have failed). I'm doing a third now to confirm that bigheap is where the lockup occurs. I've also run this exact same test on similar and smaller memory amounts on s390x and amd64 without problem. This is also being done to provide data for a similar bug discovered on Power (ppc64le). To test this on an arm64 system: Install Xenial $ add-apt-repository ppa:hardware-certification/public $ apt update $ apt install canonical-certification-server $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng the memory_stress_ng script is a wrapper for stress_ng that only calls certain memory tests. See script for an idea of how it's executing the tests. This could be the same issue that we're seeing on power, or it could be a different issue for ARM that looks similar. Here's the original Power bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062 ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.4.0-31-generic 4.4.0-31.50 ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13 Uname: Linux 4.4.0-31-generic aarch64 AlsaDevices: total 0 crw-rw 1 root audio 116, 1 Aug 5 09:27 seq crw-rw 1 root audio 116, 33 Aug 5 09:27 timer AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.20.1-0ubuntu2.1 Architecture: arm64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Date: Fri Aug 5 16:15:51 2016 IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig' Lsusb: Error: command ['lsusb'] failed with exit code 1: PciMultimedia: ProcEnviron: TERM=xterm PATH=(custom, no user) XDG_RUNTIME_DIR= LANG=en_US.UTF-8 SHELL=/bin/bash ProcFB: ProcKernelCmdLine: console=ttyS0,9600n8r ro RelatedPackageVersions: linux-restricted-modules-4.4.0-31-generic N/A linux-backports-modules-4.4.0-31-generic N/A linux-firmware1.157.2 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp