Public bug reported:

Running the certification memory test using stress-ng on an ARM64 system
with 64GB roughly of RAM (MAAS shows 63GB).

The test runs several stress-ng memory related tests.  It appears that
the system locks up when the bigheap test runs, every time so far (two
of two runs have failed).

I'm doing a third now to confirm that bigheap is where the lockup
occurs.

I've also run this exact same test on similar and smaller memory amounts
on s390x and amd64 without problem.

This is also being done to provide data for a similar bug discovered on
Power (ppc64le).

To test this on an arm64 system:

Install Xenial
$ add-apt-repository ppa:hardware-certification/public
$ apt update
$ apt install canonical-certification-server
$ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng

the memory_stress_ng script is a wrapper for stress_ng that only calls
certain memory tests.  See script for an idea of how it's executing the
tests.

This could be the same issue that we're seeing on power, or it could be
a different issue for ARM that looks similar.  Here's the original Power
bug:

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062

ProblemType: Bug
DistroRelease: Ubuntu 16.04
Package: linux-image-4.4.0-31-generic 4.4.0-31.50
ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13
Uname: Linux 4.4.0-31-generic aarch64
AlsaDevices:
 total 0
 crw-rw---- 1 root audio 116,  1 Aug  5 09:27 seq
 crw-rw---- 1 root audio 116, 33 Aug  5 09:27 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
ApportVersion: 2.20.1-0ubuntu2.1
Architecture: arm64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
CRDA: N/A
Date: Fri Aug  5 16:15:51 2016
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
Lsusb: Error: command ['lsusb'] failed with exit code 1:
PciMultimedia:
 
ProcEnviron:
 TERM=xterm
 PATH=(custom, no user)
 XDG_RUNTIME_DIR=<set>
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcFB:
 
ProcKernelCmdLine: console=ttyS0,9600n8r ro
RelatedPackageVersions:
 linux-restricted-modules-4.4.0-31-generic N/A
 linux-backports-modules-4.4.0-31-generic  N/A
 linux-firmware                            1.157.2
RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)

** Affects: linux (Ubuntu)
     Importance: Undecided
         Status: Confirmed


** Tags: apport-bug arm64 blocks-hwcert-server uec-images xenial

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1610320

Title:
  stress-ng memory testing causes Arm64 system to hang

Status in linux package in Ubuntu:
  Confirmed

Bug description:
  Running the certification memory test using stress-ng on an ARM64
  system with 64GB roughly of RAM (MAAS shows 63GB).

  The test runs several stress-ng memory related tests.  It appears that
  the system locks up when the bigheap test runs, every time so far (two
  of two runs have failed).

  I'm doing a third now to confirm that bigheap is where the lockup
  occurs.

  I've also run this exact same test on similar and smaller memory
  amounts on s390x and amd64 without problem.

  This is also being done to provide data for a similar bug discovered
  on Power (ppc64le).

  To test this on an arm64 system:

  Install Xenial
  $ add-apt-repository ppa:hardware-certification/public
  $ apt update
  $ apt install canonical-certification-server
  $ /usr/lib/plainbox-provider-checkbox/bin/memory_stress_ng

  the memory_stress_ng script is a wrapper for stress_ng that only calls
  certain memory tests.  See script for an idea of how it's executing
  the tests.

  This could be the same issue that we're seeing on power, or it could
  be a different issue for ARM that looks similar.  Here's the original
  Power bug:

  https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1573062

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.4.0-31-generic 4.4.0-31.50
  ProcVersionSignature: User Name 4.4.0-31.50-generic 4.4.13
  Uname: Linux 4.4.0-31-generic aarch64
  AlsaDevices:
   total 0
   crw-rw---- 1 root audio 116,  1 Aug  5 09:27 seq
   crw-rw---- 1 root audio 116, 33 Aug  5 09:27 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.1-0ubuntu2.1
  Architecture: arm64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  CRDA: N/A
  Date: Fri Aug  5 16:15:51 2016
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb: Error: command ['lsusb'] failed with exit code 1:
  PciMultimedia:
   
  ProcEnviron:
   TERM=xterm
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=<set>
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB:
   
  ProcKernelCmdLine: console=ttyS0,9600n8r ro
  RelatedPackageVersions:
   linux-restricted-modules-4.4.0-31-generic N/A
   linux-backports-modules-4.4.0-31-generic  N/A
   linux-firmware                            1.157.2
  RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
  SourcePackage: linux
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1610320/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to