[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2020-03-03 Thread Jeff Lane
** Tags removed: hwcert-server

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Artful:
  Fix Released
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2019-10-03 Thread Po-Hsu Lin
** Changed in: linux (Ubuntu)
   Status: Fix Committed => Fix Released

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Artful:
  Fix Released
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-04-03 Thread Launchpad Bug Tracker
This bug was fixed in the package linux - 4.13.0-38.43

---
linux (4.13.0-38.43) artful; urgency=medium

  * linux: 4.13.0-38.43 -proposed tracker (LP: #1755762)

  * Servers going OOM after updating kernel from 4.10 to 4.13 (LP: #1748408)
- i40e: Fix memory leak related filter programming status
- i40e: Add programming descriptors to cleaned_count

  * [SRU] Lenovo E41 Mic mute hotkey is not responding (LP: #1753347)
- platform/x86: ideapad-laptop: Increase timeout to wait for EC answer

  * fails to dump with latest kpti fixes (LP: #1750021)
- kdump: write correct address of mem_section into vmcoreinfo

  * headset mic can't be detected on two Dell machines (LP: #1748807)
- ALSA: hda/realtek - Support headset mode for ALC215/ALC285/ALC289
- ALSA: hda - Fix headset mic detection problem for two Dell machines
- ALSA: hda - Fix a wrong FIXUP for alc289 on Dell machines

  * CIFS SMB2/SMB3 does not work for domain based DFS (LP: #1747572)
- CIFS: make IPC a regular tcon
- CIFS: use tcon_ipc instead of use_ipc parameter of SMB2_ioctl
- CIFS: dump IPC tcon in debug proc file

  * i2c-thunderx: erroneous error message "unhandled state: 0" (LP: #1754076)
- i2c: octeon: Prevent error message on bus error

  * hisi_sas: Add disk LED support (LP: #1752695)
- scsi: hisi_sas: directly attached disk LED feature for v2 hw

  * EDAC, sb_edac: Backport 1 patch to Ubuntu 17.10 (Fix missing DIMM sysfs
entries with KNL SNC2/SNC4 mode) (LP: #1743856)
- EDAC, sb_edac: Fix missing DIMM sysfs entries with KNL SNC2/SNC4 mode

  * [regression] Colour banding and artefacts appear system-wide on an Asus
Zenbook UX303LA with Intel HD 4400 graphics (LP: #1749420)
- drm/edid: Add 6 bpc quirk for CPT panel in Asus UX303LA

  * DVB Card with SAA7146 chipset not working (LP: #1742316)
- vmalloc: fix __GFP_HIGHMEM usage for vmalloc_32 on 32b systems

  * [Asus UX360UA] battery status in unity-panel is not changing when battery is
being charged (LP: #1661876) // AC adapter status not detected on Asus
ZenBook UX410UAK (LP: #1745032)
- ACPI / battery: Add quirk for Asus UX360UA and UX410UAK

  * ASUS UX305LA - Battery state not detected correctly (LP: #1482390)
- ACPI / battery: Add quirk for Asus GL502VSK and UX305LA

  * support thunderx2 vendor pmu events (LP: #1747523)
- perf pmu: Extract function to get JSON alias map
- perf pmu: Pass pmu as a parameter to get_cpuid_str()
- perf tools arm64: Add support for get_cpuid_str function.
- perf pmu: Add helper function is_pmu_core to detect PMU CORE devices
- perf vendor events arm64: Add ThunderX2 implementation defined pmu core
  events
- perf pmu: Add check for valid cpuid in perf_pmu__find_map()

  * lpfc.ko module doesn't work (LP: #1746970)
- scsi: lpfc: Fix loop mode target discovery

  * Ubuntu 17.10 crashes on vmalloc.c (LP: #1739498)
- powerpc/mm/book3s64: Make KERN_IO_START a variable
- powerpc/mm/slb: Move comment next to the code it's referring to
- powerpc/mm/hash64: Make vmalloc 56T on hash

  * ethtool -p fails to light NIC LED on HiSilicon D05 systems (LP: #1748567)
- net: hns: add ACPI mode support for ethtool -p

  * CVE-2017-17807
- KEYS: add missing permission check for request_key() destination

  * [Artful SRU] Fix capsule update regression (LP: #1746019)
- efi/capsule-loader: Reinstate virtual capsule mapping

  * [Artful/Bionic] [Config] enable EDAC_GHES for ARM64 (LP: #1747746)
- Ubuntu: [Config] enable EDAC_GHES for ARM64

  * linux-tools: perf incorrectly linking libbfd (LP: #1748922)
- SAUCE: tools -- add ability to disable libbfd
- [Packaging] correct disablement of libbfd

  * Cherry pick c96f5471ce7d for delayacct fix (LP: #1747769)
- delayacct: Account blkio completion on the correct task

  * Error in CPU frequency reporting when nominal and min pstates are same
(cpufreq) (LP: #1746174)
- cpufreq: powernv: Dont assume distinct pstate values for nominal and pmin

  * retpoline abi files are empty on i386 (LP: #1751021)
- [Packaging] retpoline-extract -- instantiate retpoline files for i386
- [Packaging] final-checks -- sanity checking ABI contents
- [Packaging] final-checks -- check for empty retpoline files

  * [P9,Power NV][WSP][Ubuntu 1804] : "Kernel access of bad area " when grouping
different pmu events using perf fuzzer . (perf:) (LP: #1746225)
- powerpc/perf: Fix oops when grouping different pmu events

  * bnx2x_attn_int_deasserted3:4323 MC assert! (LP: #1715519) //
CVE-2018-126
- net: create skb_gso_validate_mac_len()
- bnx2x: disable GSO where gso_size is too big for hardware

  * Ubuntu16.04.03: ISAv3 initialize MMU registers before setting partition
table (LP: #1736145)
- powerpc/64s: Initialize ISAv3 MMU registers before setting partition table

  * powerpc/powernv: Flush console before platform error reboot (LP: #1735159)
- 

[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-03-20 Thread Rod Smith
I've tested kernel 4.13.0-38-generic #43-Ubuntu from artful-proposed and
the problem does not occur with that kernel.

** Tags removed: verification-needed-artful
** Tags added: verification-done-artful

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Artful:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-03-19 Thread Stefan Bader
This bug is awaiting verification that the kernel in -proposed solves
the problem. Please test the kernel and update this bug with the
results. If the problem is solved, change the tag 'verification-needed-
artful' to 'verification-done-artful'. If the problem still exists,
change the tag 'verification-needed-artful' to 'verification-failed-
artful'.

If verification is not done by 5 working days from today, this fix will
be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how
to enable and use -proposed. Thank you!


** Tags added: verification-needed-artful

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Artful:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-03-08 Thread Joseph Salisbury
** Changed in: linux (Ubuntu Artful)
   Status: In Progress => Fix Committed

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Artful:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-03-07 Thread Joseph Salisbury
** Changed in: linux (Ubuntu Artful)
   Status: Fix Committed => In Progress

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Artful:
  In Progress
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-03-05 Thread Per Allansson
I have similar issues on 16.04.4 with latest HWE kernel - and when
double-checking against the source code I can see that this fix is now
AWOL from:

linux-image-4.13.0-36-generic   4.13.0-36.40~16.04.1

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Artful:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-02-03 Thread Khaled El Mously
** Changed in: linux (Ubuntu Artful)
   Status: In Progress => Fix Committed

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Artful:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-19 Thread Seth Forshee
** Changed in: linux (Ubuntu Bionic)
   Status: In Progress => Fix Committed

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Artful:
  In Progress
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-19 Thread Joseph Salisbury
SRU request submitted for Artful and Bionic.

https://lists.ubuntu.com/archives/kernel-team/2018-January/089403.html

** Description changed:

- In doing Ubuntu 17.10 regression testing, we've encountered one computer
- (boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times
- when running our cpu_offlining test. This test attempts to take all the
- CPU cores offline except one, then brings them back online again. This
- test ran successfully on boldore with previous releases, but with 17.10,
- the system sometimes (about one in four runs) hangs. Reverting to Ubuntu
- 16.04.3, I found no problems; but when I upgraded the 16.04.3
- installation to linux-image-4.13.0-16-generic, the problem appeared
- again, so I'm confident this is a problem with the kernel. I'm attaching
- two files, dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show
- the dmesg output that appears when running the cpu_offlining test with
- 4.10.0-38 and 4.13.0-16 kernels, respectively; the system hung on the
- 4.13 run. (I was running "dmesg -w" in a second SSH login; the files are
- cut-and-pasted from that.)
+ == SRU Justification ==
+ The following mainline commit introduced a regression in v4.14-rc1:
+ 24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")
+ 
+ This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
+ ac2fc5adab0f4b.
+ 
+ This bug was causing regression tests to hang about one in four
+ times when running cpu_offlining tests.
+ 
+ This patch to fix this regression was just submitted to mainline, so it is 
also
+ needed in Bionic.
+ 
+ == Fix ==
+ commit d47924417319e3b6a728c0b690f183e75bc2a702
+ Author: Thomas Gleixner 
+ Date:   Tue Jan 16 19:59:59 2018 +0100
+ 
+ x86/intel_rdt/cqm: Prevent use after free
+ 
+ == Regression Potential ==
+ Low.  This patch fixes a current regression that is a use after free.
+ 
+ 
+ ### Original Bug Description ###
+ In doing Ubuntu 17.10 regression testing, we've encountered one computer 
(boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in four times when 
running our cpu_offlining test. This test attempts to take all the CPU cores 
offline except one, then brings them back online again. This test ran 
successfully on boldore with previous releases, but with 17.10, the system 
sometimes (about one in four runs) hangs. Reverting to Ubuntu 16.04.3, I found 
no problems; but when I upgraded the 16.04.3 installation to 
linux-image-4.13.0-16-generic, the problem appeared again, so I'm confident 
this is a problem with the kernel. I'm attaching two files, 
dmesg-output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output 
that appears when running the cpu_offlining test with 4.10.0-38 and 4.13.0-16 
kernels, respectively; the system hung on the 4.13 run. (I was running "dmesg 
-w" in a second SSH login; the files are cut-and-pasted from that.)
  
  I initiated this bug report from an Ubuntu 16.04.3 installation running
  a 4.10 kernel; but as I said, this applies to the 4.13 kernel.
  
  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
-  TERM=xterm-256color
-  PATH=(custom, no user)
-  XDG_RUNTIME_DIR=
-  LANG=en_US.UTF-8
-  SHELL=/bin/bash
+  TERM=xterm-256color
+  PATH=(custom, no user)
+  XDG_RUNTIME_DIR=
+  LANG=en_US.UTF-8
+  SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Artful:
  In Progress
Status in linux source package in Bionic:
  In Progress

Bug description:
  == SRU Justification ==
  The following mainline commit introduced a regression in v4.14-rc1:
  24247aeeabe9 ("x86/intel_rdt/cqm: Improve limbo list processing")

  This commit made it's way into Artful via Launchpad bug 1591609 as Artful 
commit
  ac2fc5adab0f4b.

  This bug was causing regression tests to hang about one in four
  times when running cpu_offlining tests.

  This patch to fix this regression was just submitted to mainline, so it is 
also
  needed in Bionic.

  == Fix ==
  commit d47924417319e3b6a728c0b690f183e75bc2a702
  Author: Thomas Gleixner 
  Date:   Tue Jan 16 19:59:59 2018 +0100

  x86/intel_rdt/cqm: Prevent use after free

  == Regression Potential ==
  Low.  This patch fixes a current regression that is a use after free.


  ### Original Bug Description ###
  In doing Ubuntu 17.10 regression testing, we've 

[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-18 Thread Rod Smith
I ran it half a dozen times with your latest kernel and it seemed fine,
aside from the usual "error -19" messages. To be sure it's the right
one, here's the kernel version information:

ubuntu@oil-boldore:~$ uname -a
Linux oil-boldore 4.13.0-25-generic #29~lp1733662PatchInMainline SMP Thu Jan 18 
15:58:13 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Artful:
  In Progress
Status in linux source package in Bionic:
  In Progress

Bug description:
  In doing Ubuntu 17.10 regression testing, we've encountered one
  computer (boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in
  four times when running our cpu_offlining test. This test attempts to
  take all the CPU cores offline except one, then brings them back
  online again. This test ran successfully on boldore with previous
  releases, but with 17.10, the system sometimes (about one in four
  runs) hangs. Reverting to Ubuntu 16.04.3, I found no problems; but
  when I upgraded the 16.04.3 installation to linux-
  image-4.13.0-16-generic, the problem appeared again, so I'm confident
  this is a problem with the kernel. I'm attaching two files, dmesg-
  output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output
  that appears when running the cpu_offlining test with 4.10.0-38 and
  4.13.0-16 kernels, respectively; the system hung on the 4.13 run. (I
  was running "dmesg -w" in a second SSH login; the files are cut-and-
  pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-18 Thread Joseph Salisbury
I built one last Artful test kernel with the patch tglx submitted to
mainline. The test kernel can be downloaded from:

http://kernel.ubuntu.com/~jsalisbury/lp1733662

Can you test this kernel and confirm it resolves the bug?

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Artful:
  In Progress
Status in linux source package in Bionic:
  In Progress

Bug description:
  In doing Ubuntu 17.10 regression testing, we've encountered one
  computer (boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in
  four times when running our cpu_offlining test. This test attempts to
  take all the CPU cores offline except one, then brings them back
  online again. This test ran successfully on boldore with previous
  releases, but with 17.10, the system sometimes (about one in four
  runs) hangs. Reverting to Ubuntu 16.04.3, I found no problems; but
  when I upgraded the 16.04.3 installation to linux-
  image-4.13.0-16-generic, the problem appeared again, so I'm confident
  this is a problem with the kernel. I'm attaching two files, dmesg-
  output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output
  that appears when running the cpu_offlining test with 4.10.0-38 and
  4.13.0-16 kernels, respectively; the system hung on the 4.13 run. (I
  was running "dmesg -w" in a second SSH login; the files are cut-and-
  pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-18 Thread Joseph Salisbury
** No longer affects: linux-hwe (Ubuntu)

** No longer affects: linux-hwe (Ubuntu Artful)

** No longer affects: linux-hwe (Ubuntu Bionic)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Artful:
  In Progress
Status in linux source package in Bionic:
  In Progress

Bug description:
  In doing Ubuntu 17.10 regression testing, we've encountered one
  computer (boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in
  four times when running our cpu_offlining test. This test attempts to
  take all the CPU cores offline except one, then brings them back
  online again. This test ran successfully on boldore with previous
  releases, but with 17.10, the system sometimes (about one in four
  runs) hangs. Reverting to Ubuntu 16.04.3, I found no problems; but
  when I upgraded the 16.04.3 installation to linux-
  image-4.13.0-16-generic, the problem appeared again, so I'm confident
  this is a problem with the kernel. I'm attaching two files, dmesg-
  output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output
  that appears when running the cpu_offlining test with 4.10.0-38 and
  4.13.0-16 kernels, respectively; the system hung on the 4.13 run. (I
  was running "dmesg -w" in a second SSH login; the files are cut-and-
  pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-17 Thread Rod Smith
That seems to have fixed it! I've run the test script six or seven times
on both kernels, with nary a hiccup (aside from the "error -19" messages
with the 4.13 kernel). Below is the reported kernel information from
both your builds, just to be sure I booted the correct kernels.

$ uname -a
Linux oil-boldore 4.13.0-25-generic #29~lp1733662PatchFromUpstream SMP Wed Jan 
17 20:13:36 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

$ uname -a
Linux oil-boldore 4.15.0-041500rc8-generic #201801172011 SMP Wed Jan 17 
20:13:51 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  In Progress
Status in linux-hwe package in Ubuntu:
  Confirmed
Status in linux source package in Artful:
  In Progress
Status in linux-hwe source package in Artful:
  Confirmed
Status in linux source package in Bionic:
  In Progress
Status in linux-hwe source package in Bionic:
  Confirmed

Bug description:
  In doing Ubuntu 17.10 regression testing, we've encountered one
  computer (boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in
  four times when running our cpu_offlining test. This test attempts to
  take all the CPU cores offline except one, then brings them back
  online again. This test ran successfully on boldore with previous
  releases, but with 17.10, the system sometimes (about one in four
  runs) hangs. Reverting to Ubuntu 16.04.3, I found no problems; but
  when I upgraded the 16.04.3 installation to linux-
  image-4.13.0-16-generic, the problem appeared again, so I'm confident
  this is a problem with the kernel. I'm attaching two files, dmesg-
  output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output
  that appears when running the cpu_offlining test with 4.10.0-38 and
  4.13.0-16 kernels, respectively; the system hung on the 4.13 run. (I
  was running "dmesg -w" in a second SSH login; the files are cut-and-
  pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-17 Thread Joseph Salisbury
I built Artful and mainline test kernels with the patch from tglx.  The
test kernels can be downloaded from:

Artful: http://kernel.ubuntu.com/~jsalisbury/lp1733662/artful
mainline: http://kernel.ubuntu.com/~jsalisbury/lp1733662/mainline

Can you test these kernels out and see if they resolve the bug?

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  In Progress
Status in linux-hwe package in Ubuntu:
  Confirmed
Status in linux source package in Artful:
  In Progress
Status in linux-hwe source package in Artful:
  Confirmed
Status in linux source package in Bionic:
  In Progress
Status in linux-hwe source package in Bionic:
  Confirmed

Bug description:
  In doing Ubuntu 17.10 regression testing, we've encountered one
  computer (boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in
  four times when running our cpu_offlining test. This test attempts to
  take all the CPU cores offline except one, then brings them back
  online again. This test ran successfully on boldore with previous
  releases, but with 17.10, the system sometimes (about one in four
  runs) hangs. Reverting to Ubuntu 16.04.3, I found no problems; but
  when I upgraded the 16.04.3 installation to linux-
  image-4.13.0-16-generic, the problem appeared again, so I'm confident
  this is a problem with the kernel. I'm attaching two files, dmesg-
  output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output
  that appears when running the cpu_offlining test with 4.10.0-38 and
  4.13.0-16 kernels, respectively; the system hung on the 4.13 run. (I
  was running "dmesg -w" in a second SSH login; the files are cut-and-
  pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-16 Thread Rod Smith
Joseph,

The first run of your latest kernel completed; however, I noticed the
following in the dmesg output:

[  426.281083] 
==
[  426.286615] BUG: KASAN: use-after-free in find_first_bit+0x1f/0x80
[  426.291841] Read of size 8 at addr 883ff7c1e780 by task cpuhp/31/195

[  426.302209] CPU: 31 PID: 195 Comm: cpuhp/31 Not tainted 4.13.0-25-generic 
#29~lp1733662KASANenabled
[  426.302213] Hardware name: Cisco Systems Inc UCSC-C240-M4L/UCSC-C240-M4L, 
BIOS C240M4.2.0.10c.0.032320160820 03/23/2016
[  426.302215] Call Trace:
[  426.302233]  dump_stack+0xb8/0x12d
[  426.302241]  ? dma_virt_map_sg+0xd3/0xd3
[  426.302252]  ? show_regs_print_info+0x41/0x41
[  426.302263]  print_address_description+0x6f/0x280
[  426.302269]  kasan_report+0x27a/0x370
[  426.302276]  ? find_first_bit+0x1f/0x80
[  426.302288]  __asan_load8+0x54/0x90
[  426.302295]  find_first_bit+0x1f/0x80
[  426.302306]  has_busy_rmid+0x47/0x70
[  426.302314]  intel_rdt_offline_cpu+0x4b4/0x510
[  426.302321]  ? clear_closid_rmid.isra.4+0x70/0x70
[  426.302333]  ? sysfs_remove_group+0x7a/0xc0
[  426.302339]  ? clear_closid_rmid.isra.4+0x70/0x70
[  426.302351]  cpuhp_invoke_callback+0x15f/0x7e0
[  426.302360]  ? cpuhp_kick_ap_work+0x2d0/0x2d0
[  426.302372]  ? __schedule+0x4f1/0xeb0
[  426.302377]  ? cpuhp_kick_ap_work+0x2d0/0x2d0
[  426.302385]  ? firmware_map_remove+0x1b1/0x1b1
[  426.302395]  ? migrate_swap_stop+0x2f0/0x2f0
[  426.302402]  ? firmware_map_remove+0x1b1/0x1b1
[  426.302407]  ? migrate_swap_stop+0x2f0/0x2f0
[  426.302414]  ? schedule+0xd8/0x2a0
[  426.302421]  ? __schedule+0xeb0/0xeb0
[  426.302427]  ? default_wake_function+0x2f/0x40
[  426.302439]  ? __wake_up_common+0xa1/0xc0
[  426.302446]  cpuhp_down_callbacks+0x52/0xa0
[  426.302453]  cpuhp_thread_fun+0x117/0x1a0
[  426.302459]  ? cpu_up+0x20/0x20
[  426.302468]  smpboot_thread_fn+0x20e/0x2f0
[  426.302474]  ? sort_range+0x30/0x30
[  426.302482]  kthread+0x1b7/0x1e0
[  426.302488]  ? sort_range+0x30/0x30
[  426.302493]  ? kthread_create_on_node+0xc0/0xc0
[  426.302500]  ret_from_fork+0x1f/0x30

[  426.307683] Allocated by task 56:
[  426.312817]  save_stack_trace+0x1b/0x20
[  426.312824]  save_stack+0x43/0xd0
[  426.312829]  kasan_kmalloc+0xad/0xe0
[  426.312834]  __kmalloc+0x105/0x230
[  426.312840]  intel_rdt_online_cpu+0x5a8/0x830
[  426.312846]  cpuhp_invoke_callback+0x15f/0x7e0
[  426.312850]  cpuhp_thread_fun+0x8b/0x1a0
[  426.312856]  smpboot_thread_fn+0x20e/0x2f0
[  426.312861]  kthread+0x1b7/0x1e0
[  426.312866]  ret_from_fork+0x1f/0x30

[  426.317887] Freed by task 195:
[  426.322879]  save_stack_trace+0x1b/0x20
[  426.322887]  save_stack+0x43/0xd0
[  426.322891]  kasan_slab_free+0x72/0xc0
[  426.322896]  kfree+0x94/0x1a0
[  426.322902]  intel_rdt_offline_cpu+0x17d/0x510
[  426.322908]  cpuhp_invoke_callback+0x15f/0x7e0
[  426.322912]  cpuhp_down_callbacks+0x52/0xa0
[  426.322917]  cpuhp_thread_fun+0x117/0x1a0
[  426.322925]  smpboot_thread_fn+0x20e/0x2f0
[  426.322929]  kthread+0x1b7/0x1e0
[  426.322935]  ret_from_fork+0x1f/0x30

[  426.327837] The buggy address belongs to the object at 883ff7c1e780
which belongs to the cache kmalloc-8 of size 8
[  426.338289] The buggy address is located 0 bytes inside of
8-byte region [883ff7c1e780, 883ff7c1e788)
[  426.348805] The buggy address belongs to the page:
[  426.354223] page:ea00ffdf0780 count:1 mapcount:0 mapping:  
(null) index:0x0
[  426.359838] flags: 0x57c100(slab)
[  426.365373] raw: 0057c100   
000100aa00aa
[  426.371135] raw: dead0100 dead0200 8817f500fb80 

[  426.377004] page dumped because: kasan: bad access detected

[  426.388626] Memory state around the buggy address:
[  426.394498]  883ff7c1e680: fc fc 00 fc fc fb fc fc 00 fc fc fb fc fc 00 
fc
[  426.400634]  883ff7c1e700: fc 00 fc fc fb fc fc 00 fc fc fb fc fc fb fc 
fc
[  426.406721] >883ff7c1e780: fb fc fc fb fc fc fb fc fc 00 fc fc fb fc fc 
fb
[  426.412737]^
[  426.418698]  883ff7c1e800: fc fc fb fc fc fb fc fc fb fc fc fb fc fc fb 
fc
[  426.424961]  883ff7c1e880: fc 00 fc fc fb fc fc fb fc fc fb fc fc fb fc 
fc
[  426.431154] 
==
[  426.437413] Disabling lock debugging due to kernel taint
[  426.472795] IRQ 8: no longer affine to CPU31
[  426.472806] IRQ 9: no longer affine to CPU31
[  426.472827] IRQ 40: no longer affine to CPU31
[  426.473962] smpboot: CPU 31 is now offline

I ran it several more times without any obvious errors; however, I might
have missed something. (The dmesg output is quite verbose and scrolls by
quickly!)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to 

[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-15 Thread Joseph Salisbury
Hi Rod,

I built an Artful test kernel with KASAN enable.

The test kernel can be downloaded from:
http://kernel.ubuntu.com/~jsalisbury/lp1733662

Can you test this kernel as requested by upstream?

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  In Progress
Status in linux-hwe package in Ubuntu:
  Confirmed
Status in linux source package in Artful:
  In Progress
Status in linux-hwe source package in Artful:
  Confirmed
Status in linux source package in Bionic:
  In Progress
Status in linux-hwe source package in Bionic:
  Confirmed

Bug description:
  In doing Ubuntu 17.10 regression testing, we've encountered one
  computer (boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in
  four times when running our cpu_offlining test. This test attempts to
  take all the CPU cores offline except one, then brings them back
  online again. This test ran successfully on boldore with previous
  releases, but with 17.10, the system sometimes (about one in four
  runs) hangs. Reverting to Ubuntu 16.04.3, I found no problems; but
  when I upgraded the 16.04.3 installation to linux-
  image-4.13.0-16-generic, the problem appeared again, so I'm confident
  this is a problem with the kernel. I'm attaching two files, dmesg-
  output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output
  that appears when running the cpu_offlining test with 4.10.0-38 and
  4.13.0-16 kernels, respectively; the system hung on the 4.13 run. (I
  was running "dmesg -w" in a second SSH login; the files are cut-and-
  pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1733662] Re: System hang with Linux kernel due to mainline commit 24247aeeabe

2018-01-12 Thread Joseph Salisbury
Hi Vikas,

A kernel bug report was opened against Ubuntu [0].  After a kernel
bisect, it was found that reverting the following commit resolved this bug:

commit 24247aeeabe99eab13b798c2dec066dd6f07
Author: Vikas Shivappa 
Date:   Tue Aug 15 18:00:43 2017 -0700

    x86/intel_rdt/cqm: Improve limbo list processing


The regression was introduced as of v4.14-r1 and still exists with
current mainline.  The trace with v4.15-rc7 is in comment #44[1].

I was hoping to get your feedback, since you are the patch author.  Do
you think gathering any additional data will help diagnose this issue,
or would it be best to submit a revert request?


Thanks,

Joe
[0] http://pad.lv/1733662
[1]
https://bugs.launchpad.net/ubuntu/+source/linux-hwe/+bug/1733662/comments/44



** Summary changed:

- System hang with Linux kernel 4.13, not with 4.10
+ System hang with Linux kernel due to mainline commit 24247aeeabe

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1733662

Title:
  System hang with Linux kernel due to mainline commit 24247aeeabe

Status in linux package in Ubuntu:
  In Progress
Status in linux-hwe package in Ubuntu:
  Confirmed
Status in linux source package in Artful:
  In Progress
Status in linux-hwe source package in Artful:
  Confirmed
Status in linux source package in Bionic:
  In Progress
Status in linux-hwe source package in Bionic:
  Confirmed

Bug description:
  In doing Ubuntu 17.10 regression testing, we've encountered one
  computer (boldore, a Cisco UCS C240 M4 [VIC]), that hangs about one in
  four times when running our cpu_offlining test. This test attempts to
  take all the CPU cores offline except one, then brings them back
  online again. This test ran successfully on boldore with previous
  releases, but with 17.10, the system sometimes (about one in four
  runs) hangs. Reverting to Ubuntu 16.04.3, I found no problems; but
  when I upgraded the 16.04.3 installation to linux-
  image-4.13.0-16-generic, the problem appeared again, so I'm confident
  this is a problem with the kernel. I'm attaching two files, dmesg-
  output-4.10.txt and dmesg-output-4.13.txt, which show the dmesg output
  that appears when running the cpu_offlining test with 4.10.0-38 and
  4.13.0-16 kernels, respectively; the system hung on the 4.13 run. (I
  was running "dmesg -w" in a second SSH login; the files are cut-and-
  pasted from that.)

  I initiated this bug report from an Ubuntu 16.04.3 installation
  running a 4.10 kernel; but as I said, this applies to the 4.13 kernel.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.10.0-38-generic 4.10.0-38.42~16.04.1
  ProcVersionSignature: User Name 4.10.0-38.42~16.04.1-generic 4.10.17
  Uname: Linux 4.10.0-38-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.10
  Architecture: amd64
  Date: Tue Nov 21 17:36:06 2017
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1733662/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp