** Tags added: cscc
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1719697
Title:
Boot/Installation crash of Ubuntu-16.04.3 HWE kernel on R940
Status in dellserver:
Fix Released
Status in linux package in Ubuntu:
Fix Released
Status in linux source package in Zesty:
Fix Released
Bug description:
== SRU Justification ==
Kernel crashes when installation of Ubuntu-16.04.3 with HWE (ISO).
Same observation while booting to 4.10.0-28 HWE kerenl of Ubuntu-16.04.3
and 4.10.0-33 HWE as well.
Seen only with 4.10 HWE kernels of Ubuntu-16.04.3. 4.4 kernels of
Ubuntu-16.04.3 works fine. Daily builds of Ubuntu Server 17.10 works fine.
Reducing the core count to <26 cores helps here. Boot & installation of
HWE-kernel works fine.
This bug was introduced by commit:
dc6db24d2476 ("x86/acpi: Set persistent cpuid <-> nodeid mapping when
booting")
It is resolved by reverting commit dc6db24d2476, which was done in mainline
by
commit c962cff17df as of v4.11-rc3.
There are three additiona commits introduced by the same patch author when
commit c962cff17df was submitted. However, it was confirmed that only the
single
revert is needed to fix this particular bug. Upstream thread:
https://lkml.org/lkml/2017/2/20/66
== Fix ==
commit c962cff17dfa11f4a8227ac16de2b28aea3312e4
Author: Dou Liyang <[email protected]>
Date: Fri Mar 3 16:02:23 2017 +0800
Revert "x86/acpi: Set persistent cpuid <-> nodeid mapping when
booting"
== Regression Potential ==
This is reverting a commit that introduced a bug. This commit has also
been reverted upstream.
== Test Case ==
A test kernel was built with this patch and tested by the original bug
reporter.
The bug reporter states the test kernel resolved the bug.
Setup:
Dell PowerEdge R940 having 2 socket populated with 28 cores.
Impact:
This is a boot and installation failure for R940 users having 2 socket x 28
cores.
Requesting an SRU for the fix.
Steps:
1. Setup a Dell PowerEdge R940 with 28 cores CPUs on 2 sockets.
2. Start installation of Ubuntu-16.04.3 with HWE kernel.
3. Observe the screen remains blank. Console logs indicate kernel crash.
Additional Info:
* Seen only with 4.10 HWE kernels of Ubuntu-16.04.3.
4.4 kernels of Ubuntu-16.04.3 works fine. Daily builds of Ubuntu Server
17.10 works fine.
* Reducing the core count to <26 cores helps here. Boot & installation
of HWE-kernel works fine.
* Attaching the console log and acpidump from the setup.
* Patch causing this failure:
https://github.com/torvalds/linux/commit/dc6db24d2476cd09c0ecf2b8d80313539f737a89
x86/acpi: Set persistent cpuid <-> nodeid mapping when booting
* Fix patch series: https://lkml.org/lkml/2017/2/20/66
Revert "x86/acpi: Set persistent cpuid <-> nodeid mapping when booting"
https://github.com/torvalds/linux/commit/c962cff17dfa11f4a8227ac16de2b28aea3312e4#diff-6bd7ef719bca1a2a56b9ebf4bd0bd88d
Revert"x86/acpi: Enable MADT APIs to return disabled apicids"
https://github.com/torvalds/linux/commit/09c3f2bd5c7e5f18687663acb6adc6b167484ca5
acpi/processor: Implement DEVICE operator for processor enumeration
https://github.com/torvalds/linux/commit/8c8cb30f49b86333d8e036e1945cf1a78c03577e
acpi/processor: Check for duplicate processor ids at hotplug time
https://github.com/torvalds/linux/commit/a77d6cd968497792e072b74dff45b891ba778ddb
To manage notifications about this bug go to:
https://bugs.launchpad.net/dellserver/+bug/1719697/+subscriptions
--
Mailing list: https://launchpad.net/~kernel-packages
Post to : [email protected]
Unsubscribe : https://launchpad.net/~kernel-packages
More help : https://help.launchpad.net/ListHelp