This bug is awaiting verification that the kernel in -proposed solves
the problem. Please test the kernel and update this bug with the
results. If the problem is solved, change the tag
'verification-needed-bionic' to 'verification-done-bionic'. If the
problem still exists, change the tag 'verification-needed-bionic' to
'verification-failed-bionic'.

If verification is not done within 5 working days from today, this fix
will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on
how to enable and use -proposed. Thank you!


** Tags added: verification-needed-bionic

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1796443

Title:
  HP EliteBook 745 G5 (Ryzen 2500U) fails to boot unless `mce=off` is
  set on command line

Status in Linux:
  Fix Released
Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Bionic:
  Fix Committed
Status in linux source package in Cosmic:
  Won't Fix
Status in linux source package in Disco:
  Fix Committed

Bug description:
  === SRU Justification ===
  [Impact]
  System doesn't boot without "mce=off".

  [Fix]
  Quote from the commit log:
  "Clear the "Counter Present" bit in the Instruction Fetch bank's
  MCA_MISC0 register. This will prevent enabling MCA thresholding on this
  bank which will prevent the high interrupt rate due to this error."
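
  As a rough illustration of the quoted fix, the sketch below clears a
  single bit in a 64-bit register value. It assumes that "Counter
  Present" (CntP) is bit 62 of MCA_MISC0; the constant and function
  names are hypothetical, and the code only manipulates an integer. It
  does not read or write any MSR.

```python
# Assumption: "Counter Present" (CntP) is bit 62 of MCA_MISC0.
CNTP_BIT = 62
MASK_64 = (1 << 64) - 1  # keep results within a 64-bit register width


def clear_counter_present(misc0):
    """Return the register value with the CntP bit cleared."""
    return misc0 & ~(1 << CNTP_BIT) & MASK_64


def counter_present(misc0):
    """Return True if the CntP bit is set in the register value."""
    return bool((misc0 >> CNTP_BIT) & 1)
```

  With CntP cleared, code that checks for a threshold counter before
  enabling thresholding on a bank would skip that bank, which is the
  effect the commit log describes.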

  [Test]
  The affected user reported these commits fix the issue.

  [Regression Potential]
  Low. Upstream stable commits. I don't see any regression on my
  unaffected AMD systems.

  === Original Bug Report ===
  My new EliteBook, with the latest BIOS 1.03.01, refuses to boot any
  kernel later than 4.10 unless mce=off is appended to the kernel
  command line. That is, there are no kernel messages at all after GRUB
  (yes, quiet and splash were removed from the command line). Perhaps it
  crashes before the efifb kicks in?

  System operates fine if mce=off is added to the kernel command line
  (and iommu=soft, but that's a separate issue, and fails with kernel
  output in that case).
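
  To confirm which workaround parameters a running system was actually
  booted with, a small sketch like the one below can parse the kernel
  command line. The helper names are hypothetical, and the simple
  whitespace split does not handle quoted values containing spaces.

```python
from pathlib import Path


def parse_cmdline(cmdline):
    """Split a kernel command line into a {name: value} dict.

    Bare flags such as "quiet" map to None. Quoted values with
    spaces are not handled by this simple split.
    """
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        params[key] = value if sep else None
    return params


def running_cmdline():
    """Read the command line of the running kernel (Linux only)."""
    return Path("/proc/cmdline").read_text()


# Command line taken from the ProcKernelCmdLine field of this report.
reported = ("BOOT_IMAGE=/boot/vmlinuz-4.18.0-8-generic "
            "root=UUID=5cf73665-d2a3-4203-80fd-659faf1afea4 "
            "ro quiet splash iommu=soft mce=off")
params = parse_cmdline(reported)
mce_disabled = params.get("mce") == "off"
```

  On an affected machine, parse_cmdline(running_cmdline()) would show
  whether mce=off is still in effect when testing the -proposed kernel.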

  I opened an upstream bug here:
  https://bugzilla.kernel.org/show_bug.cgi?id=201291

  I bisected the problem down to this commit (and the few before it,
  which also added extra MCE output, but didn't actually crash):

      18807ddb7f88d4ac3797302bafb18143d573e66f is the first bad commit
      commit 18807ddb7f88d4ac3797302bafb18143d573e66f
      Author: Yazen Ghannam <yazen.ghan...@amd.com>
      Date:   Tue Nov 15 15:13:53 2016 -0600

      x86/mce/AMD: Reset Threshold Limit after logging error

      The error count field in MCA_MISC does not get reset by hardware when the
      threshold has been reached. Software is expected to reset it. Currently,
      the threshold limit only gets reset during init or when a user writes to
      sysfs.

      If the user is not monitoring threshold interrupts and resetting
      the limit then the user will only see 1 interrupt when the limit is first
      hit. So if, for example, the limit is set to 10 then only 1 interrupt will
      be recorded after 10 errors even if 100 errors have occurred. The user may
      then assume that only 10 errors have occurred.
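
  The behaviour described in that commit message can be sketched with a
  toy model. This is a simplified simulation under stated assumptions
  (the counter stops at the limit until software resets it); the
  function name is hypothetical and it is not the kernel's code.

```python
def count_threshold_interrupts(errors, limit, reset_after_interrupt):
    """Count threshold interrupts seen for a number of errors.

    Assumes limit >= 1 and that the error counter saturates at
    `limit` until software resets it.
    """
    count = 0
    interrupts = 0
    for _ in range(errors):
        if count >= limit:
            continue  # counter saturated: further errors go unnoticed
        count += 1
        if count == limit:
            interrupts += 1
            if reset_after_interrupt:
                count = 0  # software resets the counter after logging
    return interrupts


without_reset = count_threshold_interrupts(100, 10, False)
with_reset = count_threshold_interrupts(100, 10, True)
```

  In this model, 100 errors with a limit of 10 give one interrupt
  without the reset and ten with it, matching the example in the commit
  message.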

  There are threads online about this being related to the latest BIOS.
  The upstream bug has an acpidump attached.

  ProblemType: Bug
  DistroRelease: Ubuntu 18.10
  Package: linux-image-4.18.0-8-generic 4.18.0-8.9
  ProcVersionSignature: Ubuntu 4.18.0-8.9-generic 4.18.7
  Uname: Linux 4.18.0-8-generic x86_64
  ApportVersion: 2.20.10-0ubuntu11
  Architecture: amd64
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC1:  john       2015 F.... pulseaudio
   /dev/snd/pcmC1D0p:   john       2015 F...m pulseaudio
   /dev/snd/controlC0:  john       2015 F.... pulseaudio
  CurrentDesktop: ubuntu:GNOME
  Date: Fri Oct  5 23:24:45 2018
  InstallationDate: Installed on 2018-09-30 (5 days ago)
  InstallationMedia: Ubuntu 18.10 "Cosmic Cuttlefish" - Beta amd64 (20180927)
  Lsusb:
   Bus 005 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 004 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
   Bus 003 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 002 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
   Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  MachineType: HP HP EliteBook 745 G5
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=<set>
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 amdgpudrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.18.0-8-generic root=UUID=5cf73665-d2a3-4203-80fd-659faf1afea4 ro quiet splash iommu=soft mce=off
  RelatedPackageVersions:
   linux-restricted-modules-4.18.0-8-generic N/A
   linux-backports-modules-4.18.0-8-generic  N/A
   linux-firmware                            1.175
  RfKill:
   1: phy0: Wireless LAN
    Soft blocked: no
    Hard blocked: no
  SourcePackage: linux
  StagingDrivers: r8822be
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 07/26/2018
  dmi.bios.vendor: HP
  dmi.bios.version: Q81 Ver. 01.03.01
  dmi.board.name: 83D5
  dmi.board.vendor: HP
  dmi.board.version: KBC Version 08.47.00
  dmi.chassis.asset.tag: 5CG838305Y
  dmi.chassis.type: 10
  dmi.chassis.vendor: HP
  dmi.modalias: dmi:bvnHP:bvrQ81Ver.01.03.01:bd07/26/2018:svnHP:pnHPEliteBook745G5:pvr:rvnHP:rn83D5:rvrKBCVersion08.47.00:cvnHP:ct10:cvr:
  dmi.product.family: 103C_5336AN HP EliteBook
  dmi.product.name: HP EliteBook 745 G5
  dmi.product.sku: 2MG23AV
  dmi.sys.vendor: HP

To manage notifications about this bug go to:
https://bugs.launchpad.net/linux/+bug/1796443/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
