[Kernel-packages] [Bug 1796443] Re: HP EliteBook 745 G5 (Ryzen 2500U) fails to boot unless `mce=off` is set on command line

2018-10-09 Thread John Clemens
Changing as requested.

** Changed in: linux (Ubuntu)
   Status: Triaged => Confirmed

** Tags added: kernel-bug-exists-upstream

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1796443

Title:
  HP EliteBook 745 G5 (Ryzen 2500U) fails to boot unless `mce=off` is
  set on command line

Status in Linux:
  Unknown
Status in linux package in Ubuntu:
  Confirmed

Bug description:
  My new Elitebook, with the latest bios 1.03.01, refuses to boot any
  kernel later than 4.10 unless mce=off is appended to the kernel
  command line.  As in, there are no kernel messages at all after grub
  (yes, quiet and splash were removed from the command line).  Perhaps
  it crashes before the efifb kicks in?

  System operates fine if mce=off is added to the kernel command line
  (and iommu=soft, but that's a separate issue, and fails with kernel
  output in that case).

  I opened upstream bug here :
  https://bugzilla.kernel.org/show_bug.cgi?id=201291

  I bisected the problem down to this commit (and the few before it,
  which also added extra MCE output, but didn't actually crash):

  18807ddb7f88d4ac3797302bafb18143d573e66f is the first bad commit
  commit 18807ddb7f88d4ac3797302bafb18143d573e66f
  Author: Yazen Ghannam 
  Date:   Tue Nov 15 15:13:53 2016 -0600

  x86/mce/AMD: Reset Threshold Limit after logging error

  The error count field in MCA_MISC does not get reset by hardware when the
  threshold has been reached. Software is expected to reset it. Currently,
  the threshold limit only gets reset during init or when a user writes to
  sysfs.

  If the user is not monitoring threshold interrupts and resetting
  the limit then the user will only see 1 interrupt when the limit is first
  hit. So if, for example, the limit is set to 10 then only 1 interrupt will
  be recorded after 10 errors even if 100 errors have occurred. The user may
  then assume that only 10 errors have occurred.

  There are threads online about this being related to the latest bios.
  The upstream bug has acpidump attached.

  ProblemType: Bug
  DistroRelease: Ubuntu 18.10
  Package: linux-image-4.18.0-8-generic 4.18.0-8.9
  ProcVersionSignature: Ubuntu 4.18.0-8.9-generic 4.18.7
  Uname: Linux 4.18.0-8-generic x86_64
  ApportVersion: 2.20.10-0ubuntu11
  Architecture: amd64
  AudioDevicesInUse:
   USERPID ACCESS COMMAND
   /dev/snd/controlC1:  john   2015 F pulseaudio
   /dev/snd/pcmC1D0p:   john   2015 F...m pulseaudio
   /dev/snd/controlC0:  john   2015 F pulseaudio
  CurrentDesktop: ubuntu:GNOME
  Date: Fri Oct  5 23:24:45 2018
  InstallationDate: Installed on 2018-09-30 (5 days ago)
  InstallationMedia: Ubuntu 18.10 "Cosmic Cuttlefish" - Beta amd64 (20180927)
  Lsusb:
   Bus 005 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 004 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
   Bus 003 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 002 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
   Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  MachineType: HP HP EliteBook 745 G5
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 amdgpudrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.18.0-8-generic 
root=UUID=5cf73665-d2a3-4203-80fd-659faf1afea4 ro quiet splash iommu=soft 
mce=off
  RelatedPackageVersions:
   linux-restricted-modules-4.18.0-8-generic N/A
   linux-backports-modules-4.18.0-8-generic  N/A
   linux-firmware1.175
  RfKill:
   1: phy0: Wireless LAN
    Soft blocked: no
    Hard blocked: no
  SourcePackage: linux
  StagingDrivers: r8822be
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 07/26/2018
  dmi.bios.vendor: HP
  dmi.bios.version: Q81 Ver. 01.03.01
  dmi.board.name: 83D5
  dmi.board.vendor: HP
  dmi.board.version: KBC Version 08.47.00
  dmi.chassis.asset.tag: 5CG838305Y
  dmi.chassis.type: 10
  dmi.chassis.vendor: HP
  dmi.modalias: 
dmi:bvnHP:bvrQ81Ver.01.03.01:bd07/26/2018:svnHP:pnHPEliteBook745G5:pvr:rvnHP:rn83D5:rvrKBCVersion08.47.00:cvnHP:ct10:cvr:
  dmi.product.family: 103C_5336AN HP EliteBook
  dmi.product.name: HP EliteBook 745 G5
  dmi.product.sku: 2MG23AV
  dmi.sys.vendor: HP

To manage notifications about this bug go to:
https://bugs.launchpad.net/linux/+bug/1796443/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1796443] Re: HP EliteBook 745 G5 (Ryzen 2500U) fails to boot unless `mce=off` is set on command line

2018-10-09 Thread John Clemens
Kernel

linux-image-unsigned-4.19.0-999-generic_4.19.0-999.201810062200_amd64
still has this issue.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1796443

Title:
  HP EliteBook 745 G5 (Ryzen 2500U) fails to boot unless `mce=off` is
  set on command line

Status in Linux:
  Unknown
Status in linux package in Ubuntu:
  Confirmed

Bug description:
  My new Elitebook, with the latest bios 1.03.01, refuses to boot any
  kernel later than 4.10 unless mce=off is appended to the kernel
  command line.  As in, there are no kernel messages at all after grub
  (yes, quiet and splash were removed from the command line).  Perhaps
  it crashes before the efifb kicks in?

  System operates fine if mce=off is added to the kernel command line
  (and iommu=soft, but that's a separate issue, and fails with kernel
  output in that case).

  I opened upstream bug here :
  https://bugzilla.kernel.org/show_bug.cgi?id=201291

  I bisected the problem down to this commit (and the few before it,
  which also added extra MCE output, but didn't actually crash):

  18807ddb7f88d4ac3797302bafb18143d573e66f is the first bad commit
  commit 18807ddb7f88d4ac3797302bafb18143d573e66f
  Author: Yazen Ghannam 
  Date:   Tue Nov 15 15:13:53 2016 -0600

  x86/mce/AMD: Reset Threshold Limit after logging error

  The error count field in MCA_MISC does not get reset by hardware when the
  threshold has been reached. Software is expected to reset it. Currently,
  the threshold limit only gets reset during init or when a user writes to
  sysfs.

  If the user is not monitoring threshold interrupts and resetting
  the limit then the user will only see 1 interrupt when the limit is first
  hit. So if, for example, the limit is set to 10 then only 1 interrupt will
  be recorded after 10 errors even if 100 errors have occurred. The user may
  then assume that only 10 errors have occurred.

  There are threads online about this being related to the latest bios.
  The upstream bug has acpidump attached.

  ProblemType: Bug
  DistroRelease: Ubuntu 18.10
  Package: linux-image-4.18.0-8-generic 4.18.0-8.9
  ProcVersionSignature: Ubuntu 4.18.0-8.9-generic 4.18.7
  Uname: Linux 4.18.0-8-generic x86_64
  ApportVersion: 2.20.10-0ubuntu11
  Architecture: amd64
  AudioDevicesInUse:
   USERPID ACCESS COMMAND
   /dev/snd/controlC1:  john   2015 F pulseaudio
   /dev/snd/pcmC1D0p:   john   2015 F...m pulseaudio
   /dev/snd/controlC0:  john   2015 F pulseaudio
  CurrentDesktop: ubuntu:GNOME
  Date: Fri Oct  5 23:24:45 2018
  InstallationDate: Installed on 2018-09-30 (5 days ago)
  InstallationMedia: Ubuntu 18.10 "Cosmic Cuttlefish" - Beta amd64 (20180927)
  Lsusb:
   Bus 005 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 004 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
   Bus 003 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 002 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
   Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  MachineType: HP HP EliteBook 745 G5
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 amdgpudrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.18.0-8-generic 
root=UUID=5cf73665-d2a3-4203-80fd-659faf1afea4 ro quiet splash iommu=soft 
mce=off
  RelatedPackageVersions:
   linux-restricted-modules-4.18.0-8-generic N/A
   linux-backports-modules-4.18.0-8-generic  N/A
   linux-firmware1.175
  RfKill:
   1: phy0: Wireless LAN
    Soft blocked: no
    Hard blocked: no
  SourcePackage: linux
  StagingDrivers: r8822be
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 07/26/2018
  dmi.bios.vendor: HP
  dmi.bios.version: Q81 Ver. 01.03.01
  dmi.board.name: 83D5
  dmi.board.vendor: HP
  dmi.board.version: KBC Version 08.47.00
  dmi.chassis.asset.tag: 5CG838305Y
  dmi.chassis.type: 10
  dmi.chassis.vendor: HP
  dmi.modalias: 
dmi:bvnHP:bvrQ81Ver.01.03.01:bd07/26/2018:svnHP:pnHPEliteBook745G5:pvr:rvnHP:rn83D5:rvrKBCVersion08.47.00:cvnHP:ct10:cvr:
  dmi.product.family: 103C_5336AN HP EliteBook
  dmi.product.name: HP EliteBook 745 G5
  dmi.product.sku: 2MG23AV
  dmi.sys.vendor: HP

To manage notifications about this bug go to:
https://bugs.launchpad.net/linux/+bug/1796443/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1796443] Re: HP EliteBook 745 G5 (Ryzen 2500U) fails to boot unless `mce=off` is set on command line

2018-10-05 Thread John Clemens
Note: bug also effect 18.04. Debian stable works, as it's based on 4.9.

** Description changed:

  My new Elitebook, with the latest bios 1.03.01, refuses to boot any
  kernel later than 4.10 unless mce=off is appended to the kernel command
  line.  As in, there are no kernel messages at all after grub (yes, quiet
  and splash were removed from the command line).  Perhaps it crashes
  before the efifb kicks in?
  
  System operates fine if mce=off is added to the kernel command line (and
  iommu=soft, but that's a separate issue, and fails with kernel output in
  that case).
  
  I opened upstream bug here :
  https://bugzilla.kernel.org/show_bug.cgi?id=201291
  
  I bisected the problem down to this commit (and the few before it, which
- also added extra MCE output, but didn't actually crash.
+ also added extra MCE output, but didn't actually crash):
  
- 18807ddb7f88d4ac3797302bafb18143d573e66f is the first bad commit
- commit 18807ddb7f88d4ac3797302bafb18143d573e66f
- Author: Yazen Ghannam 
- Date:   Tue Nov 15 15:13:53 2016 -0600
+ 18807ddb7f88d4ac3797302bafb18143d573e66f is the first bad commit
+ commit 18807ddb7f88d4ac3797302bafb18143d573e66f
+ Author: Yazen Ghannam 
+ Date:   Tue Nov 15 15:13:53 2016 -0600
  
- x86/mce/AMD: Reset Threshold Limit after logging error
- 
- The error count field in MCA_MISC does not get reset by hardware when the
- threshold has been reached. Software is expected to reset it. Currently,
- the threshold limit only gets reset during init or when a user writes to
- sysfs.
- 
- If the user is not monitoring threshold interrupts and resetting
- the limit then the user will only see 1 interrupt when the limit is first
- hit. So if, for example, the limit is set to 10 then only 1 interrupt will
- be recorded after 10 errors even if 100 errors have occurred. The user may
- then assume that only 10 errors have occurred.
+ x86/mce/AMD: Reset Threshold Limit after logging error
+ 
+ The error count field in MCA_MISC does not get reset by hardware when the
+ threshold has been reached. Software is expected to reset it. Currently,
+ the threshold limit only gets reset during init or when a user writes to
+ sysfs.
+ 
+ If the user is not monitoring threshold interrupts and resetting
+ the limit then the user will only see 1 interrupt when the limit is first
+ hit. So if, for example, the limit is set to 10 then only 1 interrupt will
+ be recorded after 10 errors even if 100 errors have occurred. The user may
+ then assume that only 10 errors have occurred.
  
  There are threads online about this being related to the latest bios.
  The upstream bug has acpidump attached.
  
  ProblemType: Bug
  DistroRelease: Ubuntu 18.10
  Package: linux-image-4.18.0-8-generic 4.18.0-8.9
  ProcVersionSignature: Ubuntu 4.18.0-8.9-generic 4.18.7
  Uname: Linux 4.18.0-8-generic x86_64
  ApportVersion: 2.20.10-0ubuntu11
  Architecture: amd64
  AudioDevicesInUse:
-  USERPID ACCESS COMMAND
-  /dev/snd/controlC1:  john   2015 F pulseaudio
-  /dev/snd/pcmC1D0p:   john   2015 F...m pulseaudio
-  /dev/snd/controlC0:  john   2015 F pulseaudio
+  USERPID ACCESS COMMAND
+  /dev/snd/controlC1:  john   2015 F pulseaudio
+  /dev/snd/pcmC1D0p:   john   2015 F...m pulseaudio
+  /dev/snd/controlC0:  john   2015 F pulseaudio
  CurrentDesktop: ubuntu:GNOME
  Date: Fri Oct  5 23:24:45 2018
  InstallationDate: Installed on 2018-09-30 (5 days ago)
  InstallationMedia: Ubuntu 18.10 "Cosmic Cuttlefish" - Beta amd64 (20180927)
  Lsusb:
-  Bus 005 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
-  Bus 004 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
-  Bus 003 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
-  Bus 002 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
-  Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
+  Bus 005 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
+  Bus 004 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
+  Bus 003 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
+  Bus 002 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
+  Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  MachineType: HP HP EliteBook 745 G5
  ProcEnviron:
-  TERM=xterm-256color
-  PATH=(custom, no user)
-  XDG_RUNTIME_DIR=
-  LANG=en_US.UTF-8
-  SHELL=/bin/bash
+  TERM=xterm-256color
+  PATH=(custom, no user)
+  XDG_RUNTIME_DIR=
+  LANG=en_US.UTF-8
+  SHELL=/bin/bash
  ProcFB: 0 amdgpudrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.18.0-8-generic 
root=UUID=5cf73665-d2a3-4203-80fd-659faf1afea4 ro quiet splash iommu=soft 
mce=off
  RelatedPackageVersions:
-  linux-restricted-modules-4.18.0-8-generic N/A
-  linux-backports-modules-4.18.0-8-generic  N/A
-  linux-firmware1.175
+  linux-restricted-modules-4.18.0-8-generic N/A
+  

[Kernel-packages] [Bug 1796443] [NEW] HP EliteBook 745 G5 (Ryzen 2500U) fails to boot unless `mce=off` is set on command line

2018-10-05 Thread John Clemens
Public bug reported:

My new Elitebook, with the latest bios 1.03.01, refuses to boot any
kernel later than 4.10 unless mce=off is appended to the kernel command
line.  As in, there are no kernel messages at all after grub (yes, quiet
and splash were removed from the command line).  Perhaps it crashes
before the efifb kicks in?

System operates fine if mce=off is added to the kernel command line (and
iommu=soft, but that's a separate issue, and fails with kernel output in
that case).

I opened upstream bug here :
https://bugzilla.kernel.org/show_bug.cgi?id=201291

I bisected the problem down to this commit (and the few before it, which
also added extra MCE output, but didn't actually crash.

18807ddb7f88d4ac3797302bafb18143d573e66f is the first bad commit
commit 18807ddb7f88d4ac3797302bafb18143d573e66f
Author: Yazen Ghannam 
Date:   Tue Nov 15 15:13:53 2016 -0600

x86/mce/AMD: Reset Threshold Limit after logging error

The error count field in MCA_MISC does not get reset by hardware when the
threshold has been reached. Software is expected to reset it. Currently,
the threshold limit only gets reset during init or when a user writes to
sysfs.

If the user is not monitoring threshold interrupts and resetting
the limit then the user will only see 1 interrupt when the limit is first
hit. So if, for example, the limit is set to 10 then only 1 interrupt will
be recorded after 10 errors even if 100 errors have occurred. The user may
then assume that only 10 errors have occurred.

There are threads online about this being related to the latest bios.
The upstream bug has acpidump attached.

ProblemType: Bug
DistroRelease: Ubuntu 18.10
Package: linux-image-4.18.0-8-generic 4.18.0-8.9
ProcVersionSignature: Ubuntu 4.18.0-8.9-generic 4.18.7
Uname: Linux 4.18.0-8-generic x86_64
ApportVersion: 2.20.10-0ubuntu11
Architecture: amd64
AudioDevicesInUse:
 USERPID ACCESS COMMAND
 /dev/snd/controlC1:  john   2015 F pulseaudio
 /dev/snd/pcmC1D0p:   john   2015 F...m pulseaudio
 /dev/snd/controlC0:  john   2015 F pulseaudio
CurrentDesktop: ubuntu:GNOME
Date: Fri Oct  5 23:24:45 2018
InstallationDate: Installed on 2018-09-30 (5 days ago)
InstallationMedia: Ubuntu 18.10 "Cosmic Cuttlefish" - Beta amd64 (20180927)
Lsusb:
 Bus 005 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
 Bus 004 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
 Bus 003 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
 Bus 002 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
 Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
MachineType: HP HP EliteBook 745 G5
ProcEnviron:
 TERM=xterm-256color
 PATH=(custom, no user)
 XDG_RUNTIME_DIR=
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcFB: 0 amdgpudrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.18.0-8-generic 
root=UUID=5cf73665-d2a3-4203-80fd-659faf1afea4 ro quiet splash iommu=soft 
mce=off
RelatedPackageVersions:
 linux-restricted-modules-4.18.0-8-generic N/A
 linux-backports-modules-4.18.0-8-generic  N/A
 linux-firmware1.175
RfKill:
 1: phy0: Wireless LAN
Soft blocked: no
Hard blocked: no
SourcePackage: linux
StagingDrivers: r8822be
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 07/26/2018
dmi.bios.vendor: HP
dmi.bios.version: Q81 Ver. 01.03.01
dmi.board.name: 83D5
dmi.board.vendor: HP
dmi.board.version: KBC Version 08.47.00
dmi.chassis.asset.tag: 5CG838305Y
dmi.chassis.type: 10
dmi.chassis.vendor: HP
dmi.modalias: 
dmi:bvnHP:bvrQ81Ver.01.03.01:bd07/26/2018:svnHP:pnHPEliteBook745G5:pvr:rvnHP:rn83D5:rvrKBCVersion08.47.00:cvnHP:ct10:cvr:
dmi.product.family: 103C_5336AN HP EliteBook
dmi.product.name: HP EliteBook 745 G5
dmi.product.sku: 2MG23AV
dmi.sys.vendor: HP

** Affects: linux (Ubuntu)
 Importance: Undecided
 Status: New


** Tags: amd64 apport-bug cosmic staging

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1796443

Title:
  HP EliteBook 745 G5 (Ryzen 2500U) fails to boot unless `mce=off` is
  set on command line

Status in linux package in Ubuntu:
  New

Bug description:
  My new Elitebook, with the latest bios 1.03.01, refuses to boot any
  kernel later than 4.10 unless mce=off is appended to the kernel
  command line.  As in, there are no kernel messages at all after grub
  (yes, quiet and splash were removed from the command line).  Perhaps
  it crashes before the efifb kicks in?

  System operates fine if mce=off is added to the kernel command line
  (and iommu=soft, but that's a separate issue, and fails with kernel
  output in that case).

  I opened upstream bug here :
  https://bugzilla.kernel.org/show_bug.cgi?id=201291

  I bisected the problem down to this commit (and the few before it,
  which also added extra MCE output, but didn't actually crash.