Public bug reported:

Modern Intel processors (Skylake and later) with HWP (Hardware P-state)
control enabled and idle state 2 (C1E) enabled can incorrectly drop the
CPU frequency, with an extremely slow recovery time.

The fault is not within HWP itself, but within the internal idle
detection logic. One difference between OS-driven P-state control and
HWP-driven P-state control is that the OS knows the system was not
actually idle, but HWP does not. Another difference is the incredibly
sluggish recovery with HWP.

The problem only occurs when idle state 2 (C1E) is involved. Not all
processors have the C1E idle state. The issue is independent of C1E
auto-promotion, which, as far as I know, is generally turned off.

With all idle states enabled, the issue is rare. It manifests in
periodic workflows and is extremely difficult to isolate (it took me
over half a year).

The purpose of this bug report is to link to the upstream bug report,
where readers can find tons of detail. I'll also set it to confirmed, as
it has already been verified on 4 different processor models, and I do
not want the bot asking me for files that are not required.

Workarounds include:
. Don't use HWP.
. Disable idle state 2 (C1E).
. Change the C1E idle state to use MWAIT 0x03 instead of MWAIT 0x01
  (still in test; documentation on the MWAIT least significant nibble
  is scant).
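
A minimal sketch of checking for HWP and applying the second workaround
follows. It assumes the standard Linux cpuidle sysfs layout
(/sys/devices/system/cpu/cpuN/cpuidle/stateM/name and .../disable) and
the "hwp" flag in /proc/cpuinfo. The function names are illustrative and
do not come from the upstream report. The state is looked up by name
rather than by index, since C1E may not be state 2 on every system.

```python
from pathlib import Path


def cpu_has_hwp_flag(cpuinfo_path="/proc/cpuinfo"):
    """Return True if the first 'flags' line in cpuinfo lists 'hwp'.

    This only shows that the CPU advertises HWP; it does not prove
    that HWP is actually in use.
    """
    for line in Path(cpuinfo_path).read_text().splitlines():
        if line.startswith("flags") and ":" in line:
            return "hwp" in line.split(":", 1)[1].split()
    return False


def set_idle_state_disabled(state_name, disabled, sysfs_root="/sys"):
    """Enable or disable the named idle state on every CPU.

    Returns the number of CPUs whose 'disable' file was written.
    Writing to the real sysfs tree requires root. The sysfs_root
    parameter is an assumption added here to allow dry runs.
    """
    cpu_base = Path(sysfs_root) / "devices" / "system" / "cpu"
    written = 0
    for cpu_dir in sorted(cpu_base.glob("cpu[0-9]*")):
        for state_dir in sorted((cpu_dir / "cpuidle").glob("state[0-9]*")):
            name_file = state_dir / "name"
            # Match on the state's name, not its index.
            if name_file.is_file() and name_file.read_text().strip() == state_name:
                (state_dir / "disable").write_text("1" if disabled else "0")
                written += 1
    return written
```

Run as root, set_idle_state_disabled("C1E", True) would disable C1E on
all CPUs until the next reboot, and passing False would re-enable it.
For the first workaround, booting with intel_pstate=no_hwp on the
kernel command line is one way to keep HWP off.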

** Affects: linux
     Importance: Unknown
         Status: Unknown

** Affects: linux (Ubuntu)
     Importance: Undecided
         Status: Confirmed

** Changed in: linux (Ubuntu)
       Status: New => Confirmed

** Summary changed:

- HWP and C1E are incompatible - Intel prcoessors 
+ HWP and C1E are incompatible - Intel processors

** Bug watch added: Linux Kernel Bug Tracker #210741
   https://bugzilla.kernel.org/show_bug.cgi?id=210741

** Also affects: linux via
   https://bugzilla.kernel.org/show_bug.cgi?id=210741
   Importance: Unknown
       Status: Unknown

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1917813

Title:
  HWP and C1E are incompatible - Intel processors

Status in Linux:
  Unknown
Status in linux package in Ubuntu:
  Confirmed
