This bug seems to be the closest match to what I'm experiencing. I'm
hitting similar trouble on desktop/graphical Ubuntu on some Dell
laptops.

Ubuntu 16.04 AMD64.

Happening on kernel versions 4.13.0-38 and 4.4.0-116.

I've tried version 4.4.0-117 as posted by @kamalmostafa - same result.

The laptops are an XPS 13 and a Precision M3520. Both machines have Kaby
Lake/7th gen Core i7 CPUs.

The problem on the 4.4.0-116 kernel (on my M3520) occurred the first
reboot after updating the Intel microcode. I'm starting to think this is
related to the Meltdown/Spectre patching.

I started sssd in the foreground with debug set to 9 on the M3520 and
netcat'd the results to another machine, so I think I have it right up
until the laptop stops responding. It's obviously quite verbose, but if
someone thinks it'll help, I'll post it.

>From talking to some of the people on #ubuntu-server, the microcode does
seem to be increasingly likely the root cause. Bug #1759920 may include
relevant information.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1746806

Title:
  sssd appears to crash AWS c5 and m5 instances, cause 100% CPU

Status in cloud-images:
  Fix Released
Status in linux package in Ubuntu:
  In Progress
Status in linux-aws package in Ubuntu:
  Confirmed
Status in linux source package in Xenial:
  In Progress
Status in linux-aws source package in Xenial:
  In Progress

Bug description:
  After upgrading to the Ubuntu EC2 AMI from 20180126 (specifically
  ami-79873901 in us-west-2) we have seen sssd hard locking c5 and m5
  EC2 instances after starting the service and CPU goes to 100%.

  We do not experience this issue with t2 or c4 instance types and we do
  not see this issue on any instance types using Ubuntu Cloud images
  from 20180109 or before. I have verified that this is kernel related
  as I booted an image that we created using the Ubuntu cloud image from
  20180109 which works fine on a c5. I then did a "apt update && apt
  install --only-upgrade linux-aws && systemctl disable sssd", rebooted
  the server, verified I was on the new kernel and started sssd with
  "systemctl start sssd" and the EC2 instance froze and Cloudwatch CPU
  usage for that instance went to 100%.

  I haven't been able to find much in the syslog, kern.log, journalctl
  logs, etc. The only thing I have been able to find is that when this
  happens I tend to see "^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@" in
  the syslog and sssd log files.  I have attached several log files and
  the output of a "apport-bug /usr/sbin/sssd". Let me know if you need
  anything else to help track this down.

  Thanks,
  Paul

To manage notifications about this bug go to:
https://bugs.launchpad.net/cloud-images/+bug/1746806/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to