I'm assuming the 5.14.15 or 5.14.16 amdgpu and amdkfd have been backported to
the 5.13 Ubuntu kernel.  Here's the patch in 5.14.17 that specifically
addresses this.
 
https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?h=v5.14.17&id=7883e13c249461877ea3be7b24a5935fc8946e46

The other AMD-related patches for 5.14.17 largely appear to be fixes to
DCN 3.1 support (the newest GPU models, from the last 6 months or so).
This is a fairly serious regression for those affected.  (I may have only
gotten to a desktop because I'm using Gnome Flashback, so there is no
compositor trying to exercise the 3D hardware for desktop use.  I've
simply gone back to 5.13.0-22 for now.)  If the plan is to ship a quick
update, I could see just patching in that one patch; if it'll be fixed in
5.13.0-23 on the usual schedule, I could see incorporating all of them to
benefit DCN 3.1 users.

Not to digress, but kudos to the open source GPU driver developers.
The Intel support's amazing (it's amusing that on my friend's Sandy
Bridge he can run DX11 games in Steam through Proton that it would not
be able to run in Windows, since Intel never shipped DX11 drivers for
it...), and amdgpu has run every game I've thrown at it so far,
generally at very good frame rates.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-hwe-5.13 in Ubuntu.
https://bugs.launchpad.net/bugs/1956401

Title:
  amdgpu hangs for 90 seconds at a time

Status in linux-hwe-5.13 package in Ubuntu:
  Confirmed

Bug description:
  This does not occur with linux-image-5.13.0-22-generic, but does with linux-image-5.13.0-33-generic.
  On startup, I get about a 60 second hang, with the following in the kernel dmesg:
  Jan  4 15:26:36 inspiron-3505 kernel: [   34.160572] amdgpu 0000:04:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6
  Jan  4 15:26:56 inspiron-3505 kernel: [   54.189055] amdgpu 0000:04:00.0: amdgpu: failed to write reg 1a6f4 wait reg 1a706
  Jan  4 15:27:16 inspiron-3505 kernel: [   74.329264] amdgpu 0000:04:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6
  Jan  4 15:27:36 inspiron-3505 kernel: [   94.337904] amdgpu 0000:04:00.0: amdgpu: failed to write reg 1a6f4 wait reg 1a706
  I have the following GPU:
  04:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Picasso (rev c2) (prog-if 00 [VGA controller])
  04:00.0 0300: 1002:15d8 (rev c2)
  (This is a Ryzen 5 3450U CPU with Radeon Vega Mobile.)
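
  For anyone wanting to check their own logs for the same pattern, here
  is a minimal Python sketch.  It assumes each kernel message sits on one
  line and is shaped like the excerpt in this report; the regex, the
  function names and SAMPLE_LINES are illustrative only and are not part
  of any amdgpu tooling.

```python
import re

# Assumed line shape, taken from the log excerpt in this report:
#   ... kernel: [   34.160572] amdgpu 0000:04:00.0: ... failed to write reg 28b4 wait reg 28c6
# The text between the PCI address and "failed" is matched loosely on purpose.
PATTERN = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+amdgpu\s+\S+\s+.*?"
    r"failed to write reg (?P<reg>[0-9a-f]+) wait reg (?P<wait>[0-9a-f]+)"
)

# Sample input copied from the bug description (hypothetical test data).
SAMPLE_LINES = [
    "Jan  4 15:26:36 inspiron-3505 kernel: [   34.160572] amdgpu 0000:04:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6",
    "Jan  4 15:26:56 inspiron-3505 kernel: [   54.189055] amdgpu 0000:04:00.0: amdgpu: failed to write reg 1a6f4 wait reg 1a706",
    "Jan  4 15:27:16 inspiron-3505 kernel: [   74.329264] amdgpu 0000:04:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6",
    "Jan  4 15:27:36 inspiron-3505 kernel: [   94.337904] amdgpu 0000:04:00.0: amdgpu: failed to write reg 1a6f4 wait reg 1a706",
]


def find_reg_write_failures(lines):
    """Return (timestamp_seconds, reg, wait_reg) for each matching line."""
    hits = []
    for line in lines:
        m = PATTERN.search(line)
        if m:
            hits.append((float(m.group("ts")), m.group("reg"), m.group("wait")))
    return hits


def gaps(hits):
    """Seconds elapsed between consecutive failures, in log order."""
    return [round(later[0] - earlier[0], 3) for earlier, later in zip(hits, hits[1:])]


if __name__ == "__main__":
    found = find_reg_write_failures(SAMPLE_LINES)
    for ts, reg, wait in found:
        print(f"{ts:10.3f}s  reg={reg}  wait_reg={wait}")
    print("gaps between failures (s):", gaps(found))
```

  On the lines above, the failures come out roughly 20 seconds apart.
  To run it against a saved log, pass the lines of that file to
  find_reg_write_failures() instead of SAMPLE_LINES.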

  I get a similar hang if I start firefox (when it's probing OpenGL
  contexts), and even with glxgears and glxinfo.  Seems like anything
  that'd kick off an OpenGL context does it.  I had a freeze as well when
  I tried running firefox and glxgears together, along with odd BUG:
  messages logged (I have some in the attached log).

  I was running with "iommu=pt", but did try with this removed and still
  got the errors (I think the amdgpu driver uses the IOMMU even when it's
  set to iommu=pt, though).  See the attached log for some very odd
  "[Hardware Error]" messages that were logged on one test run.  I think
  this was when I tried to run Firestorm (a Second Life viewer); that
  had a long pause, then opened to a black window.

  Per Google, I see there was a bug like this that turned up in kernel
  5.14.15 and was fixed in 5.14.17.  See
  https://gitlab.freedesktop.org/drm/amd/-/issues/1770

  Thanks!
  --Henry
