** Description changed:
+ [SRU Justification]
+
+ [Impact]
+
+ Products containing gfx1151 architecture with multiple microcontrollers
+ (VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy
+ loading or with stress applications on the CRB. This requires rebasing
+ these firmware versions to eliminate the risk.
+
+ [Fix]
+
+ * 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
+ * 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
+ * 4a172771d ("amdgpu: update psp 14.0.1 firmware")
+ * d316e650c ("amdgpu: update gc 11.5.1 firmware")
+ * f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
+
+ [Test Case]
+
+ This was reported by hardware vendor using proprietary GPU stress
+ software that is not freely available to generate heavy 3D rendering
+ workload.
+
+ [Where problems could occur]
+
+ Opaque GPU firmware limited to related platforms. There might be further
+ stability issues that need additional fixes from kernel, and we can only
+ find out with more deployments later.
+
+ [Other Info]
+
+ Nominate only for Noble and Oracular, because Plucky already has all of
+ them since version 20250204.git0fd450ee-0ubuntu1.
+
+ ========== original bug report ==========
+
Products containing gfx1151 architecture with multiple microcontrollers
(VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy
loading or with stress applications on the CRB. This requires rebasing
these firmware versions to eliminate the risk.
# upstream tag 20250211
* 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
# upstream tag 20250109
# upstream tag 20241210
* 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
* 4a172771d ("amdgpu: update psp 14.0.1 firmware")
* d316e650c ("amdgpu: update gc 11.5.1 firmware")
# upstream tag 20241110
# upstream tag 20240811
* f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
# upstream tag 20240709
[ 217.270407] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:24 vmid:9 pasid:32771)
[ 217.270426] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid
3362 thread redshiftCmdLine pid 3362)
[ 217.270430] amdgpu 0000:c5:00.0: amdgpu: in page starting at address
0x0000000000000000 from client 10
[ 217.270433] amdgpu 0000:c5:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00901431
[ 217.270435] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data)
(0xa)
[ 217.270437] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x1
[ 217.270438] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270440] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x3
[ 217.270441] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270442] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
[ 217.270448] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:24 vmid:9 pasid:32771)
[ 217.270450] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid
3362 thread redshiftCmdLine pid 3362)
[ 217.270452] amdgpu 0000:c5:00.0: amdgpu: in page starting at address
0x0000000000000000 from client 10
[ 217.270454] amdgpu 0000:c5:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 217.270455] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[ 217.270456] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
[ 217.270457] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270458] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 217.270459] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270460] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
[ 217.270466] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:24 vmid:9 pasid:32771)
[ 217.270468] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid
3362 thread redshiftCmdLine pid 3362)
[ 217.270469] amdgpu 0000:c5:00.0: amdgpu: in page starting at address
0x0000000000000000 from client 10
[ 217.270470] amdgpu 0000:c5:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 217.270472] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[ 217.270473] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
[ 217.270474] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270475] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 217.270476] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270476] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
- ---
+ ---
ProblemType: Bug
ApportVersion: 2.28.1-0ubuntu3.3
Architecture: amd64
CRDA: N/A
CasperMD5CheckResult: pass
Dependencies: firmware-sof-signed 2023.12.1-1ubuntu1.4
DistroRelease: Ubuntu 24.04
InstallationDate: Installed on 2024-05-07 (308 days ago)
InstallationMedia: Ubuntu 24.04 LTS "Noble Numbat" - Release amd64 (20240424)
IwConfig:
- lo no wireless extensions.
-
- enp193s0f0 no wireless extensions.
+ lo no wireless extensions.
+
+ enp193s0f0 no wireless extensions.
MachineType: AMD MAPLE
Package: linux-firmware
PackageArchitecture: amd64
ProcFB: 0 amdgpudrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.11.0-1016-oem
root=UUID=ff988e57-9bf4-46d6-94cf-1e35d0139e12 ro
amdgpu.ip_block_mask=0xfffffcff quiet splash vt.handoff=7
ProcVersionSignature: Ubuntu 6.11.0-1016.16-oem 6.11.11
RelatedPackageVersions:
- linux-restricted-modules-6.11.0-1016-oem N/A
- linux-backports-modules-6.11.0-1016-oem N/A
- linux-firmware 20240318.git3b128b60-0ubuntu2.10
+ linux-restricted-modules-6.11.0-1016-oem N/A
+ linux-backports-modules-6.11.0-1016-oem N/A
+ linux-firmware 20240318.git3b128b60-0ubuntu2.10
RfKill:
-
+
Tags: noble package-from-proposed third-party-packages
Uname: Linux 6.11.0-1016-oem x86_64
UnreportableReason: This does not seem to be an official Ubuntu package.
Please retry after updating the indexes of available packages, if that does not
work then remove related third party packages and try again.
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups: N/A
_MarkForUpload: True
dmi.bios.date: 04/25/2024 21:25:16
dmi.bios.release: 0.0
dmi.bios.vendor: AMD
dmi.bios.version: RG60061C
dmi.board.asset.tag: Base Board Asset Tag
dmi.board.name: MAPLE-STXH
dmi.board.vendor: AMD
dmi.board.version: RevB
dmi.chassis.asset.tag: Chassis Asset Tag
dmi.chassis.type: 10
dmi.chassis.vendor: AMD
dmi.chassis.version: 12345
dmi.ec.firmware.release: 0.23
dmi.modalias:
dmi:bvnAMD:bvrRG60061C:bd04/25/2024212516:br0.0:efr0.23:svnAMD:pnMAPLE:pvrRG60061C:rvnAMD:rnMAPLE-STXH:rvrRevB:cvnAMD:ct10:cvr12345:sku12345678:
dmi.product.family: STXH
dmi.product.name: MAPLE
dmi.product.sku: 12345678
dmi.product.version: RG60061C
dmi.sys.vendor: AMD
** Changed in: linux-firmware (Ubuntu Oracular)
Status: New => In Progress
** Changed in: linux-firmware (Ubuntu Noble)
Status: New => In Progress
** Changed in: linux-firmware (Ubuntu Noble)
Importance: Undecided => High
** Changed in: linux-firmware (Ubuntu Noble)
Assignee: (unassigned) => You-Sheng Yang (vicamo)
** Changed in: linux-firmware (Ubuntu Oracular)
Assignee: (unassigned) => You-Sheng Yang (vicamo)
** Changed in: linux-firmware (Ubuntu Oracular)
Importance: Undecided => High
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2100769
Title:
Update amdgpu FW for GC 11.5.1
To manage notifications about this bug go to:
https://bugs.launchpad.net/hwe-next/+bug/2100769/+subscriptions
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs