[AMD Official Use Only - Internal Distribution Only]

Reviewed-by: Dennis Li <[email protected]>

Best Regards
Dennis Li
-----Original Message-----
From: Chen, Guchun <[email protected]> 
Sent: Monday, August 10, 2020 1:23 PM
To: [email protected]; Zhang, Hawking <[email protected]>; Li, Dennis <[email protected]>; Lazar, Lijo <[email protected]>; Zhou1, Tao <[email protected]>; Clements, John <[email protected]>
Cc: Chen, Guchun <[email protected]>
Subject: [PATCH 2/2] drm/amdgpu: add debugfs node to toggle ras error cnt harvest

Before RAS recovery is issued, the user can use this debugfs node to
enable or disable harvesting of the RAS error count registers of all
RAS IPs. Disabling the harvest preserves the status of the hardware
registers instead of clearing them.

Signed-off-by: Guchun Chen <[email protected]>
---
 drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 7 +++++++
 1 file changed, 7 insertions(+)
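
For reference, a minimal user-space sketch of toggling the new node. The
path below is an assumption (debugfs mounted at /sys/kernel/debug, DRM
minor 0, RAS nodes under "ras"), and the helper names are hypothetical;
neither is part of this patch. debugfs bool files accept "1"/"0" or
"Y"/"N" on write and report "Y" or "N" on read.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>

/* Assumed location; the DRM minor index (0 here) differs between systems. */
#define HARVEST_NODE "/sys/kernel/debug/dri/0/ras/disable_ras_err_cnt_harvest"

/* Hypothetical helper: write a boolean to a debugfs bool file.
 * Returns 0 on success, -1 on failure. */
static int write_bool_node(const char *path, bool value)
{
	assert(path != NULL);

	FILE *f = fopen(path, "w");
	if (!f)
		return -1;

	int rc = (fputs(value ? "1\n" : "0\n", f) == EOF) ? -1 : 0;
	if (fclose(f) != 0)
		rc = -1;
	return rc;
}

/* Hypothetical helper: read a debugfs bool file back.
 * Accepts "Y"/"N" (what debugfs prints) as well as "1"/"0".
 * Returns 0 on success, -1 on failure or unexpected content. */
static int read_bool_node(const char *path, bool *value)
{
	assert(path != NULL && value != NULL);

	FILE *f = fopen(path, "r");
	if (!f)
		return -1;

	int c = fgetc(f);
	fclose(f);

	if (c == 'Y' || c == 'y' || c == '1') {
		*value = true;
		return 0;
	}
	if (c == 'N' || c == 'n' || c == '0') {
		*value = false;
		return 0;
	}
	return -1;
}
```

Calling write_bool_node(HARVEST_NODE, true) before recovery is triggered
would ask the driver to leave the error count registers untouched;
writing false restores the default harvest. The node is created with
mode 0644, so writing requires root.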

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c
index e6978b8e2143..31df6bf2dc1f 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c
@@ -1215,6 +1215,13 @@ static void amdgpu_ras_debugfs_create_ctrl_node(struct amdgpu_device *adev)
         */
        debugfs_create_bool("auto_reboot", S_IWUGO | S_IRUGO, con->dir,
                                &con->reboot);
+
+       /*
+        * When set, the hardware error count registers of RAS IPs are not
+        * cleared during RAS recovery.
+        */
+       debugfs_create_bool("disable_ras_err_cnt_harvest", 0644,
+                       con->dir, &con->disable_ras_err_cnt_harvest);
 }
 
 void amdgpu_ras_debugfs_create(struct amdgpu_device *adev,
--
2.17.1
