[AMD Official Use Only - AMD Internal Distribution Only]

Reviewed-by: Hawking Zhang <[email protected]>

Regards,
Hawking
-----Original Message-----
From: Lazar, Lijo <[email protected]>
Sent: Friday, May 9, 2025 23:32
To: [email protected]
Cc: Zhang, Hawking <[email protected]>; Deucher, Alexander <[email protected]>; Wang, Yang(Kevin) <[email protected]>; Zhou1, Tao <[email protected]>
Subject: Re: [PATCH] drm/amdgpu: Log RAS errors during load

<Ping>

On 5/6/2025 4:45 PM, Lijo Lazar wrote:
> During driver load, the RAS event manager may not be initialized yet.
> As a result, any ATHUB event raised during driver load is skipped in
> the dmesg log. Log the error in dmesg for easier diagnosis.
>
> Signed-off-by: Lijo Lazar <[email protected]>
> ---
>  drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 5 ++++-
>  1 file changed, 4 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c
> index 82013b495436..85a70b646840 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c
> @@ -4485,8 +4485,11 @@ void amdgpu_ras_global_ras_isr(struct amdgpu_device *adev)
>               enum ras_event_type type = RAS_EVENT_TYPE_FATAL;
>               u64 event_id;
>
> -             if (amdgpu_ras_mark_ras_event(adev, type))
> +             if (amdgpu_ras_mark_ras_event(adev, type)) {
> +                     dev_err(adev->dev,
> +                             "uncorrectable hardware error (ERREVENT_ATHUB_INTERRUPT) detected!\n");
>                       return;
> +             }
>
>               event_id = amdgpu_ras_acquire_event_id(adev, type);
>
