.com>; Zhang, Hawking
<mailto:hawking.zh...@amd.com>; Liu, Monk
<mailto:monk@amd.com>; Xu, Feifei
<mailto:feifei...@amd.com>; Wang, Kevin(Yang)
<mailto:kevin1.w...@amd.com>; Xiaojie Yuan
<mailto:xiaojie.y...@amd.com>
Subject: Re: Re: [PATCH 1/2] drm/amdgpu: race issue when jobs on 2 ring timeout
On 1/19/21 11:39 AM, Chen, Horace wrote:
[AMD Official Use Only - Internal Distribution Only]
Hi Andrey,
I think the
On 1/19/21 7:22 AM, Horace Chen wrote:
Fix a race that occurs when jobs on 2 rings time out simultaneously.
If 2 rings time out at the same time, amdgpu_device_gpu_recover
will be reentered. adev->gmc.xgmi.head will then be grabbed
by 2 local linked lists, which may cause wild pointer
Subject: [PATCH 1/2] drm/amdgpu: race issue when jobs on 2 ring timeout
Fix a race that occurs when jobs on 2 rings time out simultaneously.
If 2 rings time out at the same time
On 1/14/21 8:37 AM, Horace Chen wrote:
Fix a race that occurs when jobs on 2 rings time out simultaneously.
If 2 rings time out at the same time,
amdgpu_device_gpu_recover will be reentered.
adev->gmc.xgmi.head will then be grabbed by 2 local linked lists,
which may cause wild pointer
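
For readers following the thread, the reentrancy hazard described above can be illustrated with a small userspace sketch. This is not the code from the patch and not the amdgpu API: struct demo_device, demo_recovery_trylock(), demo_recovery_unlock() and demo_recover() are hypothetical names, and the try-lock flag is only one generic way to keep a second timeout handler from touching shared state while a first recovery pass owns it.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-in for a device whose recovery path walks shared state. */
struct demo_device {
    atomic_flag in_recovery; /* set while one recovery pass owns the device */
    int recoveries_run;      /* number of passes that actually did the work */
};

#define DEMO_DEVICE_INIT { ATOMIC_FLAG_INIT, 0 }

/* Become the single owner of recovery; returns false if another pass owns it. */
static bool demo_recovery_trylock(struct demo_device *dev)
{
    return !atomic_flag_test_and_set(&dev->in_recovery);
}

static void demo_recovery_unlock(struct demo_device *dev)
{
    atomic_flag_clear(&dev->in_recovery);
}

/*
 * Returns true if this caller performed the recovery, false if it backed off.
 * simulate_second_timeout stands in for a second ring timing out while the
 * first recovery pass is still running.
 */
static bool demo_recover(struct demo_device *dev, bool simulate_second_timeout)
{
    if (!demo_recovery_trylock(dev))
        return false; /* someone else is recovering; do not touch shared state */

    /* Only the owner reaches this point, so shared lists are safe to walk. */
    dev->recoveries_run++;

    if (simulate_second_timeout) {
        /* The reentrant call is rejected by the try-lock above. */
        (void)demo_recover(dev, false);
    }

    demo_recovery_unlock(dev);
    return true;
}
```

With a device initialised from DEMO_DEVICE_INIT, a call to demo_recover(&dev, true) runs one recovery pass and the simulated second timeout backs off without doing any work. Whether a rejected caller should skip, retry, or wait is a design choice this sketch leaves open.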