Re: [PATCH v3 2/3] drm/amdgpu: Avoid HW GPU reset for RAS.

2019-08-30 Thread Kuehling, Felix
On 2019-08-30 12:39 p.m., Andrey Grodzovsky wrote:
> Problem:
> Under certain conditions, when some IP blocks take a RAS error,
> we can get into a situation where a GPU reset is not possible
> due to issues in RAS in SMU/PSP.
>
> Temporary fix until proper solution in PSP/SMU is ready:
> When uncor

Re: [PATCH v3 2/3] drm/amdgpu: Avoid HW GPU reset for RAS.

2019-08-30 Thread Alex Deucher
On Fri, Aug 30, 2019 at 12:39 PM Andrey Grodzovsky wrote:
>
> Problem:
> Under certain conditions, when some IP blocks take a RAS error,
> we can get into a situation where a GPU reset is not possible
> due to issues in RAS in SMU/PSP.
>
> Temporary fix until proper solution in PSP/SMU is ready:
>

[PATCH v3 2/3] drm/amdgpu: Avoid HW GPU reset for RAS.

2019-08-30 Thread Andrey Grodzovsky
Problem:
Under certain conditions, when some IP blocks take a RAS error,
we can get into a situation where a GPU reset is not possible
due to issues in RAS in SMU/PSP.

Temporary fix until proper solution in PSP/SMU is ready:
When uncorrectable error happens the DF will unconditionally broadcast err