On 17-07-27 03:46 PM, Alex Deucher wrote:
> No functional change until wptr polling uses this
> location (future patch).
>
> Cc: Frank Min <[email protected]>
> Signed-off-by: Alex Deucher <[email protected]>
> ---
>  drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c | 4 ++--
>  1 file changed, 2 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c 
> b/drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c
> index 7cb5320..9392799 100644
> --- a/drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c
> +++ b/drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c
> @@ -301,8 +301,8 @@ static void sdma_v4_0_ring_set_wptr(struct amdgpu_ring 
> *ring)
>                               lower_32_bits(ring->wptr << 2),
>                               upper_32_bits(ring->wptr << 2));
>               /* XXX check if swapping is necessary on BE */
> -             adev->wb.wb[ring->wptr_offs] = lower_32_bits(ring->wptr << 2);
> -             adev->wb.wb[ring->wptr_offs + 1] = upper_32_bits(ring->wptr << 
> 2);
> +             atomic64_set((atomic64_t *)&adev->wb.wb[ring->wptr_offs],
> +                          (ring->wptr << 2));

This looks a bit awkward. Would "writeq" do the job? That's what we use
for updating kernel queue doorbells in KFD.

The generic implementation of atomic64_set uses a spinlock. But that
doesn't do anything for protecting against concurrent access by the GPU
hardware.

Regards,
  Felix

>               DRM_DEBUG("calling WDOORBELL64(0x%08x, 0x%016llx)\n",
>                               ring->doorbell_index, ring->wptr << 2);
>  

_______________________________________________
amd-gfx mailing list
[email protected]
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

Reply via email to