t-vi commented on pull request #7935:
URL: https://github.com/apache/tvm/pull/7935#issuecomment-828778315


   I don't think the descriptions are entirely accurate, but the Vega ISA 
manual says
   
   > This does not access LDS memory and may be called even if no LDS memory is 
allocated to the wave. It uses LDS hardware to implement an arbitrary swizzle 
across threads in a wavefront. 
   
   so I would expect that the performance lies somewhere between using LDS and 
registers. I can imagine that doing a lot less writing might save time in this 
specific case, but it probably is best to check with AMD before drawing global 
conclusions.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to