elvin-nnov opened a new pull request #7830:
URL: https://github.com/apache/tvm/pull/7830


   Taking the float3 data type as an example:
   Using float3 to load data concurrently into a dense array shared
   between all threads in a Metal threadgroup can lead to a data race between
   threads.
   The float3 data type has a size and alignment of 16 bytes, while the kernel
   assumes it copies 12 bytes to arbitrary, unaligned locations, so the extra
   4 bytes of each access can overlap a neighbouring thread's element.
   Using the packed_float3 data type solves the issue.
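
   The layout mismatch described above can be illustrated on the host with a
   minimal C++ sketch. This is not code from the PR: `Float3` and
   `PackedFloat3` are hypothetical stand-ins that only mimic the size and
   alignment of Metal's `float3` and `packed_float3`, and `overlapping_bytes`
   is an illustrative helper that counts how many bytes of a dense buffer
   would be written by more than one thread if every thread stored one element
   at a 12-byte stride.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for Metal's float3: three floats padded to 16 bytes,
// with 16-byte alignment.
struct alignas(16) Float3 { float x, y, z; };

// Hypothetical stand-in for Metal's packed_float3: three floats, no padding.
struct PackedFloat3 { float x, y, z; };

static_assert(sizeof(Float3) == 16 && alignof(Float3) == 16, "padded layout");
static_assert(sizeof(PackedFloat3) == 12 && alignof(PackedFloat3) == 4, "packed layout");

// Emulates num_threads threads, each storing store_bytes bytes into a dense
// float array at a stride of 3 floats (12 bytes). Returns the number of bytes
// touched by more than one thread, i.e. the bytes exposed to a data race.
inline std::size_t overlapping_bytes(std::size_t num_threads, std::size_t store_bytes) {
  assert(store_bytes > 0);
  if (num_threads == 0) return 0;
  const std::size_t stride = 3 * sizeof(float);
  std::vector<int> writers(stride * (num_threads - 1) + store_bytes, 0);
  for (std::size_t t = 0; t < num_threads; ++t) {
    for (std::size_t b = 0; b < store_bytes; ++b) {
      ++writers[t * stride + b];  // thread t writes byte b of its element
    }
  }
  std::size_t shared = 0;
  for (int w : writers) {
    if (w > 1) ++shared;
  }
  return shared;
}
```

   With four emulated threads, `overlapping_bytes(4, sizeof(Float3))` reports
   12 contested bytes (the 4 trailing bytes of each of the first three stores
   land on the next thread's first component), while
   `overlapping_bytes(4, sizeof(PackedFloat3))` reports none.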



