elvin-nnov opened a new pull request #7830: URL: https://github.com/apache/tvm/pull/7830
On example of float3 datatype: Using of float3 data type for loading of data cuncurrently into dense array shared between all threads in Metal threading group can lead to data race between threads. float3 datatype has size and and alignment eq to 16 bytes while kernel assumes to copy 12 bytes in arbitrary not aligned places. Using of packed_float3 datatypes solves the issue -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
