yaxunl added a comment. In D135269#3837394 <https://reviews.llvm.org/D135269#3837394>, @tra wrote:
> Is there more info about the issue? What does AMDGPU currently emit for the > test case? > > AFAICT from running it on CE (https://godbolt.org/z/ccq3vnbrM) llvm optimizes > it to essentially `*y = *x` and generates a 1-byte load+store for both NVPTX > and AMDGPU. The issue happens to more complicated test cases which I cannot reduce right now. Basically 8018d6be3459780e81a5da128a9915eb27909902 <https://reviews.llvm.org/rG8018d6be3459780e81a5da128a9915eb27909902> caused regressions in some PyTorch tests. Investigation shows the propagation of range metadata for bool type triggered some optimizations which caused some bool values to be loaded incorrectly. I will continue investigating the issue. However, I need a workaround for now. CHANGES SINCE LAST ACTION https://reviews.llvm.org/D135269/new/ https://reviews.llvm.org/D135269 _______________________________________________ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits