yaxunl added a comment.

In D135269#3837394 <https://reviews.llvm.org/D135269#3837394>, @tra wrote:

> Is there more info about the issue? What does AMDGPU currently emit for the 
> test case?
>
> AFAICT from running it on CE (https://godbolt.org/z/ccq3vnbrM) llvm optimizes 
> it to essentially `*y = *x` and generates a 1-byte load+store for both NVPTX 
> and AMDGPU.

The issue happens to more complicated test cases which I cannot reduce right 
now.

Basically 8018d6be3459780e81a5da128a9915eb27909902 
<https://reviews.llvm.org/rG8018d6be3459780e81a5da128a9915eb27909902> caused 
regressions in some PyTorch tests. Investigation shows the propagation of range 
metadata for bool type triggered some optimizations which caused some bool 
values to be loaded incorrectly. I will continue investigating the issue. 
However, I need a workaround for now.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D135269/new/

https://reviews.llvm.org/D135269

_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to