jayfoad wrote:

> Implemented compiler option `-mamdgpu-expand-waitcnt-profiling` to expand 
> `s_waitcnt` instructions with non-zero counter values into sequences for 
> PC-sampling profiling.

Why would you restrict this to "non-zero counter values"?

Why does this need a new subtarget feature?

https://github.com/llvm/llvm-project/pull/169345
_______________________________________________
cfe-commits mailing list
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
