krzysz00 wrote: > There actually are two variants. There is an existing non-async variant which > hides the counter and the compiler magically takes care of it.
As far as I'm aware, this is incorrect. TDM has a complex enough aliasing relationship that, as far as I'm aware, there is no attempt to insert tensorcnt waits, nor is there any plan for one. To be clear, this isn't about the gfx9 LDS DMAs - those have their "sync" variant for legacy / it uses vmcnt reasons. None of the gfx1250 DMAs have this, and, as I recall previous discussion, they're not *supposed* to have this. https://github.com/llvm/llvm-project/pull/200775 _______________________________________________ cfe-commits mailing list [email protected] https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
