[GitHub] [tvm] masahi commented on a diff in pull request #15656: [Hopper TMA] Add CUDA codegen support for bulk asynchronous copy

2023-09-05 Thread via GitHub
masahi commented on code in PR #15656: URL: https://github.com/apache/tvm/pull/15656#discussion_r1315592599 ## include/tvm/tir/builtin.h: ## @@ -645,14 +645,29 @@ TVM_DLL const Op& ptx_mma_sp(); TVM_DLL const Op& ptx_ldmatrix(); /*! - * \brief tvm intrinsics for ptx async

[GitHub] [tvm] masahi commented on a diff in pull request #15656: [Hopper TMA] Add CUDA codegen support for bulk asynchronous copy

2023-09-05 Thread via GitHub
masahi commented on code in PR #15656: URL: https://github.com/apache/tvm/pull/15656#discussion_r1315572243 ## python/tvm/tir/op.py: ## @@ -1458,16 +1512,42 @@ def ptx_arrive_barrier(barrier_arr, barrier_id): return call_intrin("", "tir.ptx_arrive_barrier", barrier_arr,