This is an automated email from the ASF dual-hosted git repository.
syfeng pushed a change to branch main
in repository https://gitbox.apache.org/repos/asf/tvm.git.
from 4087e72 [CUTLASS] Fix hardcoded include path and logic for
profile_all = False case (#9399)
add a6e90b9 [CUDA] Support memory reuse for dynamic shared memory (#9341)
No new revisions were added by this update.
Summary of changes:
src/driver/driver_api.cc | 3 +-
src/target/source/codegen_cuda.cc | 2 +-
.../merge_dynamic_shared_memory_allocations.cc | 487 +++++++++++++++++++--
...form_merge_dynamic_shared_memory_allocations.py | 85 +++-
4 files changed, 546 insertions(+), 31 deletions(-)