This is an automated email from the ASF dual-hosted git repository.

github-bot pushed a change to branch nightly
in repository https://gitbox.apache.org/repos/asf/tvm.git


    from 219ae85d4b [Runtime Patch] Add AbortSignal to fetchWithCache in 
ArtifactCacheTemplate interface (#17233)
     add 76b954a09e [3rdparty] Bump FlashInfer (#17236)
     add 21c12fb124 [Bugfix][Cutlass] fix cutlass instantiate attention 
template bugs (#17229)
     add cd09ab64b5 [Runtime] Reorganize PagedKVCache attn kernel invocation 
(#17237)

No new revisions were added by this update.

Summary of changes:
 3rdparty/flashinfer                                |   2 +-
 python/tvm/contrib/cutlass/attention_operation.py  |   2 +-
 src/relax/transform/fuse_ops.cc                    |   2 +-
 src/runtime/relax_vm/paged_kv_cache.cc             | 163 ++++++++++++---------
 ..._builtin_paged_attention_kv_cache_flashinfer.py |  13 +-
 ...runtime_builtin_paged_attention_kv_cache_tir.py |  13 +-
 6 files changed, 118 insertions(+), 77 deletions(-)

Reply via email to