This is an automated email from the ASF dual-hosted git repository.
github-bot pushed a change to branch nightly
in repository https://gitbox.apache.org/repos/asf/tvm.git
from c4f10cd5e9 [Runtime] Device API to query L2 cache size (#15332)
add e2d6511161 [Bugfix][Frontend][Keras]Fix a corner case bug in softmax
converter of keras frontend (#15337)
add c0946e19cd [Runtime] Flush L2 cache in time eval (#15305)
add 4b183daa97 [skipci] Fix typo in docs/arch/index.rst (#15312)
add a13b56a945 [OP] Add `rms_norm` into TOPI (#15326)
add d81e8809b8 [AOT] Avoid call_extern() with incorrect argument count
(#15301)
add 2eca9f0270 [TIR] Return error code from kernels in SplitHostDevice
(#15241)
No new revisions were added by this update.
Summary of changes:
docs/arch/index.rst | 4 +-
include/tvm/runtime/profiling.h | 4 +-
include/tvm/topi/nn/rms_norm.h | 96 ++++++++++++++++++++++
python/tvm/relay/frontend/keras.py | 6 +-
python/tvm/runtime/module.py | 5 ++
python/tvm/topi/nn/__init__.py | 1 +
python/tvm/topi/nn/{layer_norm.py => rms_norm.py} | 12 +--
python/tvm/topi/testing/__init__.py | 1 +
.../{layer_norm_python.py => rms_norm_python.py} | 20 ++---
src/relay/backend/aot_executor_codegen.cc | 38 ++++++++-
src/runtime/crt/common/crt_runtime_api.c | 5 +-
.../graph_executor/debug/graph_executor_debug.cc | 2 +-
src/runtime/profiling.cc | 16 +++-
src/runtime/rpc/rpc_module.cc | 28 +++----
src/tir/transforms/lower_device_kernel_launch.cc | 41 ++++++++-
src/tir/transforms/split_host_device.cc | 33 +++++++-
src/topi/nn.cc | 6 ++
tests/python/frontend/keras/test_forward.py | 7 ++
tests/python/topi/python/test_topi_rms_norm.py | 68 +++++++++++++++
.../test_tir_transform_split_host_device.py | 38 +++++++++
web/emcc/tvmjs_support.cc | 2 +-
21 files changed, 385 insertions(+), 48 deletions(-)
create mode 100644 include/tvm/topi/nn/rms_norm.h
copy python/tvm/topi/nn/{layer_norm.py => rms_norm.py} (81%)
copy python/tvm/topi/testing/{layer_norm_python.py => rms_norm_python.py} (78%)
create mode 100644 tests/python/topi/python/test_topi_rms_norm.py