This is an automated email from the ASF dual-hosted git repository.
masahi pushed a change to branch ci-docker-staging
in repository https://gitbox.apache.org/repos/asf/tvm.git
discard dad13d1c1b Testing PR13529.
add 49c92d9351 [TIR][Fix] Buffer slicing using index dtype as extent
(#13788)
add a530df8c8f [TIR][Fix] IndexDataTypeNormalizer not unwrapping float
casting (#13789)
add 38a9abb46c [TIR] Fix cache_write bug with allocate const node (#13792)
add 1acb907346 [FIX] Fix Typos in Docs and Comments (#13793)
add 125886350a [TOPI] Fix tuple unpack in conv2d NCHWc int8 (#13761)
add 3b49f7973d [FIX] Minor Compilation Warning Fixes (#13794)
add e25e61857c [Hexagon] Add hexagon user DMA intrins for tensorization
(#13719)
add c9b4016000 [CMSIS-NN] Remove support for the old CMSIS NN project
(#13760)
add 328122675d [TIR][TOPI][x86][CI] Support skylake avx512 (#13621)
add 2e2ae26898 [CI] Cross-compile libtvm_runtime to Aarch64 and run tests
(#13714)
add 7e3dc45fed [TVMScript] Migrate More to TVMScripr Printer (#13785)
add 20b1b80f25 [skip ci][ci][docker] Add cross compilation libs (#13800)
add 6c5be6fbd0 [TVMScript] `T.axis.remap` syntax sugar for TVMScript
printer (#13743)
add da99e9d1b5 [TVMScript] Use TVMScript for all TIR Printing (#13795)
add 0730422ff6 [microNPU] Upgrade to 22.08 version of Arm(R) Ethos(TM)-U
NPU drivers (#13529)
add 60358a145b [microNPU] Add hardware constraints for binary elementwise
(#13772)
add 02e8bbfab6 [Bugfix][TIR] Fix version conflict with typing for Python
3.8.0 (#13744)
add 5d2a947204 [TOPI] Making test_strided_set require a GPU for testing
(#13804)
add a75f110c0e [TIR]Fix the crash of the pass RemoveNoOp (#13808)
add 6e01f3d855 [Adreno] Update interface of AnnotateMemoryScope pass
(#13779)
add cc352a4c34 [ONNX] Extend converter for Attention from Microsoft
onnxruntime contrib opset (#13797)
add c2eee01e6e [Docker]Add dialout group by default on login (#13810)
add cfa65b26c1 [microTVM] Add tutorial on how to generate MLPerfTiny
submissions (#13783)
add 6c2d485a01 [TVMScript] `T.match_buffer` syntax sugar in arguments for
TVMScript printer (#13801)
add 5e36ae331c [TIR][Schedule] Improve cache_index to cache common
subexpressions (#13700)
add 8f80e42114 [TVMScript] `T.allocate` with `T.decl_buffer` syntax sugar
for TVMScript printer (#13813)
add 8f738d001b [TESTING] Enable execution of test_packed_8x8x32_resnet50
(#13799)
add c8973a5507 [COMMUNITY] alanmacd -> Reviewer (#13814)
add fe372a7309 [docker][microTVM]Update zephyr version to 3.2 and Zephyr
SDK to 0.15.2 (#13806)
add efa5a388ce [CI] Update rerun list for tvm-bot (#13817)
add d907de3b35 [Frontend][PaddlePaddle] Add topk op and Fix bug, when the
output is a dimension, it … (#13701)
add ac9fb98857 [TVMScript] Implicit root block syntax sugar for TVMScript
printer (#13819)
add cc7def073b [Bugfix][TIR] Fix version conflict with typing for
different Python versions (3.8.0-3.10.0) (#13820)
add b77d24c384 [microTVM][CRT]Separate CRT template project from
standalone CRT build (#13812)
add 35a66f69e1 [MetaSchedule] add fp16-16-32 TensorCores rule to default
settings (#13822)
add 1bc6dd40c2 [TVMScript] Add ObjectPath to LiteralDoc (#13821)
add e79fac6300 [Metaschedule] get_top_k should not return not built
records (#13824)
add b738d31b87 [Hexagon][Metaschedule] Add timeout_sec arg to
get_hexagon_local_builder (#13828)
add 1d89071863 [TVMScript] More concise `T.allocate` syntax printing
(#13830)
add 1f40b925a5 [TOOL][NATIVE] Android native application for deploy and
run (#13791)
add e77a6d1a05 [TVMScript] Introduce `PrinterConfig` (#13831)
add 127fb1e915 [Hexagon]Float and quantized dense operators with schedules
(#12873)
add 6fdb1b7e38 [RUNTIME] Fix determination of big/little cores domains
(#13832)
add fd3f8035c9 [docker][microTVM]Fix Zephyr 0.15.2 SDK installation and
separate Zephyr python environment (#13829)
add cbdadb96e3 [Adreno] Optimize reduction schedule (#13781)
add 2c109c53e0 [TVMScript] Default to T.Buffer than T.buffer_decl (#13838)
add 5692600961 [TVMScript] Consolidate folder structure (#13841)
add 26d3244fb8 Enhance the --help message of composite target (#13842)
add 239edb5158 [microTVM] Allow multiple runners in tuning micro models
with meta-schedule (#13811)
add 697fdb2cb7 [TVMScript] Comments and docstrings printing (#13839)
add f7dfef4cde [ONNX] QGemm support (#13747)
add 4ddb87631b [microTVM] Fix host-driven AOT memory workspaces (#13807)
add ec72ac6690 [ROCM] Fixes compiling on ROCM 5 and accuracy on dense op
(#13847)
add 18b7dc1dd9 [MetaSchedule] Fix for RewriteLayout + AllocateConst when
the rank of the rewritten weight doesn't change (#13851)
add 56771a87d1 [CLML][RUNTIME] Enable more ops in CLML runtime (#13834)
add 2bfdcbe07a [Relay] Convert negative axes to positive when importing
ONNX Unsqueeze (#13846)
add 16b19582a2 [ETHOSN] Apply FoldConstant before NPU partitioning (#13848)
add 95fa22308b [Hexagon][CI] Updated sha for builder LLVM (#13418)
add c2cc01910c [microTVM] Update tutorials (#13845)
add 1bc8cf80d0 [ONNX] Support Bernoulli op on ONNX front-end (#13802)
add 0c2ab1bb42 [Arith] Support eq in detect_clip_bound (#13746)
add acde2c1fc3 [topi] remove comment redundancy in resize.py (#13860)
add 10d6c17238 [WASM][FIX] test tests/node/websock_rpc_test.py (#13862)
add 3c81d9bbac [RUNTIME][CLML] OpenCLML tuning and profiling enhanced
(#13843)
add e6262bf985 [ONNX] Support SequenceLength op (#13863)
add 76c5186e15 Enable C++17 for cmake modules (#13869)
add 803207c256 [Hexagon][CI] Update the docker image ID to reflect newer
LLVM (#13870)
add c81aaa852c [LLVM] Remove call to EmitDebugLocation from AddAliasInfo
(#13872)
add 558c994d1a [TIR][FIX] check args size when creating prim_func by
runtime::Registry (#13809)
add 4daf38f691 [TVMScript] Robustify the Highlight Printer (#13861)
add 0d5baacc02 [ONNX] Support SequenceErase op (#13865)
add d8833bdbbe [ONNX] Support SequenceEmpty op (#13866)
add 5e652c1a7a [CI] NNPACK build issue workaround (#13873)
add 73740385a9 [microTVM]Refactor test and add skip to current failing
tests/boards (#13858)
add 206f085b03 [TIR][TOPI][CI] Fix number of arguments in calls of
llvm_pure_intrin (#13881)
add c3fe08fb20 [ARITH] support floordiv in deduce bound (#13880)
add 7db77ada3d [Build] Fix misleading error messages (#13887)
add a1229f6fe4 [TIR] Handle nullptr returned by FindEntryFunc (#13852)
add d6f78b1acc [microTVM] Update Zephyr version and Zephyr SDK version
(#13818)
add 9bbc2c0ab7 [MetaSchedule] Use `shared.dyn` for Tensor Core Schedule
Rules (#13891)
add 2ced7a72a1 [microTVM] Clean-up test_crt.py and add to pylint (#13886)
add 2877c5a3cf [COMMUNITY] Wrongtest -> PMC (#13893)
add ba936e9547 [CUTLASS][Ansor] Combine CUTLASS and Ansor (#13879)
add 49849c8c3e Extend the USE_LIBBACKTRACE option (#13816)
add f0ea9e461a [RUNTIME] Fix the manual determination of cores in
FillDataForMeasure (#13849)
add 9008ec21ba [VM][DMLC] Lower memory usage when loading and dumping
weights (#13877)
add 7aecc1a44d [Torch] Fix advanced indexing with NoneType index arguments
(#13826)
add 37e1a6862c [QNN][Relay][Topi] Add qnn.dense with weight layout (#13854)
add ea34e6eb0b [TOPHUB] use keys as a keyword for searching of existing
statistics (#13874)
add 099ed94951 [OpenCL] Implement save/load pre-compiled programs (#13868)
add a89ff3e62f [tir] fix buffer_decl buffer allocation (#13906)
add 666006e926 [Doc] fix doc for tvm.te.const() (#13904)
add d35a8ab135 [CLML][CODEGEN] CLML native codegen utility (#13837)
add 209f074e96 [CI] Pass the 'path' parameter passed to cmake_build to the
task_build.py script (#13905)
add 583f5ae37c [ONNX] Support Bitwise operations (#13888)
add 62a69a6c93 [TVMC] Stop printing a wall of warnings with tvmc tune
(#13882)
add fb2315a266 [Fix][Arith] Analyzer simplification starts with canonical
(#13875)
add e34506caaa Fix numpy version constraint (#13912)
add 98008c2d67 [Fix][TVMScript] Fix `LetStmt` printing logic (#13900)
add f7aeaf1d38 [TVMScript] Connect `assert_structural_equal` with new
TVMScript printer (#13859)
add cec5f0bc10 [microNPU] Add rescale parameters for binary elementwise
(#13890)
add 60fcb0dede Revert the changes for NNPACK build issue (#13913)
add 5456fae776 [TVMScript,Fix] Fix findsource when classes are indented
(#13924)
add 282f17584b [Arith] Updated incorrect simplification rule (#13922)
add 7323e1d16f [OpenCL] Refactor OpenCL init function (#13919)
add c8b99a93a1 Fix build platform environment variable (#13914)
add 7149142781 [TVMScript] Schedule error reporting with new TVMScript
printer (#13921)
add 6fb961a945 [CI] Include static builds of the runtime as part of CI
(#13612)
add c36ae1cb3a [Arith][UnitTest] Parametrize tests of RewriteSimplifier
(#13923)
add 2e30e772a7 [CMSIS-NN] Add int16 add and mul operator support (#13920)
add 3ee29e38d8 [PyTorch] Fix in matmul function that enables working with
all sizes … (#13927)
add 36f45bb562 [Metaschedule,Fix] Move x86 feature detection to target
directory for metaschedule (#13925)
add 45a92df36a [TVMScript] Support `show_meta` (#13934)
add cf721d4c34 [TIR] Allow TransformLayout index_map to contain RVs
(#13930)
add 1de5c728ee [ETHOSN] Support for addition with constant input (#13931)
add 5cf3405033 [Frontend][Tensorflow] Update Select to SelectV2 (#13884)
add cff4568b8a [microNPU] Merge LUT activation with binary elementwise
operation (#13935)
add 142ea59fbf [Adreno] Extend pack_filter for HWIO layout (#13939)
add 6f0e2edd71 [Torch] Added tests in test_forward_linear (#13937)
add 256bad71ec [TVMScript][UX] Introduce decorator for deprecation (#13941)
add 3b141b9d25 [TIR] Expose bitwise ops to python (#13945)
add 0e8f52b631 [ci] Disable Windows-Static-Runtime (#13951)
add 2dd52dae3a [TensorRT] Fix BiasAdd with correct axis attribute (#13953)
add 78ea7958c9 [CI] Fix for NNPack error due to misalignment with
pthreadpool library (#13940)
add 49b6c3ae4e [Minor] Change file mode 755 -> 644; EOL CRLF -> LF (#13959)
add a49a7fee49 [Relay][Pass] Separate out the graph partitioning code from
fuse_ops.cc (#13964)
add 43c281053e [MetaSchedule] Fix a typo in MemoryDatabase (#13928)
add 09f38ac91c [TVMScript][Fix] Print Multi-line String as Metadata
(#13965)
add 8401bf6987 [TVMScript] Deterministic function ordering (#13962)
add 325161964b [TVMScript] Enable Safe Autocasting in BufferStore (#13960)
add 8b2b1655cd [Fix][MetaSchedule] RPCRunner timeout when queueing up
(#13963)
add a5a6e7fa1b Fix out of bound enum conversion (#13967)
add dc626f33e3 [TVMScript] Unify `T.handle` and `T.Ptr` (#13969)
add 82cf9f72d6 [TVMScript] Simplify TIR Var Definition (#13970)
add bea4919a75 [TVMScript] Encourage using T.Buffer directly (#13971)
add 266ff51d2a [ONNX] Extend reduction types supported by ScatterND
(#13946)
add a1dc4b91a4 [microTVM] Refactor required external functions in CRT to
platform-template.c (#13885)
add 53f4b2cce9 [CI] Enable USE_MICRO in minimal cross ISA build (#13942)
add b371c4625d [Hexagon] Improve cache management strategy for
HexagonBuffer (#13883)
add b5e0f6b8bb [Diagnostic] Support constructing Diagnostic Error through
ObjectRef (#13977)
add 74b9720880 [TOPI][Fix] Pool must return error if layout is tiled on H,
W, D dimensions (#13975)
add c6ce283fde [UX] Make T.prim_func typecheck as staticmethod (#13980)
add f6cebb5d60 [microTVM] additional refactoring for enabling USE_MICRO in
more builds (#13909)
add e516eaaea1 [testing][py_converter] Enhance py_converter to better
support entire modules (#13769)
add affcc11c2e [Keras] Fix importing conv2d_transpose for NHWC layout
(#13998)
add 1e5a830784 [microTVM]Refactor crt_config.h header file generation
(#13955)
add a6f4694bf9 [CI][Flaky] Skip zephyr_qemu-x86 tests that are part of
task_python_microtvm (#14005)
add 967d2bb807 [Build] Fix find_include_path when using TVM python package
(#14007)
add aa927df6b3 [Metaschedule] EvolutionarySearchNode::State constructor
typo fix (#14002)
add e7ad4bc071 [CLML] Changes corresponding to OpenCL workspace
refactorization (#13972)
add 0dd3d4aab6 [ACL][TESTING] Use pytest.mark.parametrize in ACL conv2d
tests (#14011)
add 0c965f442a [ONNX] Support ScatterElements with reduction (#13894)
add d7253fbc7e [TIR] Add cp.async support for tir.if_then_else (#13966)
add d12a636897 Refactor test to make it easier for user to understand how
tensor_intrin works (#14017)
add b1397367fc [FIX][RUNTIME] Convert container with function value type
(#14024)
add a543f6bebf [Arith] Implemented PMatchesOneOf and matches_one_of
(#13933)
add 14bc5e4585 [FIX][TOPI] Clip with IntImm/FloatImm (#14027)
add 6f232f91be [TIR] [Bugfix] Pass the correct block_sref_reuse to Replace
(#14023)
add ac57b013b2 [TIR] Update block flags and simplify predicate in
Reverse-Compute-Inline (#14030)
add 8613c79aa0 [TIR] Enable Host Func Attribute for PrimFunc (#14020)
add 054c11e101 [TOPI] Fix index dtype in topi strided_slice (#14022)
add 87bb8b1be8 [TIR] Introduce Pass InjectPTXLDG32 (#13973)
add 7249904622 [AutoScheduler][AutoTVM] Enable xgboost >= 1.7.x new
changes (#14036)
add 9f28b1da7a [WEB] Update web runtime to support latest emcc (#14046)
add c429a3b6a1 [WEB] WebGPU Codegen (#14048)
add 697c724e92 [Runtime] Add missing Type2Str for TVMByteArray (#14051)
add 5562d906f9 [BYOC] Update CUTLASS backend (SIMT support and codegen
clean up) (#14056)
add f80771f556 [Frontend][Paddle] Add norm and one_hot_v2 op (#14049)
add 5ec33bb558 [TOPI] Support non-batch cases for topi.nll_loss (#14060)
add 347d79cbcf [microTVM][CRT]Move Makefile to CMake to be cross-platform
compatible (#14013)
add 49e098971f [APP] Update android_rpc build tools version (#14052)
add b59da5904c [Target] Add target tags for Apple Silicon GPU (#14068)
add fbd641c92b [TOPI] Bugfix arm_cpu schedule_conv2d_spatial_pack_nhwc
schedule (#14003)
add 19d970cab1 [MetaSchedule] Fix anchor-block flow with empty design
space generator (#14047)
add a06c36c5b6 [CI] Fix test skipping pytest attribute (#14064)
add 0a269defe1 [Codegen][CUDA] Add error message for missing fragment info
(#14073)
add bd26c24c69 [MetaSchedule] MutateTileSize skip single-candidate
SampleCategorical (#14072)
add 6191711a16 [COMMUNITY] mkatanbaf -> Reviewer (#14085)
add 330fe8d17c [CRT] USE CMake for CRT standalone libraries (#14025)
add 5f610f45a0 [CI] Update Compute Library to v22.11 (#14084)
add 3cdb079407 [TORCH] scatter_reduce implementation (#14018)
add 1e988a4a6f [WEB] Reduce memleak in web runtime (#14086)
add b732e3c021 [AoT]Add get_input_name function to AoT Module (#14071)
add a3b51f11b2 [microTVM]Add test for MLPerfTiny models (#13978)
add be2bbe2cfb [COMMUNITY] Add new key for release signing (#14105)
add e0a825d636 [microTVM][Zephyr] Remove unnecessary use of
generate_c_interface_header (#14091)
add a8c5804c49 [CI] Add onnx dependency to
test_auto_tensorize.py::test_vnni_bert_int8 (#14102)
add c64f46212c [TOPI][Relay][ONNX] Replace scatter_add by
scatter_elements(reduction="add") (#14008)
add 125bbb271b [LLVM] Add support for DeclBufferNode (#14103)
add d5806eca76 [Frontend][Paddle] Add where_index op and add vm for paddle
frontend's unitest (#14099)
add 9b6df18f3a [TORCH][TOPI] Support mean reduction for scatter_reduce
(#14110)
add 265bebf65f [Runtime] Fix Potential DeviceAPIManager Memory Bug (#14114)
add 74bcca2535 [Docker] Update ci-cpu and ci-arm to tag
20230223-070143-a3b51f11b (#14116)
add f7165a1328 [microTVM] Fix tvmc tutorial (#14076)
add 10fb8c52d9 [MetaSchedule] Introduce Async Pipeline in MultiLevelTiling
(#14009)
add 9fab56c4c1 [TVMScript] Use op attribute to control whether to print
dtype in TVMScript (#14111)
add 1ad1994f5f [Fix][TVMScript] Fix index of metadata in printed script
(#14130)
add f21a17b67c [Pytorch] frontend full_impl fix (#14122)
add d9b0a80e1b [DOCKER] Configurable NDK version support (#14000)
add 54a62c1b53 [Fix][TIR] SampleCategorical apply-to-schedule (#14133)
add 74603eeac3 [Arith] ConstIntBound was incorrectly assuming bounds were
over int64… (#13918)
add 0e046daf9e [CMSIS-NN] Reduction in code size of AOT test runner binary
(#13815)
add 77df6e8d7c [CMSIS-NN] Add a runtime error message (#13643)
add bf589f3d11 [CRT]Cleanup unused macros in crt_config.h.template (#14125)
add 663f7ae77b [Fix][Relay] Fix axis transformation in squeeze shape
function (#14135)
add 4d152fe7c2 [Unittest] merge test_cp_async_in_if_then_else into
test_tir_transform_inject_ptx_async_copy (#14138)
add 2feb243bb7 [Frontend][TFLite] Fix conv2d import bug (#14124)
add 6097df5307 [ONNX][TORCH] Replace scatter op by scatter_elements
(#14019)
add 2b2cb96733 [TVMScript][Printer] Remove relax prefix for now (#14140)
add 7d67bb1be4 [microNPU] Sum legalization support (#13997)
add 7c06de52a1 [Fix][MetaSchedule] Fix redundant stages in async pipeline
for mlt (#14143)
add 428400c6e0 [COMMUNITY] Cheng Wen -> Reviewer (#14153)
add 1043136c9f [Runtime] Fix high RAM usage when saving / loading
paramters of big models (#14147)
add e9cf04e0e4 [Relay][Frontend] Span Filling PyTorch (#14050)
add 6c04ac52bd [TRT][BYOC] allow strided_slice ops on selected dimensions
(#14142) (#14144)
add 69acdfb042 [ONNX][TOPI] Add `DFT` operator (#13999)
add 908dc8f8ab [CRT][microTVM] Enable USMP by default for AoTExecutor +
CRT runtime (#14107)
add 25f4d06c55 [Android] Fix using system libraries in Android apps
(#14145)
add 05cbe329d8 [microTVM]Enable TVMC micro with AoT Executor (#14077)
add bd8e7d3bd3 [bugfix] Fix the write buffer scope of `mma_store_impl`
(#14174)
add cb37b82608 [Relay] Enhance EliminateCommonSubexpr to support Tuple
argument (#14169)
add 91dc8efe0f [TIR] Fix typo in doc (#14178)
add a42e98b195 [microTVM] Use QNN schedules to give SOTA performance
(#13752)
add bc92a3ff66 Add v0.11.0 docs link to site (#14181)
add df429c58d8 [TIR] Allow TransformLayout with non-inversible index map
(#14095)
add c0f148a231 [TIR][Analysis] Implement IdentifyMemCpy analysis function
(#13947)
add 736cecab3f [HotFix][MetaSchedule] Turn off database shash check
(#14188)
add 22c47ee6de [TOPI] Batch Norm Training Mode (#14190)
add baedf7f04d [TOPI] Group normalization (#14193)
add befdc4e631 [Fix][TIR] LowerCrossThreadReduction with write-back
predicate (#14199)
add a15ade30b1 [TVMC] Add option to dump TIR code to file (#14186)
add 424c749a3d [MetaSchedule] Tile and pack intermediate output for CUDA
TensorCore (#14108)
add e1c2ad18e9 [TVMScript] Distinguish LetStmt and Let expression (#14207)
add 9b9124721c [Frontend][PaddlePaddle] Fix bug in tests for upgrading
paddlepaddle to 2.4.2 (#14206)
add 012d6a72f6 [IR] Platform-independent SHash (#14204)
add 2f2d5d439a [MetaSchedule] preseve global_symbol attached to function
after applying MS (#14219)
add be66a7e0e4 [TVMScript] Sugar T.env_thread + T.launch_thread (#14217)
add 2c4af88563 [TVMScript] Sugar Var Definition in TIR Buffer (#14223)
add 082c4432fb [Refactor] Move `VarUseDefAnalysis` to header file (#14185)
add ca48cafae3 [Docker] Add script to build llvm from source (#13823)
add 56ddd37d78 [TIR] Enhance loop unroll with unroll local access (#14224)
add e59d1efc68 [Fix][TVMScript]TVMScript BinOP printing refactor (#14200)
add 9d732d0faa [TensorIR][Primitive] New schedule primitive
`reindex_cache_read/write` (#14161)
add e8576db88b [Fix]Fix function ObjectPath in IRModule SEqual (#14230)
add 902c2e2db7 [TVMScript] Support SizeVar Roundtripping (#14227)
add 1453fe0780 [PaddlePaddle Hackathon 4][Frontend][Paddle]Add
tile/mish/stack/unstack/silu/softshrink/where op for paddle frontend (#14160)
new 42b30ff2c5 update cpu and gpu image
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (dad13d1c1b)
\
N -- N -- N refs/heads/ci-docker-staging (42b30ff2c5)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
.asf.yaml | 1 +
.github/workflows/main.yml | 35 +
.gitmodules | 3 +
3rdparty/cnpy | 1 +
3rdparty/cutlass | 2 +-
CMakeLists.txt | 45 +-
CONTRIBUTORS.md | 5 +-
KEYS | 60 +
LICENSE | 1 +
apps/android_camera/README.md | 15 +-
apps/android_camera/app/build.gradle | 11 +-
.../app/src/main/AndroidManifest.xml | 6 +-
apps/android_camera/build.gradle | 4 +-
apps/android_camera/models/prepare_model.py | 2 +-
apps/android_deploy/README.md | 25 +-
apps/android_deploy/app/build.gradle | 25 +-
apps/android_deploy/app/download-models.gradle | 18 +
.../app/src/main/AndroidManifest.xml | 6 +-
apps/android_deploy/build.gradle | 4 +-
apps/android_rpc/README.md | 21 +-
apps/android_rpc/app/build.gradle | 24 +-
apps/android_rpc/app/src/main/AndroidManifest.xml | 6 +-
apps/android_rpc/build.gradle | 4 +-
apps/bundle_deploy/Makefile | 36 +-
apps/bundle_deploy/build_model.py | 2 +
apps/bundle_deploy/crt_config/crt_config.h | 53 -
apps/cpp_clml/CMakeLists.txt | 61 +
apps/cpp_clml/README.md | 145 ++
apps/cpp_clml/clml_runner.cc | 818 +++++++++
apps/cpp_clml/clml_runner.h | 262 +++
apps/cpp_clml/main.cc | 243 +++
apps/cpp_clml/scripts/clml_codegen.py | 64 +
apps/cpp_rtvm/CMakeLists.txt | 98 +
apps/cpp_rtvm/README.md | 368 ++++
apps/cpp_rtvm/main.cc | 273 +++
.../cpp_rtvm/scripts/download_models.py | 29 +-
apps/cpp_rtvm/tvm_runner.cc | 347 ++++
apps/cpp_rtvm/tvm_runner.h | 97 +
.../template_project/crt_config/crt_config.h | 57 -
.../template_project/launch_microtvm_api_server.sh | 28 +-
.../template_project/microtvm_api_server.py | 23 +-
.../src/example_project/{model.c => platform.c} | 25 +-
.../src/example_project/{model.h => platform.h} | 4 -
.../src/example_project/project.ino | 4 +-
.../host_driven/{model_support.c => platform.c} | 12 +-
apps/microtvm/cmsisnn/Makefile | 10 +-
apps/microtvm/cmsisnn/convert_image.py | 8 +
apps/microtvm/cmsisnn/include/crt_config.h | 26 -
apps/microtvm/ethosu/Makefile | 11 +-
apps/microtvm/ethosu/convert_labels.py | 7 +
apps/microtvm/ethosu/include/crt_config.h | 26 -
.../template_project/CMakeLists.txt.template | 39 +-
.../template_project/crt_config/crt_config.h | 67 -
.../template_project/launch_microtvm_api_server.sh | 19 +-
.../zephyr/template_project/microtvm_api_server.py | 110 +-
.../src/aot_standalone_demo/main.c | 196 +-
.../src/aot_standalone_demo/platform.c | 126 ++
.../src/aot_standalone_demo/zephyr_uart.c | 87 -
.../src/host_driven/fvp/semihost.c | 2 +-
.../zephyr/template_project/src/host_driven/main.c | 166 +-
.../template_project/src/host_driven/platform.c | 155 ++
.../zephyr_uart.h => mlperftiny/platform.cc} | 53 +-
.../src/mlperftiny/submitter_implemented.cc | 175 +-
.../template_project/src/mlperftiny/tvmruntime.cc | 164 --
.../template_project/src/mlperftiny/tvmruntime.h | 62 -
.../template_project/src/mlperftiny/zephyr_uart.cc | 89 -
.../template_project/src/mlperftiny/zephyr_uart.h | 51 -
apps/microtvm/zephyr_cmsisnn/CMakeLists.txt | 47 +-
apps/microtvm/zephyr_cmsisnn/include/crt_config.h | 54 -
.../zephyr_cmsisnn/model/convert_labels.py | 14 +
apps/microtvm/zephyr_cmsisnn/src/main.c | 4 +-
apps/pt_tvmdsoop/tests/test_as_torch.py | 2 +-
apps/pt_tvmdsoop/tests/test_boolean_tensor.py | 8 +-
ci/jenkins/docker-images.ini | 12 +-
ci/jenkins/generated/arm_jenkinsfile.groovy | 6 +-
ci/jenkins/generated/cortexm_jenkinsfile.groovy | 4 +-
ci/jenkins/generated/cpu_jenkinsfile.groovy | 6 +-
ci/jenkins/generated/docker_jenkinsfile.groovy | 4 +-
ci/jenkins/generated/gpu_jenkinsfile.groovy | 4 +-
ci/jenkins/generated/hexagon_jenkinsfile.groovy | 4 +-
ci/jenkins/generated/i386_jenkinsfile.groovy | 6 +-
ci/jenkins/generated/lint_jenkinsfile.groovy | 4 +-
...groovy => minimal_cross_isa_jenkinsfile.groovy} | 43 +-
ci/jenkins/generated/minimal_jenkinsfile.groovy | 6 +-
ci/jenkins/generated/riscv_jenkinsfile.groovy | 4 +-
ci/jenkins/generated/wasm_jenkinsfile.groovy | 4 +-
ci/jenkins/templates/arm_jenkinsfile.groovy.j2 | 2 +-
ci/jenkins/templates/cpu_jenkinsfile.groovy.j2 | 2 +-
ci/jenkins/templates/i386_jenkinsfile.groovy.j2 | 2 +-
....j2 => minimal_cross_isa_jenkinsfile.groovy.j2} | 29 +-
ci/jenkins/templates/minimal_jenkinsfile.groovy.j2 | 2 +-
ci/jenkins/templates/utils/Build.groovy.j2 | 2 +-
ci/scripts/github/github_tvmbot.py | 1 +
ci/scripts/jenkins/s3.py | 3 +
cmake/config.cmake | 18 +
cmake/libs/Libbacktrace.cmake | 4 -
cmake/modules/Arduino.cmake | 79 -
cmake/modules/LibInfo.cmake | 1 +
cmake/modules/Logging.cmake | 106 +-
cmake/modules/Micro.cmake | 102 ++
cmake/modules/StandaloneCrt.cmake | 103 +-
cmake/modules/Zephyr.cmake | 4 +-
cmake/modules/contrib/CLML.cmake | 2 +-
.../utils/CRTConfig.cmake | 22 +-
cmake/utils/FindLLVM.cmake | 12 +-
docker/Dockerfile.ci_adreno | 8 +-
docker/Dockerfile.ci_arm | 4 +-
docker/Dockerfile.ci_cortexm | 7 +-
docker/Dockerfile.ci_cpu | 3 -
docker/Dockerfile.ci_hexagon | 2 -
docker/Dockerfile.ci_riscv | 8 +-
docker/Dockerfile.demo_android | 2 -
docker/build.sh | 13 +
.../ubuntu_download_arm_compute_lib_binaries.sh | 2 +-
docker/install/ubuntu_init_zephyr_project.sh | 2 +-
docker/install/ubuntu_install_androidsdk.sh | 43 +-
.../install/ubuntu_install_ethosu_driver_stack.sh | 3 -
docker/install/ubuntu_install_hexagon.sh | 5 +-
docker/install/ubuntu_install_llvm_from_source.sh | 100 +
docker/install/ubuntu_install_nnpack.sh | 8 +-
docker/install/ubuntu_install_paddle.sh | 2 +-
docker/install/ubuntu_install_zephyr.sh | 31 +-
docker/install/ubuntu_install_zephyr_sdk.sh | 19 +-
docker/python/ci-constraints.txt | 2 +-
docker/with_the_same_user | 1 +
docs/conf.py | 19 +-
docs/dev/how_to/debugging_tvm.rst | 2 +-
docs/install/from_source.rst | 4 +-
docs/script_convert.py | 28 +-
docs/topic/microtvm/index.rst | 11 +-
.../how_to/work_with_microtvm/install_zephyr.rst | 14 +-
gallery/how_to/work_with_microtvm/micro_aot.py | 21 +-
.../how_to/work_with_microtvm/micro_autotune.py | 13 +-
gallery/how_to/work_with_microtvm/micro_ethosu.py | 6 +-
.../how_to/work_with_microtvm/micro_mlperftiny.py | 304 ++++
gallery/how_to/work_with_microtvm/micro_pytorch.py | 20 +-
.../work_with_microtvm/micro_reference_vm.py | 159 --
gallery/how_to/work_with_microtvm/micro_tflite.py | 72 +-
gallery/how_to/work_with_microtvm/micro_train.py | 14 +-
gallery/how_to/work_with_microtvm/micro_tvmc.sh | 76 +-
.../work_with_relay/using_pipeline_executor.py | 8 +-
gallery/tutorial/autotvm_relay_x86.py | 2 +-
gallery/tutorial/tvmc_command_line_driver.py | 2 +-
include/tvm/ir/diagnostic.h | 29 +-
include/tvm/ir/expr.h | 4 +-
include/tvm/ir/module.h | 82 +-
include/tvm/ir/{span.h => source_map.h} | 96 +-
include/tvm/ir/transform.h | 1 -
include/tvm/ir/type.h | 21 +-
include/tvm/ir/type_functor.h | 4 +-
include/tvm/meta_schedule/database.h | 7 +-
include/tvm/meta_schedule/mutator.h | 2 -
include/tvm/meta_schedule/postproc.h | 4 +-
include/tvm/meta_schedule/schedule_rule.h | 4 +-
include/tvm/node/reflection.h | 2 +-
include/tvm/node/repr_printer.h | 35 +
include/tvm/node/script_printer.h | 121 ++
include/tvm/node/structural_equal.h | 11 +-
include/tvm/node/structural_hash.h | 74 +-
include/tvm/parser/source_map.h | 119 --
include/tvm/relay/attrs/transform.h | 32 +-
include/tvm/relay/base.h | 30 +-
include/tvm/{ir => relay}/error.h | 16 +-
include/tvm/relay/expr.h | 3 +-
include/tvm/relay/expr_functor.h | 2 +-
include/tvm/{parser => relay}/parser.h | 16 +-
include/tvm/relay/pattern_functor.h | 2 +-
include/tvm/relay/transform.h | 9 +-
include/tvm/runtime/c_runtime_api.h | 7 +
include/tvm/runtime/container/string.h | 73 +-
include/tvm/runtime/crt/aot_executor.h | 10 +
include/tvm/runtime/crt/microtvm_rpc_server.h | 2 +
include/tvm/runtime/crt/platform.h | 9 +
include/tvm/runtime/metadata_base.h | 5 +-
include/tvm/runtime/ndarray.h | 8 -
include/tvm/runtime/packed_func.h | 4 +
include/tvm/script/ir_builder/tir/ir.h | 44 +-
include/tvm/script/printer/doc.h | 88 +-
include/tvm/script/printer/ir_docsifier.h | 65 +-
include/tvm/script/printer/ir_docsifier_functor.h | 3 +
include/tvm/script/printer/printer.h | 91 -
include/tvm/target/virtual_device.h | 18 +-
include/tvm/tir/analysis.h | 39 +-
include/tvm/tir/builtin.h | 12 +
include/tvm/tir/expr.h | 3 -
include/tvm/tir/function.h | 17 +-
include/tvm/tir/index_map.h | 23 +-
include/tvm/tir/op.h | 9 +
include/tvm/tir/op_attr_types.h | 21 +
include/tvm/tir/schedule/schedule.h | 43 +-
include/tvm/tir/stmt.h | 2 +
include/tvm/tir/transform.h | 10 +-
include/tvm/tir/var.h | 7 +
include/tvm/topi/elemwise.h | 49 +-
include/tvm/topi/nn.h | 28 +
include/tvm/topi/nn/group_norm.h | 151 ++
include/tvm/topi/nn/pooling.h | 44 +-
include/tvm/topi/transform.h | 16 +-
python/gen_requirements.py | 4 +-
python/tvm/_ffi/libinfo.py | 8 +-
python/tvm/auto_scheduler/compute_dag.py | 0
python/tvm/auto_scheduler/cost_model/xgb_model.py | 202 ++-
python/tvm/auto_scheduler/relay_integration.py | 8 +-
python/tvm/autotvm/measure/measure_methods.py | 2 +-
python/tvm/autotvm/tophub.py | 10 +
python/tvm/autotvm/tuner/xgboost_cost_model.py | 201 ++-
python/tvm/{parser => contrib/cutlass}/_ffi_api.py | 5 +-
python/tvm/contrib/cutlass/build.py | 65 +-
python/tvm/contrib/cutlass/conv2d_operation.py | 186 ++
python/tvm/contrib/cutlass/conv2d_profiler.py | 6 +-
python/tvm/contrib/cutlass/gemm_operation.py | 89 +
python/tvm/contrib/cutlass/gen_gemm.py | 8 +-
python/tvm/contrib/cutlass/gen_tensor_op.py | 232 ++-
python/tvm/contrib/emcc.py | 13 +-
python/tvm/contrib/hexagon/hexagon_profiler.py | 0
python/tvm/contrib/hexagon/meta_schedule.py | 11 +-
.../micro/meta_schedule/rpc_runner_micro.py | 108 +-
python/tvm/contrib/rocm.py | 4 +-
python/tvm/driver/tvmc/autotuner.py | 3 +-
python/tvm/driver/tvmc/compiler.py | 54 +-
python/tvm/driver/tvmc/composite_target.py | 5 +
python/tvm/driver/tvmc/model.py | 12 +
python/tvm/driver/tvmc/runner.py | 10 +-
python/tvm/driver/tvmc/target.py | 2 +-
python/tvm/ir/__init__.py | 55 +-
python/tvm/ir/affine_type.py | 7 +-
python/tvm/ir/base.py | 102 +-
python/tvm/ir/expr.py | 34 +-
python/tvm/ir/module.py | 84 +-
python/tvm/ir/op.py | 31 +-
python/tvm/ir/tensor_type.py | 7 +-
python/tvm/ir/type.py | 6 +-
python/tvm/meta_schedule/database/database.py | 2 +-
python/tvm/meta_schedule/runner/rpc_runner.py | 6 +-
.../tvm/meta_schedule/space_generator/__init__.py | 2 +
.../tvm/meta_schedule/testing/space_generation.py | 2 +-
python/tvm/meta_schedule/tune_context.py | 3 +-
python/tvm/micro/__init__.py | 2 +
python/tvm/micro/build.py | 27 +-
python/tvm/micro/contrib/stm32/__init__.py | 40 +-
python/tvm/micro/model_library_format.py | 21 +-
python/tvm/micro/project_api/client.py | 14 +-
python/tvm/micro/session.py | 18 +
python/tvm/micro/testing/pytest_plugin.py | 7 +-
python/tvm/micro/testing/utils.py | 52 +-
python/tvm/parser.py | 53 +
python/tvm/relay/__init__.py | 4 +
.../_ffi_api.py => relay/_ffi_api_parser.py} | 5 +-
python/tvm/relay/backend/contrib/ethosu/codegen.py | 1 +
.../tvm/relay/backend/contrib/ethosu/legalize.py | 82 +
.../contrib/ethosu/op/binary_elementwise.py | 18 +
.../contrib/ethosu/te/binary_elementwise.py | 25 +-
.../tvm/relay/backend/contrib/ethosu/te/pooling.py | 11 +-
.../contrib/ethosu/tir/binary_elementwise.py | 33 +-
.../tvm/relay/backend/contrib/ethosu/tir/spec.py | 12 +
.../backend/contrib/ethosu/tir_to_cs_translator.py | 12 +-
python/tvm/relay/base.py | 39 +-
python/tvm/relay/build_module.py | 2 +-
python/tvm/relay/dataflow_pattern/__init__.py | 30 +
python/tvm/relay/expr.py | 46 +-
python/tvm/relay/frontend/common.py | 4 +
python/tvm/relay/frontend/keras.py | 2 +
python/tvm/relay/frontend/oneflow.py | 2 +-
python/tvm/relay/frontend/onnx.py | 1027 ++++++++---
python/tvm/relay/frontend/paddlepaddle.py | 190 +-
python/tvm/relay/frontend/pytorch.py | 533 +++++-
python/tvm/relay/frontend/pytorch_utils.py | 2 +-
python/tvm/relay/frontend/qnn_torch.py | 4 +-
python/tvm/relay/frontend/tensorflow_ops.py | 30 +-
python/tvm/relay/frontend/tflite.py | 7 +-
python/tvm/relay/function.py | 35 +-
python/tvm/relay/op/_transform.py | 42 +-
python/tvm/relay/op/contrib/clml.py | 790 +++++++-
python/tvm/relay/op/contrib/cmsisnn.py | 31 +-
python/tvm/relay/op/contrib/cutlass.py | 8 +-
python/tvm/relay/op/contrib/dnnl.py | 23 +-
python/tvm/relay/op/contrib/ethosn.py | 1 +
python/tvm/relay/op/contrib/ethosu.py | 230 ++-
python/tvm/relay/op/contrib/tensorrt.py | 16 +-
python/tvm/relay/op/nn/_nn.py | 17 -
python/tvm/relay/op/op_attrs.py | 11 +-
python/tvm/relay/op/strategy/cuda.py | 55 +-
python/tvm/relay/op/strategy/generic.py | 61 +-
python/tvm/relay/op/strategy/rocm.py | 14 +-
python/tvm/relay/op/transform.py | 81 +-
python/tvm/{parser/__init__.py => relay/parser.py} | 22 +-
python/tvm/relay/qnn/op/_qnn.py | 71 +-
python/tvm/relay/qnn/op/legalizations.py | 140 +-
python/tvm/relay/qnn/op/qnn.py | 66 +-
python/tvm/relay/qnn/strategy/arm_cpu.py | 73 +-
python/tvm/relay/qnn/strategy/generic.py | 6 +
python/tvm/relay/qnn/strategy/hexagon.py | 18 +
python/tvm/relay/testing/py_converter.py | 83 +-
python/tvm/relay/transform/infer_layout_utils.py | 0
python/tvm/relay/transform/mixed_precision.py | 2 +
python/tvm/runtime/__init__.py | 8 +-
python/tvm/runtime/_ffi_node_api.py | 5 +
python/tvm/runtime/executor/aot_executor.py | 19 +
python/tvm/runtime/object.py | 19 +-
python/tvm/runtime/object_generic.py | 15 +-
python/tvm/runtime/object_path.py | 2 +
python/tvm/runtime/params.py | 49 +-
python/tvm/runtime/script_printer.py | 282 +++
python/tvm/script/__init__.py | 1 -
python/tvm/script/highlight.py | 13 +-
python/tvm/script/ir_builder/tir/__init__.py | 1 +
python/tvm/script/ir_builder/tir/frame.py | 8 +-
python/tvm/script/ir_builder/tir/ir.py | 191 +-
python/tvm/script/parser/core/diagnostics.py | 5 +-
python/tvm/script/parser/core/entry.py | 2 +
python/tvm/script/parser/core/utils.py | 1 -
python/tvm/script/parser/ir/parser.py | 11 +
python/tvm/script/parser/tir/__init__.py | 12 +-
python/tvm/script/parser/tir/entry.py | 20 +-
python/tvm/script/parser/tir/operation.py | 28 +-
python/tvm/script/parser/tir/parser.py | 7 +-
python/tvm/script/printer/__init__.py | 2 -
python/tvm/script/printer/default.py | 83 -
python/tvm/script/printer/doc.py | 52 +-
python/tvm/script/printer/doc_printer.py | 19 +-
python/tvm/script/printer/printer.py | 54 -
python/tvm/{topi/x86/utils.py => target/x86.py} | 21 +-
python/tvm/te/operation.py | 14 +-
python/tvm/testing/aot.py | 74 +-
python/tvm/testing/utils.py | 33 +-
python/tvm/tir/__init__.py | 1 +
python/tvm/tir/buffer.py | 16 +-
python/tvm/tir/expr.py | 17 +-
python/tvm/tir/function.py | 52 +-
python/tvm/tir/op.py | 85 +
python/tvm/tir/schedule/_type_checker.py | 33 +-
python/tvm/tir/schedule/schedule.py | 345 +++-
python/tvm/tir/schedule/trace.py | 5 +-
python/tvm/tir/stmt.py | 8 +-
python/tvm/tir/tensor_intrin/cuda.py | 208 ++-
python/tvm/tir/tensor_intrin/hexagon.py | 54 +
python/tvm/tir/tensor_intrin/x86.py | 42 +-
python/tvm/topi/__init__.py | 4 +-
python/tvm/topi/adreno/reduction.py | 103 +-
python/tvm/topi/adreno/utils.py | 19 +
python/tvm/topi/arm_cpu/__init__.py | 3 +-
python/tvm/topi/arm_cpu/conv2d_spatial_pack.py | 76 +-
.../dsp/micro_kernel/multi_channel_convolve.py | 58 +-
.../arm_cpu/mprofile/dsp/micro_kernel/tensordot.py | 9 +-
python/tvm/topi/arm_cpu/qnn.py | 358 +++-
python/tvm/topi/arm_cpu/qnn_alter_op.py | 228 ++-
python/tvm/topi/arm_cpu/qnn_legalize.py | 382 ++++
python/tvm/topi/cuda/__init__.py | 3 +-
python/tvm/topi/cuda/reduction.py | 6 +-
python/tvm/topi/cuda/scatter.py | 588 +-----
python/tvm/topi/cuda/scatter_elements.py | 314 ++++
python/tvm/topi/cuda/{stft.py => signal.py} | 96 +
python/tvm/topi/generic/search.py | 32 -
python/tvm/topi/hexagon/qnn/__init__.py | 18 +-
python/tvm/topi/hexagon/qnn/adaptive_avg_pool1d.py | 240 +--
.../tvm/topi/hexagon/qnn/dense_alter_op.py | 28 +-
python/tvm/topi/hexagon/qnn/global_avg_pool2d.py | 190 +-
python/tvm/topi/hexagon/qnn/nn.py | 218 ++-
python/tvm/topi/hexagon/qnn/qadd_qsub_qmul.py | 540 +++---
python/tvm/topi/hexagon/qnn/qdense.py | 193 ++
python/tvm/topi/hexagon/qnn/quantize.py | 160 +-
python/tvm/topi/hexagon/resize2d.py | 232 +--
python/tvm/topi/hexagon/slice_ops/__init__.py | 1 +
.../hexagon/slice_ops/add_subtract_multiply.py | 174 +-
python/tvm/topi/hexagon/slice_ops/clip.py | 0
python/tvm/topi/hexagon/slice_ops/conv2d.py | 2 +-
python/tvm/topi/hexagon/slice_ops/dense.py | 144 ++
python/tvm/topi/hexagon/slice_ops/dwconv2d.py | 2 +-
.../topi/hexagon/slice_ops/global_avg_pool2d.py | 104 +-
python/tvm/topi/hexagon/utils.py | 26 +-
python/tvm/topi/image/resize.py | 6 -
python/tvm/topi/math.py | 18 +-
python/tvm/topi/nn/__init__.py | 1 +
python/tvm/topi/nn/batch_norm.py | 46 +-
python/tvm/topi/nn/conv2d.py | 6 +-
python/tvm/topi/nn/group_norm.py | 52 +
python/tvm/topi/nn/qnn.py | 54 +-
python/tvm/topi/scatter.py | 203 +--
python/tvm/topi/scatter_add.py | 198 --
python/tvm/topi/scatter_elements.py | 173 ++
python/tvm/topi/{stft.py => signal.py} | 82 +
python/tvm/topi/testing/__init__.py | 1 +
python/tvm/topi/testing/batch_norm.py | 33 +-
python/tvm/topi/testing/group_norm_python.py | 82 +
python/tvm/topi/x86/batch_matmul.py | 25 +-
python/tvm/topi/x86/conv2d_avx_1x1.py | 7 +-
python/tvm/topi/x86/conv2d_avx_common.py | 4 +-
python/tvm/topi/x86/conv2d_int8.py | 13 +-
python/tvm/topi/x86/conv3d.py | 14 +-
python/tvm/topi/x86/dense.py | 25 +-
python/tvm/topi/x86/dense_alter_op.py | 29 +-
python/tvm/topi/x86/depthwise_conv2d.py | 15 +-
python/tvm/topi/x86/group_conv2d.py | 12 +-
python/tvm/topi/x86/sparse.py | 7 +-
python/tvm/topi/x86/tensor_intrin.py | 14 +-
python/tvm/utils/roofline/cuda.py | 2 +-
python/tvm/utils/roofline/x86.py | 8 +-
rust/tvm/src/ir/expr.rs | 2 +-
rust/tvm/src/ir/module.rs | 4 +-
src/arith/analyzer.cc | 4 +
src/arith/bound_deducer.cc | 61 +-
src/arith/canonical_simplify.cc | 12 +-
src/arith/detect_linear_equation.cc | 38 +-
src/arith/iter_affine_map.cc | 12 +-
src/arith/pattern_match.h | 106 +-
src/arith/rewrite_simplify.cc | 548 +++---
src/auto_scheduler/compute_dag.cc | 22 +-
src/auto_scheduler/search_task.cc | 8 +
src/driver/driver_api.cc | 12 +
src/ir/adt.cc | 5 +-
src/ir/attrs.cc | 6 -
src/ir/diagnostic.cc | 34 +-
src/ir/expr.cc | 11 -
src/ir/function.cc | 16 +-
src/ir/module.cc | 191 +-
src/ir/{span.cc => source_map.cc} | 75 +-
src/ir/transform.cc | 10 +-
src/ir/type.cc | 25 -
src/meta_schedule/arg_info.cc | 5 +-
src/meta_schedule/database/database.cc | 15 +
src/meta_schedule/database/database_utils.cc | 2 +
src/meta_schedule/database/json_database.cc | 19 +-
src/meta_schedule/database/memory_database.cc | 37 +-
src/meta_schedule/module_equality.cc | 2 +-
src/meta_schedule/mutator/mutate_tile_size.cc | 7 +
src/meta_schedule/mutator/mutator.cc | 2 -
src/meta_schedule/postproc/postproc.cc | 2 +-
src/meta_schedule/postproc/verify_gpu_code.cc | 1 +
.../schedule_rule/multi_level_tiling.cc | 69 +-
.../schedule_rule/multi_level_tiling.h | 12 +-
.../multi_level_tiling_tensor_core.cc | 176 +-
.../multi_level_tiling_wide_vector.cc | 15 +-
src/meta_schedule/schedule_rule/schedule_rule.cc | 51 +-
.../search_strategy/evolutionary_search.cc | 2 +-
.../space_generator/space_generator.cc | 21 +-
src/meta_schedule/task_scheduler/task_scheduler.cc | 6 +-
src/meta_schedule/trace_apply.cc | 4 +-
src/meta_schedule/utils.h | 1 -
src/node/repr_printer.cc | 35 +
src/node/script_printer.cc | 100 +
src/node/structural_equal.cc | 73 +-
src/node/structural_hash.cc | 102 +-
src/parser/source_map.cc | 97 -
src/relay/analysis/annotated_region_set.cc | 2 +-
src/relay/analysis/annotated_region_set.h | 2 +-
src/relay/analysis/graph_partitioner.cc | 334 ++++
src/relay/analysis/graph_partitioner.h | 269 +++
src/relay/analysis/kind_check.cc | 2 +-
src/relay/analysis/match_exhaustion.cc | 2 +-
src/relay/analysis/type_solver.h | 2 +-
src/relay/backend/aot_executor_codegen.cc | 10 +-
src/relay/backend/build_module.cc | 2 +-
.../backend/contrib/cmsisnn/compiler_attrs.cc | 2 +
src/relay/backend/contrib/cmsisnn/compiler_attrs.h | 4 +
src/relay/backend/contrib/cmsisnn/relay_to_tir.cc | 104 +-
src/relay/backend/contrib/cmsisnn/target.cc | 1 +
.../backend/contrib/cmsisnn/tir_to_runtime.cc | 63 +-
src/relay/backend/contrib/codegen_c/codegen_c.h | 1 +
.../backend/contrib/codegen_json/codegen_json.h | 1 +
src/relay/backend/contrib/cutlass/codegen.cc | 820 ++-------
src/relay/backend/contrib/dnnl/query_layout.cc | 758 ++++----
src/relay/backend/contrib/ethosn/codegen.cc | 11 +
src/relay/backend/contrib/ethosn/codegen_ethosn.h | 1 +
src/relay/backend/contrib/ethosu/codegen.cc | 2 +-
src/relay/backend/contrib/ethosu/compiler_attrs.cc | 2 +-
src/relay/backend/contrib/ethosu/preprocess.cc | 2 +-
src/relay/backend/contrib/uma/relay_to_tir.cc | 2 +-
src/relay/backend/te_compiler_cache.cc | 48 +-
src/relay/backend/utils.cc | 3 +-
src/relay/backend/utils.h | 2 +-
src/relay/backend/vm/compiler.cc | 4 +-
src/relay/backend/vm/compiler.h | 2 +-
src/relay/collage/partition_rule.h | 2 +-
src/relay/ir/base.cc | 12 -
src/{ => relay}/ir/error.cc | 18 +-
src/relay/ir/function.cc | 150 ++
src/relay/op/contrib/ethosu/binary_elementwise.cc | 6 +-
src/relay/op/contrib/ethosu/op_attrs.h | 14 +-
src/relay/op/contrib/ethosu/pooling.cc | 23 +-
src/relay/op/nn/nn.h | 5 +
src/relay/op/tensor/transform.cc | 160 +-
src/relay/op/tensor/transform.h | 2 +-
src/relay/op/type_relations.h | 2 +-
src/{ => relay}/parser/meta_ref.cc | 4 +-
src/{ => relay}/parser/meta_ref.h | 14 +-
src/{ => relay}/parser/op_table.h | 15 +-
src/{ => relay}/parser/parser.cc | 51 +-
src/{ => relay}/parser/span_check.cc | 6 +-
src/{ => relay}/parser/span_check.h | 11 +-
src/{ => relay}/parser/token.h | 29 +-
src/{ => relay}/parser/tokenizer.h | 33 +-
src/{ => relay}/printer/doc.cc | 4 +-
src/{ => relay}/printer/doc.h | 9 +-
src/{ => relay}/printer/meta_data.h | 13 +-
.../printer/model_library_format_printer.cc | 12 +-
src/{ => relay}/printer/relay_text_printer.cc | 11 +-
src/{ => relay}/printer/text_printer.cc | 9 +-
src/{ => relay}/printer/text_printer.h | 49 +-
src/{ => relay}/printer/tir_text_printer.cc | 28 +-
src/{ => relay}/printer/tir_text_printer_debug.cc | 4 +-
src/{ => relay}/printer/tir_text_printer_debug.h | 10 +-
src/{ => relay}/printer/tvmscript_printer.cc | 95 +-
src/relay/qnn/op/dense.cc | 105 +-
src/relay/qnn/op/requantize.cc | 4 +-
src/relay/qnn/op/requantize_config.h | 4 +-
src/relay/transforms/annotate_texture_storage.cc | 10 +-
src/relay/transforms/defunctionalization.cc | 2 +-
src/relay/transforms/eliminate_common_subexpr.cc | 28 +-
src/relay/transforms/fuse_ops.cc | 516 +-----
src/relay/transforms/merge_compiler_regions.cc | 2 +-
src/relay/transforms/partition_graph.cc | 2 +-
src/relay/transforms/simplify_expr.cc | 23 +
src/runtime/aot_executor/aot_executor.cc | 8 +
src/runtime/aot_executor/aot_executor.h | 7 +
src/runtime/c_runtime_api.cc | 2 +-
src/runtime/const_loader_module.cc | 10 +
src/runtime/container.cc | 1 +
src/runtime/contrib/clml/clml_runtime.cc | 329 ++--
src/runtime/contrib/json/json_runtime.h | 3 +
src/runtime/contrib/random/mt_random_engine.cc | 10 +-
src/runtime/contrib/tensorrt/tensorrt_ops.cc | 18 +-
src/runtime/crt/CMakeLists.txt | 51 +
src/runtime/crt/Makefile | 85 -
src/runtime/crt/aot_executor/aot_executor.c | 19 +-
.../crt/aot_executor_module/aot_executor_module.c | 24 +-
src/runtime/crt/common/crt_runtime_api.c | 8 +-
...crt_config-template.h => crt_config.h.template} | 31 +-
src/runtime/crt/host/CMakeLists.txt.template | 59 +
src/runtime/crt/host/Makefile.template | 87 -
src/runtime/crt/host/main.cc | 78 +-
src/runtime/crt/host/microtvm_api_server.py | 86 +-
src/runtime/crt/host/{main.cc => platform.cc} | 124 +-
src/runtime/crt/platform-template.c | 80 +
src/runtime/dso_library.cc | 5 +
src/runtime/file_utils.cc | 12 +
src/runtime/file_utils.h | 37 +
src/runtime/graph_executor/graph_executor.cc | 44 +-
src/runtime/graph_executor/graph_executor.h | 6 +
src/runtime/hexagon/hexagon_buffer.cc | 44 +-
src/runtime/library_module.cc | 5 -
src/runtime/micro/crt_config.h | 58 -
src/runtime/opencl/opencl_common.h | 66 +-
src/runtime/opencl/opencl_device_api.cc | 155 +-
src/runtime/opencl/opencl_module.cc | 88 +-
.../opencl/opencl_wrapper/opencl_wrapper.cc | 12 +
src/runtime/profiling.cc | 1 -
src/runtime/threading_backend.cc | 9 +-
src/runtime/vm/executable.cc | 22 +-
src/script/ir_builder/tir/ir.cc | 75 +-
src/script/ir_builder/tir/utils.h | 1 +
src/script/printer/doc.cc | 48 +-
src/script/printer/doc_printer/base_doc_printer.cc | 106 +-
src/script/printer/doc_printer/base_doc_printer.h | 50 +-
.../printer/doc_printer/python_doc_printer.cc | 87 +-
src/script/printer/ir/ir.cc | 141 +-
src/script/printer/ir/misc.cc | 5 +-
src/script/printer/ir/utils.h | 14 +-
src/script/printer/ir_docsifier.cc | 32 +-
src/script/printer/legacy_repr.cc | 1007 +++++++++++
src/script/printer/printer.cc | 54 -
src/script/printer/tir/block.cc | 119 +-
src/script/printer/tir/buffer.cc | 176 +-
src/script/printer/tir/expr.cc | 361 ++--
src/script/printer/tir/for_loop.cc | 59 +-
src/script/printer/tir/function.cc | 150 +-
src/script/printer/tir/ir.cc | 71 +-
src/script/printer/tir/stmt.cc | 221 ++-
src/script/printer/tir/utils.h | 124 +-
src/script/printer/utils.h | 173 ++
src/support/libinfo.cc | 5 +
src/target/intrin_rule.cc | 16 +
src/target/intrin_rule.h | 3 +
src/target/llvm/codegen_llvm.cc | 6 +-
src/target/llvm/codegen_llvm.h | 1 +
src/target/source/codegen_c.cc | 4 +-
src/target/source/codegen_c.h | 6 +-
src/target/source/codegen_cuda.cc | 43 +-
src/target/source/codegen_metal.cc | 2 +-
src/target/source/codegen_source_base.cc | 3 +-
src/target/source/codegen_webgpu.cc | 555 ++++++
src/target/source/codegen_webgpu.h | 92 +
src/target/source/interface_c.cc | 3 +-
src/target/source/intrin_rule_metal.cc | 17 -
src/target/source/intrin_rule_webgpu.cc | 118 ++
src/target/source/ptx.cc | 31 +
src/target/source/ptx.h | 16 +
src/target/source/source_module.cc | 19 +-
src/target/spirv/build_vulkan.cc | 16 +-
src/target/spirv/intrin_rule_spirv.cc | 34 -
src/target/spirv/ir_builder.h | 18 +-
src/target/tag.cc | 15 +
src/target/virtual_device.cc | 12 +-
src/te/operation/create_primfunc.cc | 1 +
src/tir/analysis/block_access_region_detector.cc | 9 +-
src/tir/analysis/control_flow_graph.cc | 4 +-
src/tir/analysis/control_flow_graph.h | 9 +-
src/tir/analysis/deep_equal.cc | 1 +
src/tir/analysis/identify_memcpy.cc | 316 ++++
src/tir/analysis/oob_checker.cc | 1 -
src/tir/analysis/stmt_finding.cc | 39 +-
src/tir/analysis/var_use_def_analysis.cc | 176 ++
src/tir/analysis/var_use_def_analysis.h | 89 +
src/tir/analysis/verify_memory.cc | 2 +-
src/tir/ir/data_type_rewriter.cc | 5 +-
src/tir/ir/expr.cc | 13 +-
src/tir/ir/index_map.cc | 98 +-
src/tir/ir/legacy_printer.cc | 270 ---
src/tir/ir/stmt.cc | 10 +-
src/tir/op/builtin.cc | 69 +-
src/tir/op/op.cc | 42 +
src/tir/schedule/analysis/verify.cc | 2 +-
src/tir/schedule/concrete_schedule.cc | 40 +-
src/tir/schedule/concrete_schedule.h | 12 +-
src/tir/schedule/error.cc | 19 +-
src/tir/schedule/instruction.cc | 2 +
src/tir/schedule/ir_comparator.cc | 10 +-
src/tir/schedule/ir_comparator.h | 7 +-
src/tir/schedule/primitive.h | 45 +-
src/tir/schedule/primitive/cache_index.cc | 147 +-
src/tir/schedule/primitive/cache_read_write.cc | 724 +++++++-
src/tir/schedule/primitive/compute_inline.cc | 20 +-
.../schedule/primitive/layout_transformation.cc | 116 +-
src/tir/schedule/primitive/sampling.cc | 17 +-
src/tir/schedule/schedule.cc | 8 +-
src/tir/schedule/trace.cc | 54 +-
src/tir/schedule/traced_schedule.cc | 50 +-
src/tir/schedule/traced_schedule.h | 10 +-
src/tir/schedule/utils.h | 20 +-
src/tir/transforms/common_subexpr_elim.cc | 4 +-
src/tir/transforms/common_subexpr_elim_tools.cc | 4 +-
src/tir/transforms/inject_ptx_async_copy.cc | 156 +-
src/tir/transforms/inject_ptx_ldg32.cc | 130 ++
src/tir/transforms/install_debug_spans.cc | 6 +-
src/tir/transforms/lower_cross_thread_reduction.cc | 8 +-
src/tir/transforms/lower_intrin.cc | 10 +-
src/tir/transforms/narrow_datatype.cc | 1 -
.../plan_update_buffer_allocation_location.cc | 7 +
src/tir/transforms/primfunc_utils.cc | 4 +
src/tir/transforms/remove_no_op.cc | 5 +
src/tir/transforms/split_host_device.cc | 229 +--
src/tir/transforms/unroll_loop.cc | 64 +-
src/tir/usmp/transform/assign_pool_info.cc | 4 +-
src/topi/nn.cc | 7 +
tests/cpp-runtime/opencl/opencl_compile_to_bin.cc | 208 +++
tests/cpp-runtime/opencl/opencl_timer_test.cc | 11 +-
tests/cpp/relay/backend/aot/aot_lower_main_test.cc | 4 +-
.../cpp/relay/collage/candidate_partition_test.cc | 4 +-
tests/cpp/relay/collage/partition_rule_test.cc | 4 +-
tests/cpp/relay/df_pattern_rewrite_test.cc | 4 +-
tests/cpp/relay/ir/indexed_graph_test.cc | 6 +-
tests/cpp/relay/transforms/device_domains_test.cc | 4 +-
tests/cpp/relay/with_fields_test.cc | 6 +-
tests/lint/check_file_type.py | 3 +-
tests/lint/flake8.sh | 2 +-
tests/lint/pylint.sh | 1 +
tests/micro/arduino/test_arduino_rpc_server.py | 1 -
tests/micro/arduino/test_arduino_workflow.py | 17 +-
tests/micro/arduino/testdata/project.ino | 5 +-
tests/micro/common/test_autotune.py | 3 +
tests/micro/common/test_mlperftiny.py | 301 +++
tests/micro/common/test_tvmc.py | 138 +-
.../test_arduino_microtvm_api_server.py | 200 --
tests/micro/project_api/test_project_api.py | 33 +-
tests/micro/stm32/test_code_emitter.py | 5 +-
tests/micro/zephyr/test_ms_tuning.py | 22 +-
tests/micro/zephyr/test_zephyr.py | 160 +-
tests/micro/zephyr/test_zephyr_aot_exec.py | 16 +-
.../zephyr/test_zephyr_aot_exec_standalone.py | 63 +-
tests/micro/zephyr/test_zephyr_armv7m.py | 21 +-
tests/micro/zephyr/utils.py | 88 +-
.../test_runtime_packed_func.py | 13 +
tests/python/ci/test_script_converter.py | 21 +-
.../contrib/test_arm_compute_lib/test_conv2d.py | 395 ++--
.../contrib/test_arm_compute_lib/test_dense.py | 6 +-
tests/python/contrib/test_clml/test_ops.py | 102 ++
.../python/contrib/test_cmsisnn/test_binary_ops.py | 127 +-
tests/python/contrib/test_cmsisnn/test_conv2d.py | 1 +
.../contrib/test_cmsisnn/test_invalid_graphs.py | 4 +-
.../python/contrib/test_cmsisnn/test_last_error.py | 164 ++
tests/python/contrib/test_cmsisnn/utils.py | 6 +-
tests/python/contrib/test_cutlass.py | 166 +-
tests/python/contrib/test_dnnl.py | 0
tests/python/contrib/test_ethosn/test_addition.py | 163 +-
tests/python/contrib/test_ethosn/test_networks.py | 2 +-
.../test_ethosu_binary_elementwise_matcher.py | 3 +
tests/python/contrib/test_ethosu/infra.py | 9 +-
tests/python/contrib/test_ethosu/test_codegen.py | 137 ++
.../test_ethosu/test_copy_compute_reordering.py | 230 +--
.../contrib/test_ethosu/test_create_tiles.py | 8 +-
.../contrib/test_ethosu/test_encode_constants.py | 170 +-
.../contrib/test_ethosu/test_hoist_allocates.py | 80 +-
tests/python/contrib/test_ethosu/test_legalize.py | 206 ++-
.../contrib/test_ethosu/test_lut_optimizer.py | 42 +
.../contrib/test_ethosu/test_merge_constants.py | 324 ++--
.../test_ethosu/test_outline_compiler_functions.py | 4 +-
.../test_ethosu/test_remove_concatenates.py | 33 +-
.../test_ethosu/test_replace_binary_elementwise.py | 2 +
.../contrib/test_ethosu/test_replace_conv2d.py | 165 +-
.../contrib/test_ethosu/test_replace_copy.py | 37 +-
.../contrib/test_ethosu/test_replace_pooling.py | 64 +-
tests/python/contrib/test_ethosu/test_scheduler.py | 20 +-
.../test_ethosu/test_tir_to_cs_translator.py | 106 +-
tests/python/contrib/test_gemm_acc32_vnni.py | 160 +-
.../python/contrib/test_hexagon/infrastructure.py | 6 +
.../metaschedule_e2e/test_resnet50_int8.py | 2 +-
.../test_hexagon/test_2d_physical_buffers.py | 0
.../test_hexagon/test_async_dma_pipeline.py | 22 +-
.../contrib/test_hexagon/test_meta_schedule.py | 6 +-
.../test_hexagon/test_software_pipeline_async.py | 8 +-
tests/python/contrib/test_hexagon/test_vtcm.py | 2 +-
.../contrib/test_hexagon/test_vtcm_bandwidth.py | 46 +-
.../test_hexagon/test_wo_qnn_canonicalization.py | 172 +-
.../test_hexagon/topi/slice_op/test_dense_slice.py | 282 +++
.../topi/slice_op/test_global_avg_pool2d.py | 334 ++--
.../test_hexagon/topi/test_adaptive_avg_pool1d.py | 370 ++--
.../topi/test_add_subtract_multiply.py | 822 ++++-----
.../contrib/test_hexagon/topi/test_quantize.py | 256 +--
.../contrib/test_hexagon/topi/test_resize2d.py | 404 ++---
tests/python/contrib/test_random.py | 28 +
tests/python/contrib/test_tensorrt.py | 17 +-
tests/python/contrib/test_uma/test_partition.py | 7 +-
tests/python/driver/tvmc/test_compiler.py | 25 +-
tests/python/driver/tvmc/test_target_options.py | 23 +
tests/python/frontend/darknet/test_forward.py | 4 +-
tests/python/frontend/keras/test_forward.py | 11 +
tests/python/frontend/onnx/test_forward.py | 831 ++++++++-
tests/python/frontend/paddlepaddle/test_forward.py | 314 +++-
tests/python/frontend/pytorch/qnn_test.py | 63 +-
tests/python/frontend/pytorch/test_forward.py | 440 ++++-
tests/python/frontend/pytorch/test_fx_quant.py | 7 +-
tests/python/frontend/pytorch/test_lstm.py | 6 +-
.../frontend/pytorch/test_object_detection.py | 6 +-
tests/python/frontend/pytorch/test_rnns.py | 16 +-
tests/python/frontend/tensorflow/test_forward.py | 25 +-
tests/python/frontend/tflite/test_forward.py | 175 +-
tests/python/integration/test_auto_tensorize.py | 139 +-
tests/python/integration/test_legacy_tuning.py | 2 +-
tests/python/integration/test_lower.py | 12 +-
tests/python/relay/aot/corstone300.mk | 37 +-
.../relay/aot/test_aot_create_executor_metadata.py | 2 +-
tests/python/relay/aot/test_cpp_aot.py | 8 +-
tests/python/relay/aot/test_crt_aot.py | 5 +-
tests/python/relay/aot/test_crt_aot_usmp.py | 37 +-
tests/python/relay/aot/test_pass_aot_lower_main.py | 34 +-
tests/python/relay/backend/test_pass_lower_te.py | 8 +-
.../test_pass_remove_standalone_reshapes.py | 6 +-
tests/python/relay/collage/menangerie.py | 26 +-
tests/python/relay/collage/test_sub_graph.py | 36 +-
tests/python/relay/dyn/test_dynamic_op_level3.py | 6 +-
.../opencl_texture/test_conv2d_nhwc_texture.py | 47 +
.../relay/opencl_texture/test_reduction_texture.py | 126 ++
tests/python/relay/qnn/test_clip_legalization.py | 87 +
.../python/relay/qnn/test_qnn_channel_stripping.py | 299 +++
.../strategy/arm_cpu/test_quantized_convolution.py | 71 +-
tests/python/relay/test_any.py | 14 +
tests/python/relay/test_backend_graph_executor.py | 2 +-
tests/python/relay/test_dataflow_pattern.py | 4 +-
tests/python/relay/test_ir_parser.py | 30 +-
tests/python/relay/test_ir_text_printer.py | 6 +-
tests/python/relay/test_op_floordiv.py | 117 ++
tests/python/relay/test_op_grad_level2.py | 2 +-
tests/python/relay/test_op_level1.py | 26 +-
tests/python/relay/test_op_level10.py | 49 +-
tests/python/relay/test_op_level2.py | 24 +-
tests/python/relay/test_op_level3.py | 25 +-
tests/python/relay/test_pass_auto_quantize.py | 12 +-
tests/python/relay/test_pass_collage_partition.py | 40 +-
.../relay/test_pass_dead_code_elimination.py | 12 +-
.../python/relay/test_pass_defunctionalization.py | 6 +-
.../relay/test_pass_eliminate_common_subexpr.py | 33 +-
tests/python/relay/test_pass_eta_expand.py | 8 +-
tests/python/relay/test_pass_manifest_lifetimes.py | 4 +-
tests/python/relay/test_pass_plan_devices.py | 128 +-
tests/python/relay/test_pass_qnn_legalize.py | 118 +-
tests/python/relay/test_pass_unmatched_cases.py | 6 +-
tests/python/relay/test_py_converter.py | 65 +-
tests/python/relay/test_target_hooks.py | 2 +-
tests/python/relay/test_type_infer.py | 2 +-
tests/python/relay/test_vm.py | 4 +-
.../test_capture_postdfsindex_in_spans.py | 2 +-
.../transform/test_compiler_function_utils.py | 10 +-
tests/python/relay/utils/assert_diagnostic.py | 10 +-
tests/python/topi/python/test_topi_batch_norm.py | 46 +-
tests/python/topi/python/test_topi_clip.py | 16 +-
.../topi/python/test_topi_conv2d_tensordot_opts.py | 6 +-
tests/python/topi/python/test_topi_dft.py | 88 +
..._topi_layer_norm.py => test_topi_group_norm.py} | 30 +-
tests/python/topi/python/test_topi_layer_norm.py | 2 +-
tests/python/topi/python/test_topi_loss.py | 11 +-
tests/python/topi/python/test_topi_pooling.py | 55 +-
tests/python/topi/python/test_topi_scatter.py | 18 +-
.../unittest/test_aot_legalize_packed_call.py | 32 +-
.../unittest/test_arith_canonical_simplify.py | 14 +
tests/python/unittest/test_arith_deduce_bound.py | 58 +-
.../unittest/test_arith_detect_clip_bound.py | 13 +
tests/python/unittest/test_arith_domain_touched.py | 4 +-
tests/python/unittest/test_arith_intset.py | 11 +-
.../python/unittest/test_arith_rewrite_simplify.py | 1909 ++++++++++----------
..._dispatch_context.py => test_arith_simplify.py} | 27 +-
.../python/unittest/test_auto_scheduler_feature.py | 18 +-
.../unittest/test_autotvm_dispatch_context.py | 16 +
tests/python/unittest/test_crt.py | 222 +--
...ir_ptx_cp_async.py => test_inject_ptx_ldg32.py} | 56 +-
tests/python/unittest/test_link_params.py | 4 +-
tests/python/unittest/test_lower_build.py | 24 +-
...on.py => test_meta_schedule_cpu_dot_product.py} | 62 +-
.../python/unittest/test_meta_schedule_database.py | 59 +-
...schedule_feature_extractor_per_store_feature.py | 12 +-
.../test_meta_schedule_mutator_mutate_tile_size.py | 13 +
..._schedule_postproc_rewrite_cooperative_fetch.py | 6 +-
.../test_meta_schedule_postproc_rewrite_layout.py | 48 +-
...e_postproc_rewrite_parallel_vectorize_unroll.py | 14 +-
...ta_schedule_postproc_rewrite_reduction_block.py | 2 +-
...est_meta_schedule_postproc_rewrite_tensorize.py | 26 +-
...meta_schedule_postproc_rewrite_unbound_block.py | 16 +-
.../test_meta_schedule_postproc_verify_gpu_code.py | 24 +-
...est_meta_schedule_postproc_verify_vtcm_limit.py | 2 +-
.../test_meta_schedule_relay_integration.py | 115 +-
tests/python/unittest/test_meta_schedule_runner.py | 41 +-
...test_meta_schedule_schedule_rule_add_rfactor.py | 50 +-
.../test_meta_schedule_schedule_rule_auto_bind.py | 28 +-
...test_meta_schedule_schedule_rule_auto_inline.py | 20 +-
...chedule_schedule_rule_cross_thread_reduction.py | 72 +-
.../test_meta_schedule_schedule_rule_mlt.py | 177 +-
.../test_meta_schedule_schedule_rule_mlt_intrin.py | 35 +-
.../test_meta_schedule_schedule_rule_mlt_tc.py | 846 ++++-----
...dule_schedule_rule_parallel_vectorize_unroll.py | 8 +-
...hedule_schedule_rule_random_compute_location.py | 4 +-
.../unittest/test_meta_schedule_space_cpu.py | 90 +-
.../test_meta_schedule_space_cpu_winograd.py | 2 +-
.../unittest/test_meta_schedule_space_cuda.py | 42 +-
.../test_meta_schedule_space_cuda_async.py | 340 ++++
...ule_space_cuda_async_multiple_initialization.py | 88 +
.../test_meta_schedule_space_cuda_winograd.py | 10 +-
.../unittest/test_meta_schedule_trace_apply.py | 116 +-
.../unittest/test_micro_model_library_format.py | 40 +-
tests/python/unittest/test_micro_ms_tuning.py | 9 +-
tests/python/unittest/test_runtime_graph.py | 16 +-
tests/python/unittest/test_slice_tir.py | 2 +-
tests/python/unittest/test_target_codegen_llvm.py | 2 +-
.../python/unittest/test_target_codegen_vulkan.py | 2 +-
tests/python/unittest/test_target_codegen_x86.py | 2 +-
tests/python/unittest/test_te_create_primfunc.py | 110 +-
tests/python/unittest/test_te_schedule.py | 30 +-
...test_tir_analysis_calculate_allocated_memory.py | 4 +-
.../test_tir_analysis_detect_buffer_access_lca.py | 2 +-
.../test_tir_analysis_estimate_tir_flops.py | 4 +-
.../unittest/test_tir_analysis_identify_memcpy.py | 324 ++++
tests/python/unittest/test_tir_analysis_oob.py | 12 +-
.../test_tir_analysis_verify_well_formed.py | 8 +-
tests/python/unittest/test_tir_buffer.py | 2 +-
tests/python/unittest/test_tir_constructor.py | 3 +-
tests/python/unittest/test_tir_host_func.py | 79 +
tests/python/unittest/test_tir_imm_values.py | 8 +-
tests/python/unittest/test_tir_intrin.py | 10 +-
.../python/unittest/test_tir_lower_match_buffer.py | 26 +-
tests/python/unittest/test_tir_nodes.py | 180 +-
tests/python/unittest/test_tir_op_types.py | 13 +
tests/python/unittest/test_tir_ptx_cp_async.py | 2 +-
tests/python/unittest/test_tir_ptx_ldmatrix.py | 2 +-
tests/python/unittest/test_tir_renew_defs.py | 12 +-
.../python/unittest/test_tir_schedule_analysis.py | 33 +-
.../python/unittest/test_tir_schedule_blockize.py | 44 +-
.../unittest/test_tir_schedule_cache_index.py | 404 ++++-
.../unittest/test_tir_schedule_cache_read_write.py | 221 ++-
.../unittest/test_tir_schedule_compute_at.py | 62 +-
.../unittest/test_tir_schedule_compute_inline.py | 144 +-
.../test_tir_schedule_decompose_padding.py | 21 +-
.../python/unittest/test_tir_schedule_for_kind.py | 30 +-
.../unittest/test_tir_schedule_pad_einsum.py | 12 +-
tests/python/unittest/test_tir_schedule_reindex.py | 54 +-
tests/python/unittest/test_tir_schedule_reorder.py | 12 +-
tests/python/unittest/test_tir_schedule_rfactor.py | 164 +-
.../unittest/test_tir_schedule_rolling_buffer.py | 26 +-
.../python/unittest/test_tir_schedule_sampling.py | 6 +-
.../test_tir_schedule_set_axis_separator.py | 10 +-
.../python/unittest/test_tir_schedule_set_scope.py | 8 +-
.../unittest/test_tir_schedule_split_fuse.py | 24 +-
.../test_tir_schedule_state_cached_flags.py | 6 +-
.../python/unittest/test_tir_schedule_tensorize.py | 86 +-
tests/python/unittest/test_tir_schedule_trace.py | 27 +-
.../python/unittest/test_tir_schedule_transform.py | 62 +-
.../unittest/test_tir_schedule_transform_layout.py | 176 +-
.../python/unittest/test_tir_schedule_utilities.py | 10 +-
tests/python/unittest/test_tir_specialize.py | 18 +-
.../python/unittest/test_tir_te_extern_primfunc.py | 24 +-
.../test_tir_transform_common_subexpr_elim.py | 15 +-
.../test_tir_transform_compact_buffer_region.py | 42 +-
.../test_tir_transform_convert_blocks_to_opaque.py | 2 +-
.../test_tir_transform_extract_constants.py | 6 +-
.../unittest/test_tir_transform_flatten_buffer.py | 76 +-
.../python/unittest/test_tir_transform_helpers.py | 6 +-
.../test_tir_transform_hoist_expression.py | 83 +-
.../test_tir_transform_inject_ptx_async_copy.py | 230 ++-
.../test_tir_transform_inject_rolling_buffer.py | 16 +-
.../test_tir_transform_inject_software_pipeline.py | 88 +-
.../test_tir_transform_inject_virtual_thread.py | 16 +-
.../unittest/test_tir_transform_loop_partition.py | 74 +-
...t_tir_transform_lower_cross_thread_reduction.py | 78 +-
.../test_tir_transform_lower_opaque_block.py | 10 +-
.../test_tir_transform_lower_warp_memory.py | 9 +-
.../unittest/test_tir_transform_make_packed_api.py | 16 +-
...transform_manifest_shared_memory_local_stage.py | 4 +-
.../unittest/test_tir_transform_narrow_datatype.py | 10 +-
...sform_plan_update_buffer_allocation_location.py | 26 +-
...ansform_reduce_branching_through_overcompute.py | 26 +-
.../unittest/test_tir_transform_remove_assume.py | 8 +-
.../unittest/test_tir_transform_remove_no_op.py | 106 +-
.../unittest/test_tir_transform_remove_undef.py | 18 +-
...transform_remove_weight_layout_rewrite_block.py | 12 +-
...test_tir_transform_renormalize_split_pattern.py | 24 +-
.../python/unittest/test_tir_transform_simplify.py | 248 +--
.../unittest/test_tir_transform_storage_flatten.py | 2 +-
.../unittest/test_tir_transform_storage_rewrite.py | 34 +-
.../unittest/test_tir_transform_thread_sync.py | 8 +-
.../test_tir_transform_unify_thread_binding.py | 12 +-
.../unittest/test_tir_transform_unroll_loop.py | 42 +
...ransform_convert_pool_allocations_to_offsets.py | 116 +-
tests/python/unittest/test_transform_layout.py | 0
.../unittest/test_tvm_testing_before_after.py | 10 +-
tests/python/unittest/test_tvmscript_complete.py | 2 +-
.../python/unittest/test_tvmscript_error_report.py | 26 +-
.../unittest/test_tvmscript_ir_builder_tir.py | 50 +-
tests/python/unittest/test_tvmscript_ops.py | 10 +-
.../unittest/test_tvmscript_parser_source.py | 15 +
tests/python/unittest/test_tvmscript_parser_tir.py | 6 +-
.../unittest/test_tvmscript_printer_annotation.py | 86 +
.../python/unittest/test_tvmscript_printer_doc.py | 28 +
.../unittest/test_tvmscript_printer_highlight.py | 29 +
tests/python/unittest/test_tvmscript_printer_ir.py | 52 +
...ssume.py => test_tvmscript_printer_metadata.py} | 62 +-
.../test_tvmscript_printer_python_doc_printer.py | 62 +-
.../test_tvmscript_printer_structural_equal.py | 148 ++
.../python/unittest/test_tvmscript_printer_tir.py | 353 +++-
.../unittest/test_tvmscript_printer_underlining.py | 211 ++-
tests/python/unittest/test_tvmscript_regression.py | 23 +-
tests/python/unittest/test_tvmscript_roundtrip.py | 342 ++--
.../python/unittest/test_tvmscript_syntax_sugar.py | 50 +-
tests/scripts/request_hook/request_hook.py | 16 +-
tests/scripts/setup-pytest-env.sh | 2 +
tests/scripts/task_build_adreno_bins.sh | 3 +-
tests/scripts/task_config_build_arm.sh | 2 +-
tests/scripts/task_config_build_cpu.sh | 2 +-
tests/scripts/task_config_build_hexagon.sh | 5 +-
tests/scripts/task_config_build_minimal.sh | 2 +-
...l.sh => task_config_build_minimal_cross_isa.sh} | 21 +-
...uild_minimal.sh => task_config_build_static.sh} | 20 +-
tests/scripts/task_demo_microtvm.sh | 7 +-
tests/scripts/task_python_microtvm.sh | 7 +
web/.eslintignore | 1 +
web/.gitignore | 1 +
web/Makefile | 11 +-
web/README.md | 2 +-
web/apps/node/example.js | 2 +
web/emcc/tvmjs_support.cc | 4 +-
web/emcc/wasm_runtime.cc | 2 +-
web/emcc/webgpu_runtime.cc | 24 +-
web/src/ctypes.ts | 22 +
web/src/index.ts | 5 +-
web/src/rpc_server.ts | 19 +-
web/src/runtime.ts | 646 +++++--
web/src/webgpu.ts | 19 +-
web/tests/node/test_module_load.js | 15 +-
web/tests/node/test_ndarray.js | 16 +-
web/tests/node/{test_ndarray.js => test_object.js} | 35 +-
web/tests/node/test_packed_func.js | 59 +-
web/tests/python/webgpu_rpc_test.py | 6 +-
web/tests/python/websock_rpc_test.py | 1 -
968 files changed, 40740 insertions(+), 18441 deletions(-)
create mode 160000 3rdparty/cnpy
delete mode 100644 apps/bundle_deploy/crt_config/crt_config.h
create mode 100644 apps/cpp_clml/CMakeLists.txt
create mode 100644 apps/cpp_clml/README.md
create mode 100644 apps/cpp_clml/clml_runner.cc
create mode 100644 apps/cpp_clml/clml_runner.h
create mode 100644 apps/cpp_clml/main.cc
create mode 100644 apps/cpp_clml/scripts/clml_codegen.py
create mode 100644 apps/cpp_rtvm/CMakeLists.txt
create mode 100644 apps/cpp_rtvm/README.md
create mode 100644 apps/cpp_rtvm/main.cc
copy tests/python/unittest/test_autotvm_dispatch_context.py =>
apps/cpp_rtvm/scripts/download_models.py (59%)
create mode 100644 apps/cpp_rtvm/tvm_runner.cc
create mode 100644 apps/cpp_rtvm/tvm_runner.h
delete mode 100644
apps/microtvm/arduino/template_project/crt_config/crt_config.h
copy docker/install/ubuntu_install_paddle.sh =>
apps/microtvm/arduino/template_project/launch_microtvm_api_server.sh (55%)
rename apps/microtvm/arduino/template_project/src/example_project/{model.c =>
platform.c} (80%)
rename apps/microtvm/arduino/template_project/src/example_project/{model.h =>
platform.h} (94%)
rename apps/microtvm/arduino/template_project/src/host_driven/{model_support.c
=> platform.c} (85%)
delete mode 100644 apps/microtvm/cmsisnn/include/crt_config.h
delete mode 100644 apps/microtvm/ethosu/include/crt_config.h
delete mode 100644
apps/microtvm/zephyr/template_project/crt_config/crt_config.h
copy docker/install/ubuntu_install_paddle.sh =>
apps/microtvm/zephyr/template_project/launch_microtvm_api_server.sh (67%)
create mode 100644
apps/microtvm/zephyr/template_project/src/aot_standalone_demo/platform.c
delete mode 100644
apps/microtvm/zephyr/template_project/src/aot_standalone_demo/zephyr_uart.c
create mode 100644
apps/microtvm/zephyr/template_project/src/host_driven/platform.c
rename
apps/microtvm/zephyr/template_project/src/{aot_standalone_demo/zephyr_uart.h =>
mlperftiny/platform.cc} (50%)
delete mode 100644
apps/microtvm/zephyr/template_project/src/mlperftiny/tvmruntime.cc
delete mode 100644
apps/microtvm/zephyr/template_project/src/mlperftiny/tvmruntime.h
delete mode 100644
apps/microtvm/zephyr/template_project/src/mlperftiny/zephyr_uart.cc
delete mode 100644
apps/microtvm/zephyr/template_project/src/mlperftiny/zephyr_uart.h
delete mode 100644 apps/microtvm/zephyr_cmsisnn/include/crt_config.h
copy ci/jenkins/generated/{minimal_jenkinsfile.groovy =>
minimal_cross_isa_jenkinsfile.groovy} (93%)
copy ci/jenkins/templates/{minimal_jenkinsfile.groovy.j2 =>
minimal_cross_isa_jenkinsfile.groovy.j2} (68%)
delete mode 100644 cmake/modules/Arduino.cmake
copy docker/install/ubuntu_install_paddle.sh => cmake/utils/CRTConfig.cmake
(53%)
mode change 100755 => 100644
create mode 100644 docker/install/ubuntu_install_llvm_from_source.sh
create mode 100644 gallery/how_to/work_with_microtvm/micro_mlperftiny.py
delete mode 100644 gallery/how_to/work_with_microtvm/micro_reference_vm.py
mode change 100755 => 100644
gallery/how_to/work_with_relay/using_pipeline_executor.py
rename include/tvm/ir/{span.h => source_map.h} (59%)
create mode 100644 include/tvm/node/script_printer.h
delete mode 100644 include/tvm/parser/source_map.h
rename include/tvm/{ir => relay}/error.h (96%)
rename include/tvm/{parser => relay}/parser.h (86%)
delete mode 100644 include/tvm/script/printer/printer.h
create mode 100644 include/tvm/topi/nn/group_norm.h
mode change 100755 => 100644 python/gen_requirements.py
mode change 100755 => 100644 python/tvm/auto_scheduler/compute_dag.py
copy python/tvm/{parser => contrib/cutlass}/_ffi_api.py (90%)
mode change 100755 => 100644 python/tvm/contrib/hexagon/hexagon_profiler.py
mode change 100755 => 100644 python/tvm/micro/contrib/stm32/__init__.py
create mode 100644 python/tvm/parser.py
rename python/tvm/{parser/_ffi_api.py => relay/_ffi_api_parser.py} (91%)
mode change 100755 => 100644 python/tvm/relay/frontend/common.py
mode change 100644 => 100755 python/tvm/relay/frontend/paddlepaddle.py
rename python/tvm/{parser/__init__.py => relay/parser.py} (71%)
mode change 100755 => 100644 python/tvm/relay/transform/infer_layout_utils.py
create mode 100644 python/tvm/runtime/script_printer.py
delete mode 100644 python/tvm/script/printer/default.py
delete mode 100644 python/tvm/script/printer/printer.py
rename python/tvm/{topi/x86/utils.py => target/x86.py} (85%)
create mode 100644 python/tvm/topi/arm_cpu/qnn_legalize.py
create mode 100644 python/tvm/topi/cuda/scatter_elements.py
rename python/tvm/topi/cuda/{stft.py => signal.py} (60%)
mode change 100755 => 100644 python/tvm/topi/hexagon/qnn/adaptive_avg_pool1d.py
copy tests/python/unittest/test_autotvm_dispatch_context.py =>
python/tvm/topi/hexagon/qnn/dense_alter_op.py (59%)
mode change 100755 => 100644 python/tvm/topi/hexagon/qnn/global_avg_pool2d.py
mode change 100755 => 100644 python/tvm/topi/hexagon/qnn/qadd_qsub_qmul.py
create mode 100644 python/tvm/topi/hexagon/qnn/qdense.py
mode change 100755 => 100644 python/tvm/topi/hexagon/qnn/quantize.py
mode change 100755 => 100644 python/tvm/topi/hexagon/resize2d.py
mode change 100755 => 100644
python/tvm/topi/hexagon/slice_ops/add_subtract_multiply.py
mode change 100755 => 100644 python/tvm/topi/hexagon/slice_ops/clip.py
create mode 100644 python/tvm/topi/hexagon/slice_ops/dense.py
mode change 100755 => 100644
python/tvm/topi/hexagon/slice_ops/global_avg_pool2d.py
create mode 100644 python/tvm/topi/nn/group_norm.py
delete mode 100644 python/tvm/topi/scatter_add.py
create mode 100644 python/tvm/topi/scatter_elements.py
rename python/tvm/topi/{stft.py => signal.py} (62%)
create mode 100644 python/tvm/topi/testing/group_norm_python.py
rename src/ir/{span.cc => source_map.cc} (61%)
create mode 100644 src/node/script_printer.cc
delete mode 100644 src/parser/source_map.cc
create mode 100644 src/relay/analysis/graph_partitioner.cc
create mode 100644 src/relay/analysis/graph_partitioner.h
rename src/{ => relay}/ir/error.cc (92%)
rename src/{ => relay}/parser/meta_ref.cc (98%)
rename src/{ => relay}/parser/meta_ref.h (92%)
rename src/{ => relay}/parser/op_table.h (93%)
rename src/{ => relay}/parser/parser.cc (99%)
rename src/{ => relay}/parser/span_check.cc (96%)
rename src/{ => relay}/parser/span_check.h (93%)
rename src/{ => relay}/parser/token.h (93%)
rename src/{ => relay}/parser/tokenizer.h (96%)
rename src/{ => relay}/printer/doc.cc (98%)
rename src/{ => relay}/printer/doc.h (97%)
rename src/{ => relay}/printer/meta_data.h (95%)
rename src/{ => relay}/printer/model_library_format_printer.cc (93%)
rename src/{ => relay}/printer/relay_text_printer.cc (99%)
rename src/{ => relay}/printer/text_printer.cc (95%)
rename src/{ => relay}/printer/text_printer.h (95%)
rename src/{ => relay}/printer/tir_text_printer.cc (97%)
rename src/{ => relay}/printer/tir_text_printer_debug.cc (98%)
rename src/{ => relay}/printer/tir_text_printer_debug.h (90%)
rename src/{ => relay}/printer/tvmscript_printer.cc (96%)
create mode 100644 src/runtime/crt/CMakeLists.txt
delete mode 100644 src/runtime/crt/Makefile
rename src/runtime/crt/{crt_config-template.h => crt_config.h.template} (65%)
create mode 100644 src/runtime/crt/host/CMakeLists.txt.template
delete mode 100644 src/runtime/crt/host/Makefile.template
copy src/runtime/crt/host/{main.cc => platform.cc} (56%)
create mode 100644 src/runtime/crt/platform-template.c
delete mode 100644 src/runtime/micro/crt_config.h
create mode 100644 src/script/printer/legacy_repr.cc
delete mode 100644 src/script/printer/printer.cc
create mode 100644 src/script/printer/utils.h
create mode 100644 src/target/source/codegen_webgpu.cc
create mode 100644 src/target/source/codegen_webgpu.h
create mode 100644 src/target/source/intrin_rule_webgpu.cc
create mode 100644 src/tir/analysis/identify_memcpy.cc
create mode 100644 src/tir/analysis/var_use_def_analysis.cc
create mode 100644 src/tir/analysis/var_use_def_analysis.h
delete mode 100644 src/tir/ir/legacy_printer.cc
create mode 100644 src/tir/transforms/inject_ptx_ldg32.cc
create mode 100644 tests/cpp-runtime/opencl/opencl_compile_to_bin.cc
create mode 100644 tests/micro/common/test_mlperftiny.py
delete mode 100644 tests/micro/project_api/test_arduino_microtvm_api_server.py
create mode 100644 tests/python/contrib/test_cmsisnn/test_last_error.py
mode change 100755 => 100644 tests/python/contrib/test_dnnl.py
mode change 100755 => 100644
tests/python/contrib/test_hexagon/test_2d_physical_buffers.py
create mode 100644
tests/python/contrib/test_hexagon/topi/slice_op/test_dense_slice.py
mode change 100755 => 100644
tests/python/contrib/test_hexagon/topi/slice_op/test_global_avg_pool2d.py
mode change 100755 => 100644
tests/python/contrib/test_hexagon/topi/test_adaptive_avg_pool1d.py
mode change 100644 => 100755 tests/python/frontend/paddlepaddle/test_forward.py
mode change 100755 => 100644 tests/python/frontend/pytorch/test_forward.py
mode change 100755 => 100644 tests/python/frontend/tensorflow/test_forward.py
create mode 100644 tests/python/relay/qnn/test_clip_legalization.py
create mode 100644 tests/python/relay/qnn/test_qnn_channel_stripping.py
create mode 100644 tests/python/relay/test_op_floordiv.py
create mode 100644 tests/python/topi/python/test_topi_dft.py
copy tests/python/topi/python/{test_topi_layer_norm.py =>
test_topi_group_norm.py} (62%)
copy tests/python/unittest/{test_autotvm_dispatch_context.py =>
test_arith_simplify.py} (63%)
copy tests/python/unittest/{test_tir_ptx_cp_async.py =>
test_inject_ptx_ldg32.py} (55%)
rename tests/python/unittest/{test_meta_schedule_vnni_integration.py =>
test_meta_schedule_cpu_dot_product.py} (83%)
create mode 100644 tests/python/unittest/test_meta_schedule_space_cuda_async.py
create mode 100644
tests/python/unittest/test_meta_schedule_space_cuda_async_multiple_initialization.py
create mode 100644 tests/python/unittest/test_tir_analysis_identify_memcpy.py
create mode 100644 tests/python/unittest/test_tir_host_func.py
mode change 100755 => 100644 tests/python/unittest/test_transform_layout.py
create mode 100644 tests/python/unittest/test_tvmscript_printer_annotation.py
create mode 100644 tests/python/unittest/test_tvmscript_printer_ir.py
copy tests/python/unittest/{test_tir_transform_remove_assume.py =>
test_tvmscript_printer_metadata.py} (50%)
create mode 100644
tests/python/unittest/test_tvmscript_printer_structural_equal.py
copy tests/scripts/{task_config_build_minimal.sh =>
task_config_build_minimal_cross_isa.sh} (61%)
copy tests/scripts/{task_config_build_minimal.sh =>
task_config_build_static.sh} (66%)
copy web/tests/node/{test_ndarray.js => test_object.js} (64%)