This is an automated email from the ASF dual-hosted git repository.
tqchen pushed a change to branch unity-merge-backup-04-13
in repository https://gitbox.apache.org/repos/asf/tvm.git
omit 815422cfc0 [microNPU] Add support for MEAN with uint8 ifm (#14353)
omit 606e2b738a [Frontend][Paddle] [PaddlePaddle Hackathon 4]add attribute
support for dropout/hard_sigmoid/pixel_shuffle (#14575)
omit 742c5eec74 [MetaSchedule] Handle cases when no features found by
FeatureExtractor (#14591)
omit b1ab4dc1d5 [LLVM] Validate generated LLVM module before optimization
(#14564)
omit ab93b31d0d [ARITH][TensorIR] Improve CompactBufferRegion for symbolic
shape (#14596)
omit 48d9165263 [microNPU] Fix skip tests when Vela is not present (#14587)
omit b4c1995a98 [Node] Allow alternative root names in ObjectPath::Root()
(#14569)
omit 3ef745c1cd [CI] Add JAX deps in Dockerfiles (#14550)
omit c1d1e9ffb8 [TIR] Add CUDA int4 tensor core intrinsics (#14598)
omit aee57f682f [MetaSchedule][ARM] Beautification of the function names
(#14584)
omit 17f7db16f5 [ARITH] Enhance IterMapSimplify for symbolic (#14547)
omit 1c52e633c7 [TIR][Schedule] Method returning the function being worked
on (#14593)
omit 40af75b61f [Fix][TIR] UnifyThreadBinding creating unit loop with
annotation (#14588)
omit ca7c3d8a14 [LLVM] Expand tvm::Type to DWARF conversion (#14568)
omit 7766f3c51e [Object] Implemented .as<T> for ObjectRef param, returns
Optional<T> (#14522)
omit f28fcd1239 [TensorIR] Fix ComputeAt with perfect symbolic bound
(#14592)
omit 8554e7afb4 [Bugfix] [Relay] fix a bug caused by IncompleteTypeNode in
EinsumRel while doing MergeComposite (#14556)
omit f622e7f180 [ARITH][BUGFIX] Fix a bug of iter map floormod(x,2)
simplify (#14571)
omit fb2ae1a676 [Arith] Fix solve inequality of unbound var ranges (#14582)
omit 9fb9fd6898 [TIR] Use String instead of StringImm for
AttrStmtNode::node (#14491)
omit 1db4464c08 [Fix][TIR][Analysis] Reduction block checking alloc_buffers
(#14589)
omit 1c5442d2e9 [Codegen][LLVM] Remove cast to i8* in builtin::address_of
(#14563)
omit b5c71bdab1 [Docs] Fix typo in the Vitis AI Integration docs (#14585)
omit 3a3118a754 feat: use spot instances for ci with on demand as a backup
(#14477)
omit 51dcafb4ab [microNPU][ETHOSU] Add restrictions to convert to NHCWB16
layout in LayoutOptimization pass (#14464)
omit 515583ce28 [AutoTVM] New rank-binary loss_type for the new xgboost >=
2.0.0 behaviour (#14468)
omit e5fc9f6add [skip ci][COMMUNITY] Zihao Ye -> Committer (#14578)
omit f79e4ebf30 [Bugfix][Topi] Output strides in pack_buffer() utility
(#14566)
omit a7a1980480 [CI] Update ci_cpu image and build with llvm-15 (#14466)
omit 6ef73e0cdb [TVMScript] Distinguish between void* and handle (#14488)
omit 4e07a8ed66 [TOPI] remove the i32 cast for output shape of pool (#14549)
omit 29d525bfc5 [ARITH] Enhance CanonicalSimplify to Simplify ProdDiv
(#14538)
omit c581fe3802 Update to v0.13.dev0 (#14544)
omit 0a0cbd69f5 [Target] Add Apple M1 GPU tag with 256-thread restriction
(#14539)
omit 15f9be5449 [TOPI] Expose `topi::collapse_sum` to Python and support
symbolic shape (#14541)
omit a84a2cbe07 [ARITH] Enhance CanProve to handle symbolic bound (#14523)
omit 8e9216013c [hexagon] Hexagon inference fix (#14533)
omit f990c0b09b [Target] Fix Jetson AGX Xavier CPU core count (#14508)
omit 2a23d5960b [CI] Pin sccache version to 0.3.3 (#14530)
omit 17bd178bfa [Docs] Fix MetaSchedule Docs (#14480)
omit e8cd33b601 [TIR] Update SplitHostDevice to post-process with
ConvertSSA (#14496)
omit 4d59c959aa [Target] Add A10G gpu cuda tag (#14467)
omit 7e232264b3 [LLVM] Use DataLayout::getABITypeAlign instead of
getABITypeAlignment
omit 2eeb37eef0 [Arith][Bugfix] Simplify "x - 1 < y" into "x <= y" (#14528)
omit e1b49c8cb6 [TOPI] Fix data race of batch multibox detection (#14343)
omit b98d036660 [Runtime] Runtime module property mask for Metal and Vulkan
omit 460374fed5 [TOPI] Support symbolic shape in einsum (#14521)
omit b228037a29 Expose attrs argument of "ir.IRModule" to Rust bindings
omit ff5118f398 [TVMScript] Expose IRModule::attrs as I.module_attrs
omit 11c13ace0b [TVMScript] IRModule TVMScript Parser.
omit af39b3441d [Node] Utility methods for ObjectPathPair handling (#14498)
omit 5239ec05e9 [TIR] [Schedule] Add get_output_blocks primitive (#14490)
omit 1113de2ce1 [relay] preserve the order of input_info of pytorch (#14462)
omit 28206d89d7 [TIR] Merged kDeviceThreadAxis and
kUseDynamicSharedMemoryTag (#14495)
omit 6caf08589b [Test][Topi] Avoid depending on f32 rounding behavior for
crop_and_divide tests (#13773)
omit fd7e3643fb [QNN] Convert fake quantized take to quantized op (#14506)
omit 7a73254590 [Relay] Check if the attribute "name" exists before
accessing it (#14485)
omit 287cd38651 [TIR] Improved SeqStmt::Flatten utility (#14497)
omit e51ba294d9 [ACL] Prevent offloading of per-channel quantized operators
(#14484)
omit 73ca486d2d [LLVM] Add missing `override` to GetFormat and
GetPropertyMask (#14470)
omit deb11d384e [TIR] Use IRModuleNode::Remove to remove None in
PrimFuncPass (#14494)
omit 4b6e635825 [LLVM] Add guard for #include
<llvm/Transforms/IPO/PassManagerBuilder.h> (#14469)
omit 9dcd40d61f [microNPU] Add support for ResizeNearestNeighbor with
half_pixel_centers=True (#14401)
omit 25ec6460ce [TIR] Use same DataType of builtin::tvm_struct_set in C++
and Python (#14489)
omit 579d999653 [pytest] Don't return values from test_* functions (#14475)
omit dba987cae0 [Arith] Simplifications for floormod(x, 2) (#13936)
omit f5db8b7ab5 [Bugfix] Conv1Dtranspose default kernel layout should be
IOW (#14482)
omit f8f7bc8946 [CI] Fix broken model link (#14458)
omit 44dd6445ef [TensorIR] Support for L2 prefetch async copy and
pred_guard enabled async in vectorized if_then_else (#14329)
omit 4d7e890407 [testing] Use tuples for numpy indexing (#14476)
omit 99a5734a9e [TIR] Add merge primitive for TIR schedule (#14398)
omit 76c8e66211 [Hexagon][TOPI] Use IndexMap axis separator instead of TE
(#14459)
omit 2c052b2067 [Frontend][Oneflow] Use FLOW_2_STR_DTYPE for dtype (#14454)
omit 8dea77a9ff [TOPI] dynamic externsion (#14450)
omit 6006d25f43 [TensorIR][Schedule] New primitive `reorder_block_itervar`
(#14448)
omit 66e18fbe1f [Bugfix][TVMScript] Handle LetStmt for `var1 = var2`
expressions (#14320)
add 70d80cc126 [Unity] Relax VM (#13878)
add 25e6174770 [Unity] Relax expressions and types (#13901)
add 51cf110c0c [Unity][IR] First-class StructInfo (#13907)
add 40c43108c2 [Unity][CI] Unity specific jenkins setup (do not upstream
to main) (#13910)
add e68ef58c84 [Unity] Basic StructInfo Analysis and Expr construction
(#13916)
add aa59d86ac7 [Unity] Relax BlockBuilder and ExprMutator (#13926)
add 540ba28f5c [Unity] Relax TVMScript Parser. (#13932)
add 334fd8be51 [Unity] Relax TVMScript Printer (#13944)
add 574084d340 [Unity] Relax VM codegen (#13954)
add 518a7bdd96 [Unity] Relax VM shape lowering pass (#13956)
add b3d913a737 [Unity] e2e Relax minimum build flow (#13961)
add caad7cdf48 [Unity][TVMScript] Use explicit `R.shape` in TVMScript
(#13979)
add 903628f106 [Unity] Relax op: index (#13987)
add 0319939e47 [Unity] Relax op: datatype (#13986)
add 11e5e1d6d4 [Unity] Relax op: set (#13990)
add 128042204c [Unity] Relax op: image (#13994)
add 64f281749a [Unity] Relax op: arithmetic, comparison (#13983)
add be4373a1d2 [Unity] Relax op: statistical (#13991)
add fa3cfbd60e [Unity] Relax op: neural networks (#13993)
add 45e6d0bade [Unity] Relax op: creation (#13984)
add 206762f0e6 [Unity] Relax op: linear algebra (#13988)
add 088afc5d54 [Unity] Relax op: search (#13992)
add 4430d26394 [Unity] Relax op: manipulation (#13989)
add 471ff84280 [Unity] NestedMsg Support utility (#13995)
add 604754a6c8 [Unity][Pass] Operator Fusion Passes (#14001)
add 50f616c95c [Unity][Pass] LambdaLift pass (#14012)
add 0a0166e81f [Unity][VM] Supporting "compiled" exec mode. (#14015)
add 660a1f361c [Unity][Pass] BindParams pass, FoldConstant pass (#14016)
add 30a5669b8a [Unity][Pass][TuningAPI] Introduce TuningAPI and
MetaSchedule pass (#14014)
add b792751aaa [Unity] Relay -> Relax translator (#14026)
add 554e609aca [Unity][Pass] Normalize Pass (#14031)
add 477d011093 [Unity][BlockBuilder] CallTE convert PrimValue args
(#14028)
add f61c330ec6 [Unity][Pass] Wellformed Analysis (#14032)
add 4df587aba6 [Unity][TVMScript] Move tir/relax import in script out of
__init__.py (#14033)
add d2b8ea330d [Unity][Pass] Operator legalization (#14029)
add e0c6ca0383 [Unity][Op] Add ShapeExpr Tests for Reshape Op (#14035)
add e15f59bcf0 [Unity] Initial PyTorch Frontend (#14037)
add e8ebd68e92 [Unity][Pass] Block-level static memory planning (#14038)
add 2e1d85149b [Unity] Disallow inline prim_func in relax IR (#14040)
add 60d3796e20 [Unity] Update tests to adapt to latest TVMScript syntax
(#14039)
add 7d70af3723 [Unity] Relax dataflow pattern language (matching) (#14041)
add 44b05d68bb [Unity] Statement rewriter for DataflowBlock (#14043)
add 7a71e53b69 [Unity][Pass] FuseOps FuseTIR fixes (#14044)
add e7cc2876ec [Unity][TVMScript] Overload `__neg__` for relax expr
(#14045)
add 5239022d97 [Unity][VM] Add per-op profiling support (#14053)
add 38178a3298 [Unity][BYOC] Add pattern-based partitioning pass (#14054)
add a7e936fca1 [Unity] Relax op: collapse sum (#14059)
add 7ce0a519e7 [Unity][Fix][Pass] Fix FuseOps for lack graph edges (#14058)
add 34d5944135 [Unity][Pass] Remove Unused Function (#14061)
add 2c5ccfa7a6 [Unity][BYOC] Add pass to merge composite functions to
offload large subgraphs (#14062)
add 7f5f816084 [Unity][Frontend] Annotate number of non-static input of FX
function (#14067)
add 73fe88d04a [Unity][Transform] Add LiftTransformParams pass (#14069)
add b137d22ed4 [Unity][BYOC][Pass] RunCodegen and TensorRT (#14078)
add 38f920d480 [Unity][Pass] Canonicalize Bindings (#14079)
add 2a51dcae46 [Unity] Add testcases for `expr_args_converter` (#14080)
add 026217d998 [Unity][BYOC] Add CUTLASS backend (#14081)
add 4365d1e29d [Unity][BYOC] Add DNNL backend (#14082)
add 2409be1934 [Unity][Op] `log_softmax` and `cross_entropy_with_logits`
(#14083)
add 9caab4d9d8 [Unity][Analysis] TIR pattern kind analysis for
multi-buffer write block (#14075)
add 872c9f23c6 [Unity][Fix][Pass] FoldConstant with DCE in dataflow block
(#14087)
add 6b87e35617 [Unity] Refactor Relax Build JIT UX (#14088)
add 3959707f19 [Unity][Relax] Set Shape Function to Be Host Function
(#14090)
add 79fe0a22aa [Unity] Fix typo in the comment (#14096)
add 779c54d9ad [Unity] Lower `shape_of` to a builtin (#14093)
add 3173f2855d [Unity] Relax Recursive function (#14092)
add 62a4c735fb [Unity][Layout] Add layout transformation analysis for
PrimFunc (#14066)
add 1bf8b63648 [Unity] Remove attributes of relax.print, assert and unique
(#14101)
add 30d204bb4a [Unity][BYOC]Add relax backend pattern registry (#14106)
add a1f8d72178 [Unity] Update tests again to adapt to latest TVMScript
syntax (#14115)
add 6a9caf40e7 [Unity][Fix] Fix bug in MergeCompositeFunctions (#14117)
add 3515f95ba0 [Unity][BlockBuilder] Add `name_hint` argument for `emit`
and `emit_output` (#14126)
add c1f304497e [Unity][WEB] Relax vm on web runtime (#14131)
add aaa457d304 [Unity] Add Global info (#14132)
add 837dd6c228 [Unity][BYOC] Add transposed matmul support to Relax
CUTLASS BYOC (#14128)
add a4b1ec097c [Unity][TVMScript] emit_te sugar (#14123)
add be170bb3d2 [Unity][BYOC] Assign group to unused bindings and ignroe
PrimFunc (#14139)
add 77f4c719f3 [Unity] Add callback to FuseOpsByPattern to check match
result is accepted (#14109)
add 1b6d104f68 [Unity][Legalize] Fix Scalar Constant Legalization (#14127)
add 788c12f84f [Unity][Pass] Enhance constant folding to fold relax ops by
evaluating them. (#14146)
add f17a7285bb [Unity][Debugging] AST printer (#14152)
add c78e68bf4d [Unity][Pass] Support Symbolic Shape Deduction during
BindParam (#14154)
add ab5aaa0d1b [Unity][Analysis] Checking function return struct info in
well-formed check (#14155)
add 89cff055d3 [Unity][BYOC] Use Relax legalize + CPU build for reference
in tests (#14162)
add 2ca340a30b [Unity] Add bind_constants option to FuseOpsByPattern
(#14151)
add 4fe3a6d1f4 [Unity][Analysis] Analysis for detecting recursion in Relax
(#14149)
add 2de852f203 [Unity][BYOC] Add batch matmul support to Relax CUTLASS
BYOC (#14166)
add 8de0ff78f5 [Unity][Op] Full support of Relax op `power` (#14171)
add 9797924d54 [Unity][Analysis] Restore Python bindings for var analyses
(#14180)
add 30dc2365e7 [Unity][OP] Add an operator for fused multi head attention
(#14150)
add 279317d4cd [Unity][WEBGPU] Codegen improvements and WebRuntime (#14187)
add 3f66edc8ed [Unity][Transform] LiftTransformParams handling multiple
functions (#14192)
add 283a3db675 [Unity][Op] Group normalization (#14194)
add c25659d88f [Unity][Op] Argmax and argmin (#14195)
add 2f114d28b3 [Unity][Op] Legalize `round`, `floor`, `ceil`, `sign`
(#14198)
add 012923572a [Unity][Frontend] FX translator supporting more ops (#14196)
add a115d03e76 [Unity][Frontend] FX translator returning weights with
`keep_params_as_input` (#14197)
add 27b6fa7cb4 [Unity][Fix] FX translating dtype (#14201)
add 6ab0abc08d [Unity][TIR][Pass] ForceNarrowIndexToInt32 (#14203)
add 3f4417c6db [Unity][Frontend] FX translator support torch.baddbmm
(#14202)
add 2d562f8926 [Unity] Introduce Default GPU Schedule Pass (#14182)
add 97dc05ee18 [Unity][Frontend] Attach imported model weights, deprecate
ImporterOutput (#14211)
add 033fc0e34b [Unity][Frontend] FX translator supports unwrapping unit
return tuple (#14212)
add 9f9cf2967c [Unity][WEB] Improve webgpu codegen options to skip
readonly (#14213)
add 8f2153cfb4 [Unity][Frontend] from_fx keeps parameters in order (#14214)
add 30848742cb [Unity][BYOC] Add dynamic shape support to CUTLASS matmul
(#14216)
add 73b03dadb9 [Unity][Transform] Memory plan across the IRModule (#14220)
add 780a579c72 [Unity][WEB] Simplify WebGPU Codegen per spec (#14225)
add 3345aeda4a [Unity] ensure memory.alloc_tensor/storage roundtrippable
(#14226)
add 30a9403729 [Unity] Improve implementation of FuseOps (#14229)
add 9bc1ecc672 [Unity][Transform] SimplifyNormInference (#14221)
add a6b37b80ce [Unity][Bugfix] Do not include `PrimFunc`s in the
dependency graph when checking for recursion (#14228)
add 3e03ca5abe [Unity][TVMScript] Enable Context-Aware Parsing (#14234)
add cea0161bcb [Unity][TVMScript] Fix prim_func lost issue in
relax.emit_te (#14189)
add 14a7de57b3 [Unity][WEB] Update text prompts for syntactical
correctness (#14237)
add 663c8211a4 [Unity][WEB] Improve ndarray cache (#14236)
add c7c4dee36b [Unity][Op][Tweak] Improve `StructInfo` inference for
`shape_of` (#14243)
add 633c9002e3 [Unity][Op] Add repeat, tile, conv2d_transpose, avg_pool2d
(#14238)
add 07f7fc0a9c [Unity] Allow user defined func attrs in emit_te (#14255)
add 04d6d15984 [Unity][BYOC] Add conv2d and residual block patterns for
Relax cutlass BYOC (#14252)
add df9d084553 [Unity][Frontend] Import `tanh` and fix `layer_norm`
(#14247)
add 71c23b7a11 [Unity] Consider target context for Relay to Relax
conversion (#14269)
add 70386ecc91 [Unity] Introduce call_dps_packed (#14183)
add c676d1392f [Unity][ci] Use CPU-SMALL instances (#14256)
add 121e1e7a03 [TVMScript][Unity] Improve PyLint Compatibility (#14276)
add 6be5c19c9c [Unity] Mark tests that need python3.8 compact.
add 68827fa1d6 [Unity] Fix StructInfo Infer for `vm.alloc_tensor` (#14283)
add e3f975543f [Unity][Op] Cumsum (#14297)
add 67e183c33f [Unity][DEBUG] Add Instrument (#14302)
add f96e09a9ce [Unity][Pass] Add a pass to alter the TIR implementation of
an operator (#14215)
add a46fd90478 [Unity][Op] Enable special dimension value 0 in reshape
(#14311)
add e0c4456f50 [Unity][Web] WebGPU explicit max buffer size (#14321)
add 4bfbd3a703 [Unity] Support pattern-based rewriting (#14312)
add 2f1642f128 [Unity][BYOC] Support matmul + residual block fusion in
CUTLASS BYOC (#14317)
add 778b343ed9 [Unity][BYOC] Improve expressiveness of the pattern check
function in FuseOpsByPattern (#14310)
add 7c5a570dc1 [Unity][Transform] Enhance RewriteDataflowReshape transform
(#14265)
add b10e2a2a31 [Unity][Contrib] Introduce several features of cutlass
profiler (#14275)
add f67657fe09 [Unity][TOPI] fp16 LayerNorm & GroupNorm (#14264)
add 5cd30b00ef [Unity][Transform] Automatic Layout Conversion (#14257)
add 59746ea9b0 [Unity][Transform] Simple Dead Code Elimination (#14262)
add 371c3a2f2f [Unity][Transform] Automatic Mixed Precision (#14242)
add 691983096c [Unity][Lint] Fix cpplint casting (#14333)
add 5229827888 [Unity][Transform] DefaultSchedule pass (#14266)
add 375a2921ea [Unity][Op] Fix Strided Slice Shape Inference (#14324)
add 920f98d8f9 [Unity][Pass] Reuse prior infra to implement more complete
DCE (#14334)
add f4a451d273 [Unity] Remove Python interface of RemoveUnusedFunction
(#14336)
add 050562bfe9 [Unity][BYOC] Update testcases to follow recent changes
(#14339)
add 7534531a6e [Unity][Frontend] FX exp and strided_slice fix (#14338)
add 021ce24d0c [Unity] Support model kwargs in dynamo_capture_subgraph
(#14349)
add fe34a55bee [Unity][BYOC] Check leaked intermediate variables in
cutlass patterns (#14350)
add 9ae5895aae [Unity][Transform] AMP out_dtype=float16 testcases (#14358)
add cd48ba9e33 [Unity][Fix] Fix block memory plan to handle bool (#14357)
add 57b90700d2 [Unity][Transform] Introduce data-dependent operation of
reshape and its constant folding (#14282)
add 27f0ad3230 [Unity][Transform] Fix AMP tests (#14360)
add 20cc2ff300 [Unity] Add support to append relay op attrs in translator
(#14356)
add 79492b81d9 [Unity][WEB] Support async pipeline creation (#14362)
add 7974e0f816 [Unity][Pass] Fix FuseOps error if there is no output of a
given group (#14354)
add b5cdcec5d6 [Unity][Fix] Infer Layout must support negative axes
(#14365)
add eb7b020c22 [Unity] Add More Ops For FX Translator (#14348)
add 14f7b5af61 [Unity][TVMScript] Update GlobalVar `checked_type_` when
`emit_te` (#14367)
add a0b28c486e [Unity][Fix] Allow scalar layout initialization (#14370)
add 9253f0fef1 [Unity] Also include output dtype in simt MathInstruction
(#14372)
add 8d8ced72a1 [Unity][VM] Add CUDA graph vm builtins (#14371)
add 983b569325 [Unity] Add missing #include <array> (#14383)
add d6efe9f042 [Unity][Transform] SplitCallTIRByPattern and CUTLASS
backend (#14274)
add f7f45dcc8e [Unity] Support simple dynamic-shape-aware fusion (#14396)
add 34b32cdf68 [Unity][Op] Add stop_lift_params (#14368)
add 6eb12ff85a [Unity][TVMScript] Fix Shape Var occurrence in Tensor
annotation (#14404)
add ae3cf5f5f1 [Unity][Transform] Common Subexpression Elimination (#14361)
add 6818799e74 [Unity][QNN][Hexagon]Support Relax Constants in the QNN
TOPI operations (#14386)
add a8205ed0c1 [Unity][Op] Conv1d (#14388)
add 906f69056d [Unity] Fix getting shapes for cutlass BYOC kernels (#14411)
add 5426085988 [Unity][Op] Expose scale in `R.nn.attention` and add its
legalize op (#14412)
add 7f44914317 [Unity][Hexagon] Enable Relax VM for Hexagon (#14415)
add 94b9bbc6bc [Unity][Fix] Copy over module attrs in FuseTIR (#14418)
add 4d29a7cff8 [Unity] Handle extern func calls in static memory planning
(#14419)
add e424eea3dd [Unity] Include constant shapes in the profiler result
(#14428)
add b1346b97a9 [Unity][Fix] Annotate TIR op pattern could have no stores.
(#14420)
add beb25a701c [Unity] Minor updates to DataFlowBlockRewrite (#14431)
add 22c363a72a [Unity] Remove non-deterministic behavior from graph
pattern matching (#14417)
add 61f7edc1af [Unity][Graph matching] Automatically add `used-by`
constraints for `is_op` pattern (#14439)
add ad4bfbc5f9 [Unity][Op][Docs] Update comment for `call_tir_dyn` (#14441)
add 0a4ce4dfc3 [Unity][Graph matching] Clean up undo stack for parent and
child nodes properly (#14440)
add 52f6e639de [Unity] Pattern-based rewriting for dataflow block (#14446)
add 5c8b7af59c [Unity] Fix lint during for upstream black
add 1527bfba04 [TVMScript][Bugfix] Tuple on the RHS of AssignDoc (#14452)
add 97ab25c33f [Unity][BYOC] Faster cutlass codegen (#14465)
add db01567b09 [Unity][Op] introduce `shape_to_tensor` op (#14447)
add ea56542f96 [Unity][CI] Update gpu and lint image (#14473)
add 751da24bce [Unity][BYOC] Fix `RunCodegen` pass on symbolic shape
(#14472)
add e54e04d520 [Unity][BYOC] Add cuBLAS backend (#14291)
add 5c2058dc3c [Unity][OP] Add `arange` op (#14463)
add 3b8aa040e4 [Unity][op] Relax rsqrt and sinh (#14479)
add aec773d690 [Unity][VM] Add Attention KV cache builtin (#14478)
add b7d3762f39 [Unity][Hexagon] Don't pass raw scalars in
hexagon/qnn/nn.py (#14474)
add 6bcd375369 [Unity] Torch-like NN module enhancement (#14499)
add 1eea30d54a [Unity] LiftTransformParams with symbolic shape robustness
(#14500)
add e93ee9fc56 [Unity][Op] vm.alloc_tensor infer struct info (#14503)
add 22878a52fa [Unity][Refactor] Use NameSupply for unique symbol
generation and remove NameTable (#14507)
add d9f3aa32ef [Unity][Analysis] Get symbolic TIR vars from struct info
(#14509)
add 8e4f94a689 [Unity][Op] Extend `relax.op.take` op to match behavior
with `topi.take`. (#14481)
add bd003e847b [Unity] Memory planning with TIR var upper bound (#14511)
add 3bd516701c [Unity] handle bf16 in dump_ndarray_cache and
load_ndarray_cache (#14514)
add a92258ec55 [Unity] Support Simple Dynamic-Shape-Aware in FuseTIR
(#14515)
add 8474255a04 [Fix] symbolic thread extent program compilation (#14516)
add 603f8bd721 [Unity][Op] Negative Log Likelihood Loss (#14517)
add bb479e66dd [Unity][Transform] Some Improvements on pass DecomposeOps
(#14512)
add cea447cf37 [Unity][Graph matching] Improved matching algorithm and
implementation (#14501)
add 34695afa5c [Unity][Op] introduce `ScatterElement` op (#14493)
add 8ea976276c [Unity] Properly handle tuple-outputting function in
`FuseOpsByPattern` (#14525)
add d5182388a5 [Unity][Op] Gradient functions for high-level Relax
operators (#14527)
add 0c447d6f9c [Unity][Transform] High-level reverse-mode automatic
differentiation pass (#14542)
add 97f4db0f91 [Unity] DefaultGPUSchedule working for targets other than
CUDA (#14540)
add b816aa26db [Unity][Pass] Enhance Dynamic-aware FuseOps (#14543)
add add45b5c1c [Unity] Make FuseOps work on a call_tir which has a
ShapeExpr arg (#14553)
add ac645b2348 [Unity] Fix emit_te with symbolic input (#14552)
add d6c8f7933b [Unity] Preserve symbolic var args when applying call_tir
(#14555)
add a6831ba9c4 [Unity] Enable pod args in WebGPU (#14560)
add 876ed385ce [Unity][Op] Symbolic shape support of take grad (#14559)
add fadf144f27 [Unity][Op] add einsum and flip in Relax (#14545)
add 8b4dcfd1f1 [Unity][TOPI] Symbolic shape support for `collapse_sum`
(#14535)
add c51bd8b76d [Unity] Enhance Dynamic-aware FuseTIR (#14577)
add ddbd3d7ec0 [Unity][TVMScript] Update struct_info for GlobalVar (#14579)
add eeae66b301 [Unity][MetaSchedule] Skip Scheduled PrimFuncs in Task
Generation (#14402)
add 88f5b8f590 [Unity][PyTorch] Disable gradient during dynamo subgraph
capture to save RAM (#14602)
add 61fbf4262d [Unity] Fix FuseTIR when the same buffer is read multiple
times with different access pattern (#14603)
add e416226bd7 [Unity][MetaSchedule] BlockCollector focusing on current
func (#14595)
add 7db0b984de [Unity][Op] Dynamic Strided Slice (#14548)
add a59c09de77 [Unity] Add pass for combining parallel matmul (#14583)
add 77b35e80d8 [Unity][BYOC] Add fused patterns for stacked attention
(#14608)
add d611700b80 [Unity] Fix ForceNarrowI32 with pod arguments (#14605)
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
 * -- * -- B -- O -- O -- O   (815422cfc0)
            \
             N -- N -- N   refs/heads/unity-merge-backup-04-13 (d611700b80)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
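As a rough illustration of the O/N split described above, the sketch below walks a toy, strictly linear history and works out the common base B, the revisions that dropped off the branch (the O revisions), and the ones that were added (the N revisions). The commit names, the `parents` map, and the `classify` helper are hypothetical and are not part of the notification tooling. Whether a dropped revision is reported as "omit" or "discard" depends on whether other references still reach it, which this sketch does not model.

```python
# Toy history; commit names are hypothetical stand-ins for real hashes.
# Each commit maps to its single parent (None for the root commit).
parents = {
    "A1": None,
    "A2": "A1",
    "B": "A2",
    "O1": "B", "O2": "O1", "O3": "O2",   # old branch tip is O3
    "N1": "B", "N2": "N1", "N3": "N2",   # new branch tip is N3
}


def ancestors(tip):
    """Return the commits reachable from tip, newest first, including tip."""
    chain = []
    while tip is not None:
        chain.append(tip)
        tip = parents[tip]
    return chain


def classify(old_tip, new_tip):
    """Split a forced update into (common base, dropped revisions, added revisions)."""
    old_chain = ancestors(old_tip)
    new_chain = ancestors(new_tip)
    old_set, new_set = set(old_chain), set(new_chain)
    # The common base is the newest old-branch commit still reachable from the new tip.
    base = next(c for c in old_chain if c in new_set)
    dropped = [c for c in old_chain if c not in new_set]  # the O revisions
    added = [c for c in new_chain if c not in old_set]    # the N revisions
    return base, dropped, added


base, dropped, added = classify("O3", "N3")
print("base:", base)
print("dropped:", dropped)
print("added:", added)
```

In this email, the old tip corresponds to 815422cfc0 and the new tip to d611700b80.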
No new revisions were added by this update.
Summary of changes:
3rdparty/cutlass | 2 +-
CMakeLists.txt | 9 +
CONTRIBUTORS.md | 1 -
apps/benchmark/adreno/adreno_gpu_bench_clml.py | 36 +-
apps/benchmark/adreno/adreno_gpu_bench_texture.py | 36 +-
apps/relax_examples/e2e_auto_tir.py | 253 ++
apps/relax_examples/mlp.py | 57 +
apps/relax_examples/nn_module.py | 69 +
apps/relax_examples/resnet.py | 53 +
ci/jenkins/docker-images.ini | 2 +-
ci/jenkins/generated/arm_jenkinsfile.groovy | 115 +-
ci/jenkins/generated/cortexm_jenkinsfile.groovy | 159 +-
ci/jenkins/generated/cpu_jenkinsfile.groovy | 93 +-
ci/jenkins/generated/docker_jenkinsfile.groovy | 21 +-
ci/jenkins/generated/gpu_jenkinsfile.groovy | 172 +-
ci/jenkins/generated/hexagon_jenkinsfile.groovy | 115 +-
ci/jenkins/generated/i386_jenkinsfile.groovy | 60 +-
ci/jenkins/generated/lint_jenkinsfile.groovy | 17 +-
.../generated/minimal_cross_isa_jenkinsfile.groovy | 38 +-
ci/jenkins/generated/minimal_jenkinsfile.groovy | 38 +-
ci/jenkins/generated/riscv_jenkinsfile.groovy | 38 +-
ci/jenkins/generated/wasm_jenkinsfile.groovy | 27 +-
ci/jenkins/templates/utils/Prepare.groovy.j2 | 4 +-
ci/jenkins/templates/utils/base.groovy.j2 | 6 +-
ci/jenkins/templates/utils/macros.j2 | 23 +-
ci/jenkins/unity_jenkinsfile.groovy | 337 +++
cmake/modules/CUDA.cmake | 8 +-
cmake/modules/contrib/CUTLASS.cmake | 4 +-
cmake/modules/contrib/DNNL.cmake | 8 +-
cmake/modules/contrib/TensorRT.cmake | 2 +-
conda/recipe/meta.yaml | 2 +-
docker/Dockerfile.ci_cpu | 4 -
docker/Dockerfile.ci_gpu | 3 -
docker/install/ubuntu_install_jax.sh | 35 -
docker/install/ubuntu_install_sccache.sh | 2 +-
docs/how_to/deploy/vitis_ai.rst | 2 +-
docs/reference/api/python/index.rst | 1 -
docs/reference/api/python/meta_schedule.rst | 23 -
.../how_to/deploy_models/deploy_model_on_adreno.py | 38 +-
gallery/how_to/tune_with_autotvm/tune_relay_arm.py | 24 +-
.../how_to/tune_with_autotvm/tune_relay_cuda.py | 24 +-
.../tune_with_autotvm/tune_relay_mobile_gpu.py | 24 +-
gallery/how_to/tune_with_autotvm/tune_relay_x86.py | 24 +-
gallery/tutorial/autotvm_relay_x86.py | 39 +-
include/tvm/arith/analyzer.h | 40 +-
include/tvm/arith/int_set.h | 16 -
include/tvm/arith/iter_affine_map.h | 3 +-
include/tvm/ir/expr.h | 9 +
include/tvm/ir/function.h | 133 +-
include/tvm/ir/global_info.h | 80 +
include/tvm/ir/module.h | 22 +-
include/tvm/ir/name_supply.h | 46 +-
include/tvm/ir/transform.h | 54 +-
include/tvm/ir/type.h | 3 +-
include/tvm/node/object_path.h | 6 +-
include/tvm/node/script_printer.h | 12 +
include/tvm/node/structural_equal.h | 53 +-
include/tvm/relax/analysis.h | 470 +++
include/tvm/relax/attrs/create.h | 54 +
include/tvm/relax/attrs/datatype.h | 53 +
include/tvm/relax/attrs/image.h | 81 +
include/tvm/relax/attrs/index.h | 62 +
include/tvm/relax/attrs/linear_algebra.h | 53 +
include/tvm/relax/attrs/manipulate.h | 153 +
include/tvm/relax/attrs/nn.h | 326 ++
include/tvm/relax/attrs/search.h | 48 +
include/tvm/relax/attrs/statistical.h | 63 +
include/tvm/relax/backend.h | 51 +
include/tvm/relax/binding_rewrite.h | 115 +
include/tvm/relax/block_builder.h | 240 ++
include/tvm/relax/dataflow_matcher.h | 74 +
include/tvm/relax/dataflow_pattern.h | 828 ++++++
include/tvm/relax/dataflow_pattern_functor.h | 183 ++
include/tvm/relax/exec_builder.h | 181 ++
include/tvm/relax/expr.h | 1039 +++++++
include/tvm/relax/expr_functor.h | 551 ++++
include/tvm/relax/nested_msg.h | 580 ++++
include/tvm/relax/op_attr_types.h | 75 +
include/tvm/relax/struct_info.h | 435 +++
include/tvm/relax/struct_info_functor.h | 151 +
include/tvm/relax/tir_pattern.h | 75 +
include/tvm/relax/transform.h | 488 +++
include/tvm/relax/tuning_api.h | 396 +++
include/tvm/relax/type.h | 166 ++
include/tvm/relax/utils.h | 99 +
include/tvm/relay/attrs/nn.h | 6 +-
include/tvm/relay/transform.h | 2 +-
include/tvm/runtime/c_runtime_api.h | 2 +-
include/tvm/runtime/container/optional.h | 9 -
include/tvm/runtime/module.h | 4 +
include/tvm/runtime/object.h | 39 +-
include/tvm/runtime/relax_vm/builtin.h | 89 +
include/tvm/runtime/relax_vm/bytecode.h | 223 ++
include/tvm/runtime/relax_vm/executable.h | 213 ++
include/tvm/runtime/relax_vm/memory_manager.h | 142 +
include/tvm/runtime/relax_vm/vm.h | 187 ++
include/tvm/script/ir_builder/ir/frame.h | 4 +
include/tvm/script/ir_builder/relax/frame.h | 293 ++
include/tvm/script/ir_builder/relax/ir.h | 144 +
include/tvm/script/ir_builder/tir/ir.h | 18 +-
include/tvm/te/operation.h | 2 +-
include/tvm/tir/buffer.h | 14 +-
include/tvm/tir/builtin.h | 44 +
include/tvm/tir/data_type_rewriter.h | 9 +-
include/tvm/tir/function.h | 53 +-
include/tvm/tir/schedule/schedule.h | 25 -
include/tvm/tir/stmt.h | 36 +-
include/tvm/tir/transform.h | 35 +-
include/tvm/topi/detail/extern.h | 2 +-
include/tvm/topi/nn/dilate.h | 3 +-
include/tvm/topi/nn/group_norm.h | 31 +-
include/tvm/topi/nn/layer_norm.h | 28 +-
include/tvm/topi/nn/pooling.h | 58 +-
include/tvm/topi/transform.h | 58 +-
python/tvm/_ffi/libinfo.py | 14 +-
python/tvm/arith/__init__.py | 3 +-
python/tvm/arith/analyzer.py | 27 -
python/tvm/arith/iter_affine_map.py | 43 -
python/tvm/autotvm/testing/tune_relay.py | 2 +-
python/tvm/autotvm/tuner/xgboost_cost_model.py | 45 +-
python/tvm/autotvm/tuner/xgboost_tuner.py | 4 +-
python/tvm/contrib/cutlass/attention_operation.py | 162 +
python/tvm/contrib/cutlass/build.py | 358 ++-
python/tvm/contrib/cutlass/conv2d_operation.py | 29 +-
python/tvm/contrib/cutlass/gemm_operation.py | 149 +-
python/tvm/contrib/cutlass/gemm_profiler.py | 4 +-
python/tvm/contrib/cutlass/gen_conv2d.py | 119 +-
python/tvm/contrib/cutlass/gen_gemm.py | 74 +-
python/tvm/contrib/cutlass/gen_tensor_op.py | 260 +-
python/tvm/contrib/cutlass/library.py | 8 +
python/tvm/contrib/hexagon/session.py | 37 +-
python/tvm/contrib/torch/pytorch_tvm.py | 24 +-
python/tvm/contrib/tvmjs.py | 305 ++
python/tvm/driver/tvmc/autotuner.py | 46 +-
python/tvm/exec/rpc_proxy.py | 38 +-
python/tvm/ir/__init__.py | 1 +
python/tvm/ir/expr.py | 61 +-
python/tvm/ir/function.py | 42 +-
python/tvm/ir/global_info.py | 42 +
python/tvm/ir/module.py | 53 +-
python/tvm/ir/transform.py | 95 +-
python/tvm/meta_schedule/__init__.py | 1 +
python/tvm/meta_schedule/cost_model/xgb_model.py | 15 +-
python/tvm/meta_schedule/database/database.py | 7 +-
python/tvm/meta_schedule/postproc/postproc.py | 2 +-
python/tvm/meta_schedule/relax_integration.py | 356 +++
.../meta_schedule/schedule_rule/schedule_rule.py | 4 +-
.../search_strategy/search_strategy.py | 6 +-
.../space_generator/space_generator.py | 4 +-
python/tvm/meta_schedule/tir_integration.py | 89 +
python/tvm/meta_schedule/tune.py | 15 +-
python/tvm/meta_schedule/tune_context.py | 7 +-
python/tvm/meta_schedule/utils.py | 50 +-
python/tvm/relax/__init__.py | 96 +
python/tvm/relax/_ffi_api.py | 20 +
python/tvm/relax/analysis/__init__.py | 21 +
python/tvm/relax/analysis/_ffi_api.py | 19 +
python/tvm/relax/analysis/analysis.py | 475 +++
python/tvm/relax/analysis/estimate_memory_usage.py | 165 +
python/tvm/relax/backend/__init__.py | 20 +
python/tvm/relax/backend/_ffi_api.py | 21 +
python/tvm/relax/backend/contrib/__init__.py | 20 +
python/tvm/relax/backend/contrib/cublas.py | 154 +
python/tvm/relax/backend/contrib/cutlass.py | 293 ++
python/tvm/relax/backend/pattern_registry.py | 119 +
python/tvm/relax/backend/patterns.py | 239 ++
python/tvm/relax/backend_tir/__init__.py | 20 +
python/tvm/relax/backend_tir/contrib/__init__.py | 20 +
python/tvm/relax/backend_tir/contrib/cutlass.py | 720 +++++
python/tvm/relax/backend_tir/pattern.py | 576 ++++
python/tvm/relax/binding_rewrite.py | 155 +
python/tvm/relax/block_builder.py | 657 ++++
python/tvm/relax/dpl/__init__.py | 22 +
python/tvm/relax/dpl/_ffi.py | 20 +
python/tvm/relax/dpl/context.py | 80 +
python/tvm/relax/dpl/pattern.py | 1125 +++++++
python/tvm/relax/dpl/rewrite.py | 115 +
python/tvm/relax/exec_builder.py | 147 +
python/tvm/relax/expr.py | 706 +++++
python/tvm/relax/expr_functor.py | 1530 ++++++++++
python/tvm/relax/frontend/__init__.py | 20 +
python/tvm/relax/frontend/common.py | 55 +
python/tvm/relax/frontend/torch/__init__.py | 21 +
python/tvm/relax/frontend/torch/dynamo.py | 173 ++
python/tvm/relax/frontend/torch/fx_translator.py | 1310 ++++++++
python/tvm/relax/ir/instrument.py | 37 +
python/tvm/relax/op/__init__.py | 52 +
python/tvm/relax/op/_ffi_api.py | 19 +
python/tvm/relax/op/_op_gradient.py | 1198 ++++++++
python/tvm/relax/op/base.py | 468 +++
python/tvm/relax/op/binary.py | 287 ++
python/tvm/relax/op/builtin/__init__.py | 20 +
python/tvm/relax/op/builtin/_ffi_api.py | 19 +
python/tvm/relax/op/builtin/builtin.py | 70 +
python/tvm/relax/op/create.py | 261 ++
python/tvm/relax/op/datatype.py | 59 +
python/tvm/relax/op/grad/__init__.py | 20 +
python/tvm/relax/op/grad/_ffi_api.py | 20 +
python/tvm/relax/op/grad/grad.py | 144 +
python/tvm/relax/op/image/__init__.py | 19 +
python/tvm/relax/op/image/_ffi_api.py | 20 +
python/tvm/relax/op/image/image.py | 128 +
python/tvm/relax/op/index.py | 130 +
python/tvm/relax/op/linear_algebra.py | 112 +
python/tvm/relax/op/manipulate.py | 488 +++
python/tvm/relax/op/memory/__init__.py | 20 +
python/tvm/relax/op/memory/_ffi_api.py | 19 +
python/tvm/relax/op/memory/memory.py | 126 +
python/tvm/relax/op/nn/__init__.py | 19 +
python/tvm/relax/op/nn/_ffi_api.py | 20 +
python/tvm/relax/op/nn/nn.py | 1011 +++++++
python/tvm/relax/op/op_attrs.py | 159 +
python/tvm/relax/op/search.py | 104 +
python/tvm/relax/op/set.py | 110 +
python/tvm/relax/op/statistical.py | 269 ++
python/tvm/relax/op/ternary.py | 43 +
python/tvm/relax/op/unary.py | 553 ++++
python/tvm/relax/op/vm/__init__.py | 20 +
python/tvm/relax/op/vm/_ffi_api.py | 19 +
python/tvm/relax/op/vm/vm.py | 110 +
python/tvm/relax/pipeline.py | 84 +
python/tvm/relax/struct_info.py | 197 ++
python/tvm/relax/testing/__init__.py | 23 +
python/tvm/relax/testing/ast_printer.py | 372 +++
python/tvm/relax/testing/lib_comparator.py | 128 +
python/tvm/relax/testing/matmul.py | 66 +
python/tvm/relax/testing/nn.py | 196 ++
python/tvm/relax/testing/relay_translator.py | 265 ++
python/tvm/relax/testing/runtime_builtin.py | 34 +
python/tvm/relax/testing/transform.py | 125 +
python/tvm/relax/testing/vm.py | 85 +
python/tvm/relax/transform/__init__.py | 23 +
python/tvm/relax/transform/_ffi_api.py | 19 +
.../tvm/relax/transform/legalize_ops/__init__.py | 29 +
python/tvm/relax/transform/legalize_ops/binary.py | 59 +
python/tvm/relax/transform/legalize_ops/common.py | 120 +
.../tvm/relax/transform/legalize_ops/creation.py | 85 +
.../tvm/relax/transform/legalize_ops/datatype.py | 31 +
python/tvm/relax/transform/legalize_ops/grad.py | 219 ++
python/tvm/relax/transform/legalize_ops/image.py | 39 +
python/tvm/relax/transform/legalize_ops/index.py | 125 +
.../relax/transform/legalize_ops/linear_algebra.py | 103 +
.../tvm/relax/transform/legalize_ops/manipulate.py | 165 +
python/tvm/relax/transform/legalize_ops/nn.py | 403 +++
python/tvm/relax/transform/legalize_ops/search.py | 41 +
.../relax/transform/legalize_ops/statistical.py | 88 +
python/tvm/relax/transform/legalize_ops/unary.py | 38 +
python/tvm/relax/transform/transform.py | 1250 ++++++++
python/tvm/relax/transform/tuning_api/__init__.py | 22 +
python/tvm/relax/transform/tuning_api/_ffi_api.py | 19 +
python/tvm/relax/transform/tuning_api/database.py | 273 ++
.../transform/tuning_api/default_functions.py | 306 ++
.../tvm/relax/transform/tuning_api/primitives.py | 419 +++
python/tvm/relax/ty.py | 75 +
python/tvm/relax/utils.py | 462 +++
python/tvm/relax/vm_build.py | 329 ++
python/tvm/relay/backend/contrib/ethosu/codegen.py | 90 +-
.../tvm/relay/backend/contrib/ethosu/legalize.py | 62 +-
.../tvm/relay/backend/contrib/ethosu/op/pooling.py | 10 +-
.../tvm/relay/backend/contrib/ethosu/te/pooling.py | 6 +-
python/tvm/relay/frontend/keras.py | 12 +-
python/tvm/relay/frontend/mxnet.py | 2 +-
python/tvm/relay/frontend/oneflow.py | 4 +-
python/tvm/relay/frontend/paddlepaddle.py | 14 +-
python/tvm/relay/frontend/pytorch.py | 16 -
python/tvm/relay/op/contrib/arm_compute_lib.py | 24 +-
python/tvm/relay/op/contrib/ethosu.py | 57 +-
python/tvm/relay/op/nn/nn.py | 2 +-
python/tvm/relay/op/op_attrs.py | 10 -
python/tvm/relay/qnn/op/_qnn.py | 6 +-
.../transform/fake_quantization_to_integer.py | 11 -
python/tvm/rpc/proxy.py | 22 +-
python/tvm/runtime/module.py | 10 +-
python/tvm/runtime/object_path.py | 6 +-
python/tvm/runtime/relax_vm.py | 508 ++++
python/tvm/runtime/script_printer.py | 72 +-
python/tvm/script/__init__.py | 1 -
python/tvm/script/ir_builder/ir/__init__.py | 2 +
python/tvm/script/ir_builder/ir/ir.py | 27 +-
python/tvm/script/ir_builder/relax/__init__.py | 20 +
python/tvm/script/ir_builder/relax/_ffi_api.py | 20 +
python/tvm/script/ir_builder/relax/frame.py | 55 +
python/tvm/script/ir_builder/relax/ir.py | 672 +++++
python/tvm/script/ir_builder/tir/ir.py | 22 +-
python/tvm/script/parser/__init__.py | 5 +-
python/tvm/script/parser/core/entry.py | 3 +
python/tvm/script/parser/core/parser.py | 51 +-
python/tvm/script/parser/core/utils.py | 26 +-
python/tvm/script/parser/ir/__init__.py | 2 +-
python/tvm/script/parser/ir/parser.py | 34 +-
python/tvm/script/parser/relax/__init__.py | 43 +
python/tvm/script/parser/relax/entry.py | 341 +++
python/tvm/script/parser/relax/parser.py | 359 +++
python/tvm/script/parser/tir/__init__.py | 2 +-
python/tvm/script/parser/tir/parser.py | 7 +-
python/tvm/script/relax.py | 18 +
python/tvm/script/tir.py | 18 +
python/tvm/te/__init__.py | 1 +
python/tvm/te/operation.py | 54 +-
python/tvm/tir/op.py | 70 +-
python/tvm/tir/schedule/schedule.py | 114 -
python/tvm/tir/tensor_intrin/cuda.py | 220 +-
python/tvm/tir/transform/function_pass.py | 3 +-
python/tvm/tir/transform/transform.py | 33 +
python/tvm/topi/cuda/ssd/multibox.py | 4 +-
python/tvm/topi/hexagon/qnn/nn.py | 57 +-
python/tvm/topi/hexagon/utils.py | 35 +-
python/tvm/topi/nn/group_norm.py | 2 +
python/tvm/topi/nn/layer_norm.py | 2 +
python/tvm/topi/scan.py | 2 +-
python/tvm/topi/scatter_elements.py | 2 +-
python/tvm/topi/testing/group_norm_python.py | 5 +-
python/tvm/topi/testing/layer_norm_python.py | 3 +
python/tvm/topi/testing/poolnd_python.py | 6 +-
python/tvm/topi/transform.py | 35 +
python/tvm/topi/vision/ssd/multibox.py | 32 +-
python/tvm/topi/x86/conv2d.py | 10 +-
rust/tvm/src/ir/module.rs | 16 +-
rust/tvm/src/ir/relay/mod.rs | 2 +
src/arith/analyzer.cc | 64 +-
src/arith/bound_deducer.cc | 4 -
src/arith/canonical_simplify.cc | 132 +-
src/arith/int_constraints.cc | 25 +-
src/arith/int_set.cc | 29 +-
src/arith/interval_set.h | 9 +-
src/arith/ir_mutator_with_analyzer.cc | 14 +-
src/arith/ir_mutator_with_analyzer.h | 29 +-
src/arith/iter_affine_map.cc | 438 +--
src/arith/pattern_match.h | 1 +
src/arith/product_normal_form.h | 89 -
src/arith/rewrite_simplify.cc | 113 +-
src/arith/rewrite_simplify.h | 1 +
src/contrib/hybrid/codegen_hybrid.cc | 4 +-
src/driver/driver_api.cc | 7 +-
src/ir/expr.cc | 12 +-
src/ir/function.cc | 38 +-
src/ir/global_info.cc | 32 +
src/ir/module.cc | 111 +-
src/ir/name_supply.cc | 8 +-
src/ir/transform.cc | 84 +-
src/ir/type.cc | 3 +-
src/ir/type_functor.cc | 8 +-
src/meta_schedule/database/database_utils.cc | 4 +-
.../feature_extractor/per_store_feature.cc | 11 +-
src/meta_schedule/postproc/verify_gpu_code.cc | 4 +-
src/meta_schedule/postproc/verify_vtcm_limit.cc | 4 +-
src/meta_schedule/schedule_rule/schedule_rule.cc | 8 +-
.../space_generator/post_order_apply.cc | 62 -
src/meta_schedule/space_generator/schedule_fn.cc | 8 +-
src/meta_schedule/utils.h | 72 +
src/node/object_path.cc | 20 +-
src/node/script_printer.cc | 30 +-
src/node/structural_equal.cc | 51 +-
src/relax/analysis/analysis.cc | 153 +
src/relax/analysis/detect_recursion.cc | 398 +++
src/relax/analysis/layout_transformation.cc | 621 ++++
src/relax/analysis/shape_analysis.cc | 55 +
src/relax/analysis/struct_info_analysis.cc | 1027 +++++++
src/relax/analysis/tir_op_pattern_kind.cc | 453 +++
src/relax/analysis/udchain.cc | 105 +
src/relax/analysis/var2value.cc | 91 +
src/relax/analysis/well_formed.cc | 496 ++++
.../backend/contrib/codegen_json/codegen_json.h | 419 +++
src/relax/backend/contrib/cublas/codegen.cc | 110 +
src/relax/backend/contrib/cutlass/codegen.cc | 277 ++
src/relax/backend/contrib/dnnl/codegen.cc | 105 +
src/relax/backend/contrib/tensorrt/codegen.cc | 267 ++
src/relax/backend/contrib/utils.cc | 68 +
src/relax/backend/contrib/utils.h | 140 +
src/relax/backend/pattern_registry.cc | 77 +
src/relax/backend/pattern_registry.h | 73 +
src/relax/backend/task_extraction.cc | 109 +
src/relax/backend/vm/codegen_vm.cc | 452 +++
src/relax/backend/vm/codegen_vm_tir.cc | 511 ++++
src/relax/backend/vm/exec_builder.cc | 399 +++
src/relax/backend/vm/vm_builtin_lower.cc | 232 ++
src/relax/backend/vm/vm_shape_lower.cc | 730 +++++
src/relax/ir/binding_rewrite.cc | 337 +++
src/relax/ir/block_builder.cc | 953 ++++++
src/relax/ir/dataflow_matcher.cc | 931 ++++++
src/relax/ir/dataflow_matcher_impl.h | 87 +
src/relax/ir/dataflow_pattern.cc | 622 ++++
src/relax/ir/dataflow_pattern_functor.cc | 111 +
src/relax/ir/emit_te.cc | 78 +
src/relax/ir/emit_te.h | 68 +
src/relax/ir/expr.cc | 580 ++++
src/relax/ir/expr_functor.cc | 793 +++++
src/relax/ir/py_expr_functor.cc | 649 ++++
src/relax/ir/struct_info.cc | 207 ++
src/relax/ir/struct_info_functor.cc | 130 +
src/relax/ir/tir_pattern.cc | 37 +
src/relax/ir/transform.cc | 413 +++
src/relax/ir/type.cc | 88 +
src/relax/op/image/resize.cc | 128 +
src/relax/op/image/resize.h | 43 +
src/relax/op/nn/attention.cc | 129 +
src/relax/op/nn/attention.h | 41 +
src/relax/op/nn/convolution.cc | 498 ++++
src/relax/op/nn/convolution.h | 79 +
src/relax/op/nn/nn.cc | 722 +++++
src/relax/op/nn/nn.h | 95 +
src/relax/op/nn/pooling.cc | 246 ++
src/relax/op/nn/pooling.h | 50 +
src/relax/op/op.cc | 563 ++++
src/relax/op/op_common.cc | 154 +
src/relax/op/op_common.h | 352 +++
src/relax/op/tensor/binary.cc | 127 +
src/relax/op/tensor/binary.h | 112 +
src/relax/op/tensor/create.cc | 322 ++
src/relax/op/tensor/create.h | 93 +
src/relax/op/tensor/datatype.cc | 89 +
src/relax/op/tensor/datatype.h | 53 +
src/relax/op/tensor/grad.cc | 167 ++
src/relax/op/tensor/grad.h | 66 +
src/relax/op/tensor/index.cc | 317 ++
src/relax/op/tensor/index.h | 65 +
src/relax/op/tensor/linear_algebra.cc | 194 ++
src/relax/op/tensor/linear_algebra.h | 57 +
src/relax/op/tensor/manipulate.cc | 1441 +++++++++
src/relax/op/tensor/manipulate.h | 174 ++
src/relax/op/tensor/search.cc | 176 ++
src/relax/op/tensor/search.h | 49 +
src/relax/op/tensor/set.cc | 139 +
src/relax/op/tensor/set.h | 37 +
src/relax/op/tensor/statistical.cc | 195 ++
src/relax/op/tensor/statistical.h | 105 +
src/relax/op/tensor/ternary.cc | 125 +
src/relax/op/tensor/ternary.h | 45 +
src/relax/op/tensor/unary.cc | 92 +
src/relax/op/tensor/unary.h | 147 +
src/relax/transform/alter_op_impl.cc | 312 ++
src/relax/transform/annotate_tir_op_pattern.cc | 55 +
src/relax/transform/attach_global_symbol.cc | 68 +
src/relax/transform/bind_params.cc | 169 ++
src/relax/transform/call_tir_rewrite.cc | 138 +
src/relax/transform/canonicalize_bindings.cc | 135 +
src/relax/transform/combine_parallel_matmul.cc | 337 +++
src/relax/transform/convert_layout.cc | 309 ++
src/relax/transform/dead_code_elimination.cc | 132 +
src/relax/transform/decompose_ops.cc | 269 ++
src/relax/transform/eliminate_common_subexpr.cc | 209 ++
src/relax/transform/fold_constant.cc | 343 +++
src/relax/transform/fuse_ops.cc | 1265 ++++++++
src/relax/transform/fuse_tir.cc | 941 ++++++
src/relax/transform/gradient.cc | 469 +++
src/relax/transform/infer_amp_utils.cc | 59 +
src/relax/transform/infer_amp_utils.h | 85 +
src/relax/transform/infer_layout_utils.cc | 126 +
src/relax/transform/infer_layout_utils.h | 244 ++
src/relax/transform/lambda_lift.cc | 312 ++
src/relax/transform/legalize_ops.cc | 138 +
src/relax/transform/lift_transform_params.cc | 328 ++
src/relax/transform/merge_composite_functions.cc | 359 +++
src/relax/transform/meta_schedule.cc | 172 ++
src/relax/transform/normalize.cc | 186 ++
src/relax/transform/rewrite_dataflow_reshape.cc | 107 +
src/relax/transform/run_codegen.cc | 195 ++
src/relax/transform/split_call_tir_by_pattern.cc | 782 +++++
src/relax/transform/static_plan_block_memory.cc | 837 ++++++
src/relax/transform/to_mixed_precision.cc | 538 ++++
src/relax/transform/to_non_dataflow.cc | 67 +
src/relax/transform/tuning_api/database.cc | 350 +++
src/relax/transform/tuning_api/primitives.cc | 273 ++
src/relax/transform/utils.cc | 45 +
src/relax/transform/utils.h | 294 ++
src/relax/utils.cc | 142 +
src/relay/analysis/call_graph.cc | 10 +-
src/relay/analysis/get_calibration_data.cc | 11 +-
src/relay/analysis/match_exhaustion.cc | 8 +-
src/relay/analysis/type_solver.cc | 6 +-
src/relay/backend/annotate_used_memory.cc | 5 +-
src/relay/backend/aot/aot_lower_main.cc | 2 +-
src/relay/backend/aot_executor_codegen.cc | 2 +-
.../backend/contrib/cmsisnn/extract_constants.cc | 8 +-
src/relay/backend/contrib/cmsisnn/relay_to_tir.cc | 16 +-
.../contrib/cmsisnn/scalar_to_tensor_constant.cc | 9 +-
src/relay/backend/contrib/codegen_c/codegen_c.h | 35 +-
src/relay/backend/contrib/cutlass/codegen.h | 21 +
src/relay/backend/contrib/ethosu/codegen.cc | 2 +-
src/relay/backend/contrib/ethosu/preprocess.cc | 8 +-
src/relay/backend/contrib/ethosu/source_module.cc | 4 +-
.../contrib/example_target_hooks/relay_to_tir.cc | 13 +-
src/relay/backend/interpreter.cc | 18 +-
src/relay/backend/te_compiler.cc | 21 +-
src/relay/backend/utils.cc | 7 +
src/relay/backend/vm/compiler.cc | 15 +-
src/relay/backend/vm/lambda_lift.cc | 4 +-
src/relay/backend/vm/removed_unused_funcs.cc | 4 +-
src/relay/collage/mock_cost_estimator.cc | 3 +-
src/relay/collage/sub_graph.cc | 4 +-
src/relay/ir/dataflow_matcher.cc | 4 +-
src/relay/ir/expr_functor.cc | 4 +-
src/relay/ir/function.cc | 4 +-
src/relay/ir/transform.cc | 4 +-
src/relay/op/contrib/ethosu/op_attrs.h | 5 -
src/relay/op/contrib/ethosu/pooling.cc | 19 +-
src/relay/op/memory/memory.cc | 4 +-
src/relay/op/nn/convolution.cc | 18 +-
src/relay/op/tensor/math.cc | 4 -
src/relay/op/type_relations.cc | 4 +-
src/relay/parser/parser.cc | 4 +-
src/relay/printer/relay_text_printer.cc | 24 +-
src/relay/transforms/canonicalize_cast.cc | 4 +-
src/relay/transforms/compiler_function_utils.cc | 4 +-
src/relay/transforms/dead_code.cc | 4 +-
src/relay/transforms/device_planner.cc | 35 +-
src/relay/transforms/dynamic_to_static.cc | 4 +-
src/relay/transforms/eta_expand.cc | 8 +-
src/relay/transforms/fold_constant.cc | 4 +-
src/relay/transforms/fuse_ops.cc | 4 +-
src/relay/transforms/higher_order_gradient.cc | 16 +-
src/relay/transforms/inline.cc | 8 +-
src/relay/transforms/lazy_gradient_init.cc | 4 +-
src/relay/transforms/partial_eval.cc | 8 +-
src/relay/transforms/partition_graph.cc | 20 +-
src/relay/transforms/simplify_expr.cc | 4 +-
src/relay/transforms/type_infer.cc | 10 +-
src/runtime/contrib/cblas/gemm_common.h | 16 +-
src/runtime/contrib/cublas/cublas.cc | 118 +-
src/runtime/contrib/cublas/cublas_json_runtime.cc | 118 +
src/runtime/contrib/cublas/cublas_utils.h | 6 +
src/runtime/debug.cc | 8 +-
src/runtime/hexagon/hexagon_module.h | 5 +-
src/runtime/library_module.cc | 5 +-
src/runtime/metal/metal_module.mm | 14 +-
src/runtime/module.cc | 4 +
src/runtime/relax_vm/attention_kv_cache.cc | 162 +
src/runtime/relax_vm/builtin.cc | 488 +++
src/runtime/relax_vm/bytecode.cc | 68 +
src/runtime/relax_vm/cuda/cuda_graph_builtin.cc | 191 ++
src/runtime/relax_vm/executable.cc | 583 ++++
src/runtime/relax_vm/memory_manager.cc | 181 ++
src/runtime/relax_vm/naive_allocator.h | 65 +
src/runtime/relax_vm/pooled_allocator.h | 111 +
src/runtime/relax_vm/vm.cc | 1038 +++++++
src/runtime/static_library.cc | 2 +-
src/runtime/vulkan/vulkan_wrapped_func.h | 5 -
src/script/ir_builder/ir/frame.cc | 2 +-
src/script/ir_builder/ir/ir.cc | 44 +-
src/script/ir_builder/ir/utils.h | 11 +
src/script/ir_builder/relax/frame.cc | 273 ++
src/script/ir_builder/relax/ir.cc | 245 ++
src/script/ir_builder/relax/utils.h | 119 +
src/script/ir_builder/tir/ir.cc | 36 +-
src/script/printer/doc_printer/base_doc_printer.cc | 96 +-
.../printer/doc_printer/python_doc_printer.cc | 6 +-
src/script/printer/ir/ir.cc | 27 +-
src/script/printer/ir_docsifier.cc | 4 +
src/script/printer/relax/binding.cc | 87 +
src/script/printer/relax/call.cc | 211 ++
src/script/printer/relax/expr.cc | 136 +
src/script/printer/relax/function.cc | 79 +
src/script/printer/relax/region.cc | 100 +
src/script/printer/relax/struct_info.cc | 161 +
src/script/printer/relax/tir.cc | 114 +
src/script/printer/relax/type.cc | 89 +
src/script/printer/relax/utils.h | 103 +
src/script/printer/tir/block.cc | 6 +-
src/script/printer/tir/expr.cc | 39 +-
src/script/printer/tir/function.cc | 3 +-
src/script/printer/utils.h | 10 +-
src/support/array.h | 27 +-
src/target/build_common.h | 12 +-
src/target/intrin_rule.cc | 2 +-
src/target/llvm/codegen_amdgpu.cc | 2 -
src/target/llvm/codegen_blob.cc | 7 +-
src/target/llvm/codegen_cpu.cc | 34 +-
src/target/llvm/codegen_llvm.cc | 34 +-
src/target/llvm/codegen_llvm.h | 6 -
src/target/llvm/codegen_nvptx.cc | 4 +-
src/target/llvm/llvm_module.cc | 20 +-
src/target/source/codegen_c.cc | 4 +-
src/target/source/codegen_cuda.cc | 11 +-
src/target/source/codegen_metal.cc | 22 +-
src/target/source/codegen_webgpu.cc | 255 +-
src/target/source/codegen_webgpu.h | 7 +-
src/target/source/ptx.cc | 46 +-
src/target/source/source_module.cc | 10 +-
src/target/spirv/intrin_rule_spirv.cc | 5 +
src/target/tag.cc | 4 +-
src/target/target.cc | 32 +-
src/target/target_kind.cc | 7 +-
src/te/operation/create_primfunc.cc | 97 +-
src/te/operation/create_primfunc.h | 17 +
src/tir/analysis/block_access_region_detector.cc | 4 +-
src/tir/analysis/calculate_allocated_memory.cc | 5 +-
src/tir/analysis/control_flow_graph.cc | 8 +-
src/tir/analysis/estimate_flops.cc | 8 +-
src/tir/analysis/identify_memcpy.cc | 8 +-
src/tir/analysis/side_effect.cc | 4 +-
src/tir/analysis/verify_gpu_code.cc | 5 +-
src/tir/analysis/verify_memory.cc | 5 +-
src/tir/analysis/verify_ssa.cc | 5 +-
src/tir/ir/data_type_rewriter.cc | 63 +-
src/tir/ir/expr.cc | 2 +-
src/tir/ir/stmt_functor.cc | 16 +-
src/tir/ir/transform.cc | 15 +-
src/tir/op/builtin.cc | 12 +
src/tir/op/runtime.cc | 41 +
src/tir/schedule/analysis.h | 10 -
src/tir/schedule/analysis/analysis.cc | 27 -
src/tir/schedule/analysis/layout.cc | 4 +-
src/tir/schedule/analysis/reducer.cc | 22 +-
src/tir/schedule/concrete_schedule.cc | 26 -
src/tir/schedule/concrete_schedule.h | 3 -
src/tir/schedule/primitive.h | 31 -
src/tir/schedule/primitive/annotate.cc | 16 +-
src/tir/schedule/primitive/blockize_tensorize.cc | 8 +-
src/tir/schedule/primitive/cache_read_write.cc | 4 +-
src/tir/schedule/primitive/compute_at.cc | 16 +-
src/tir/schedule/primitive/decompose_padding.cc | 4 +-
src/tir/schedule/primitive/get_block_loop.cc | 38 +-
src/tir/schedule/primitive/loop_transformation.cc | 202 +-
.../schedule/primitive/reorder_block_iter_var.cc | 148 -
src/tir/schedule/schedule.cc | 81 +-
src/tir/schedule/state.cc | 5 +-
src/tir/schedule/trace.cc | 12 +-
src/tir/schedule/traced_schedule.cc | 30 -
src/tir/schedule/traced_schedule.h | 3 -
src/tir/transforms/bf16_legalize.cc | 16 +-
src/tir/transforms/compact_buffer_region.cc | 14 +-
src/tir/transforms/default_gpu_schedule.cc | 133 +
src/tir/transforms/extract_constants.cc | 5 +-
src/tir/transforms/flatten_buffer.cc | 40 +-
src/tir/transforms/force_narrow_index_to_i32.cc | 94 +
src/tir/transforms/hoist_expression.cc | 4 +-
src/tir/transforms/inject_ptx_async_copy.cc | 31 +-
src/tir/transforms/inject_rolling_buffer.cc | 6 +-
src/tir/transforms/ir_utils.cc | 186 +-
src/tir/transforms/lower_async_dma.cc | 5 +-
src/tir/transforms/lower_custom_datatypes.cc | 4 +-
src/tir/transforms/lower_opaque_block.cc | 8 +-
src/tir/transforms/lower_tvm_builtin.cc | 169 +-
src/tir/transforms/make_packed_api.cc | 6 +-
src/tir/transforms/make_unpacked_api.cc | 4 +-
src/tir/transforms/memhammer_lower_auto_copy.cc | 4 +-
src/tir/transforms/profile_instrumentation.cc | 9 +-
.../reduce_branching_through_overcompute.cc | 4 +-
src/tir/transforms/remap_thread_axis.cc | 27 +-
src/tir/transforms/renew_defs.cc | 4 +-
src/tir/transforms/rewrite_unsafe_select.cc | 4 +-
src/tir/transforms/split_host_device.cc | 23 +-
src/tir/transforms/thread_storage_sync.cc | 2 +-
src/tir/transforms/unify_thread_binding.cc | 17 +-
src/tir/transforms/vectorize_loop.cc | 4 +-
src/topi/einsum.cc | 36 +-
src/topi/transform.cc | 8 +
tests/cpp/nested_msg_test.cc | 319 ++
tests/lint/check_file_type.py | 1 +
.../contrib/test_arm_compute_lib/test_add.py | 31 +-
.../contrib/test_arm_compute_lib/test_conv2d.py | 50 -
.../contrib/test_arm_compute_lib/test_dense.py | 43 -
.../test_clml/test_adreno_collage_targets.py | 2 +-
.../cascader/test_ethosu_pooling_matcher.py | 1 -
tests/python/contrib/test_ethosu/infra.py | 20 +-
tests/python/contrib/test_ethosu/test_codegen.py | 105 +-
.../contrib/test_ethosu/test_identity_optimizer.py | 14 +-
.../contrib/test_ethosu/test_layout_optimizer.py | 91 +-
tests/python/contrib/test_ethosu/test_legalize.py | 58 +-
.../test_pass_operations_distribution.py | 8 +-
.../contrib/test_ethosu/test_replace_pooling.py | 11 +-
.../contrib/test_ethosu/test_type_inference.py | 9 +-
.../test_hexagon/test_2d_physical_buffers.py | 2 +-
.../test_hexagon/test_async_dma_pipeline.py | 18 +
tests/python/contrib/test_hexagon/test_autotvm.py | 24 +-
.../contrib/test_hexagon/test_maxpool2d_blocked.py | 2 +-
.../contrib/test_hexagon/test_relax_integration.py | 236 ++
.../test_hexagon/test_wo_qnn_canonicalization.py | 70 +
tests/python/driver/tvmc/conftest.py | 2 +-
tests/python/frontend/paddlepaddle/test_forward.py | 25 +-
tests/python/relax/conftest.py | 23 +
tests/python/relax/test_analysis.py | 438 +++
.../python/relax/test_analysis_detect_recursion.py | 453 +++
.../relax/test_analysis_estimate_memory_usage.py | 126 +
.../relax/test_analysis_struct_info_analysis.py | 602 ++++
.../test_analysis_suggest_layout_transforms.py | 831 ++++++
tests/python/relax/test_analysis_well_formed.py | 537 ++++
tests/python/relax/test_ast_printer.py | 700 +++++
.../relax/test_backend_transform_shape_lower.py | 431 +++
tests/python/relax/test_binding_rewrite.py | 336 +++
tests/python/relax/test_blockbuilder_core.py | 583 ++++
tests/python/relax/test_blockbuilder_emit_te.py | 71 +
tests/python/relax/test_codegen_cublas.py | 156 +
tests/python/relax/test_codegen_cutlass.py | 718 +++++
tests/python/relax/test_codegen_dnnl.py | 98 +
tests/python/relax/test_codegen_tensorrt.py | 108 +
tests/python/relax/test_codegen_tir_cutlass.py | 709 +++++
tests/python/relax/test_dataflow_pattern.py | 1186 ++++++++
tests/python/relax/test_e2e_op_dynamic.py | 104 +
tests/python/relax/test_expr.py | 258 ++
tests/python/relax/test_expr_args_converter.py | 146 +
tests/python/relax/test_expr_functor.py | 746 +++++
tests/python/relax/test_frontend_common.py | 41 +
tests/python/relax/test_frontend_dynamo.py | 369 +++
tests/python/relax/test_frontend_from_fx.py | 2746 +++++++++++++++++
tests/python/relax/test_op_binary.py | 213 ++
tests/python/relax/test_op_create.py | 707 +++++
tests/python/relax/test_op_datatype.py | 122 +
tests/python/relax/test_op_grad.py | 96 +
tests/python/relax/test_op_gradient_numeric.py | 794 +++++
tests/python/relax/test_op_image.py | 245 ++
tests/python/relax/test_op_index.py | 874 ++++++
tests/python/relax/test_op_linear_algebra.py | 327 ++
tests/python/relax/test_op_manipulate.py | 3137 ++++++++++++++++++++
tests/python/relax/test_op_misc.py | 124 +
tests/python/relax/test_op_nn.py | 1761 +++++++++++
tests/python/relax/test_op_nn_convolution.py | 1190 ++++++++
tests/python/relax/test_op_nn_pooling.py | 655 ++++
tests/python/relax/test_op_search.py | 436 +++
tests/python/relax/test_op_set.py | 862 ++++++
tests/python/relax/test_op_statistical.py | 266 ++
tests/python/relax/test_op_ternary.py | 162 +
tests/python/relax/test_op_unary.py | 205 ++
tests/python/relax/test_pipeline.py | 101 +
tests/python/relax/test_relax_operators.py | 235 ++
tests/python/relax/test_relay_translator.py | 312 ++
tests/python/relax/test_runtime_builtin.py | 170 ++
tests/python/relax/test_struct_info.py | 241 ++
tests/python/relax/test_testing_nn.py | 60 +
tests/python/relax/test_transform.py | 181 ++
tests/python/relax/test_transform_alter_op_impl.py | 342 +++
.../test_transform_annotate_tir_op_pattern.py | 406 +++
.../relax/test_transform_attach_global_symbol.py | 88 +
tests/python/relax/test_transform_bind_params.py | 127 +
.../relax/test_transform_canonicalize_bindings.py | 224 ++
tests/python/relax/test_transform_codegen_pass.py | 254 ++
.../test_transform_combine_parallel_matmul.py | 469 +++
.../python/relax/test_transform_convert_layout.py | 1406 +++++++++
tests/python/relax/test_transform_cse.py | 186 ++
.../relax/test_transform_dead_code_elimination.py | 452 +++
tests/python/relax/test_transform_decompose_ops.py | 391 +++
tests/python/relax/test_transform_fold_constant.py | 454 +++
tests/python/relax/test_transform_fuse_ops.py | 1391 +++++++++
.../relax/test_transform_fuse_ops_by_pattern.py | 752 +++++
tests/python/relax/test_transform_fuse_tir.py | 1007 +++++++
tests/python/relax/test_transform_gradient.py | 1164 ++++++++
.../relax/test_transform_gradient_numeric.py | 192 ++
tests/python/relax/test_transform_lambda_lift.py | 307 ++
tests/python/relax/test_transform_legalize_ops.py | 264 ++
.../relax/test_transform_legalize_ops_binary.py | 1611 ++++++++++
.../test_transform_legalize_ops_create_datatype.py | 860 ++++++
.../relax/test_transform_legalize_ops_grad.py | 381 +++
.../relax/test_transform_legalize_ops_image.py | 103 +
..._transform_legalize_ops_index_linear_algebra.py | 985 ++++++
.../test_transform_legalize_ops_manipulate.py | 1622 ++++++++++
.../python/relax/test_transform_legalize_ops_nn.py | 2940 ++++++++++++++++++
...st_transform_legalize_ops_search_statistical.py | 1143 +++++++
.../relax/test_transform_legalize_ops_unary.py | 1244 ++++++++
.../relax/test_transform_lift_transform_params.py | 540 ++++
.../test_transform_merge_composite_functions.py | 1070 +++++++
.../relax/test_transform_meta_schedule_tuning.py | 187 ++
tests/python/relax/test_transform_normalize.py | 556 ++++
.../test_transform_rewrite_dataflow_reshape.py | 262 ++
.../test_transform_static_plan_block_memory.py | 1013 +++++++
.../relax/test_transform_to_mixed_precision.py | 845 ++++++
tests/python/relax/test_tuning_api.py | 782 +++++
tests/python/relax/test_tvmscript_ir_builder.py | 177 ++
tests/python/relax/test_tvmscript_parser.py | 1383 +++++++++
.../relax/test_tvmscript_parser_op_arith_cmp.py | 183 ++
.../relax/test_tvmscript_parser_op_create.py | 176 ++
.../relax/test_tvmscript_parser_op_datatype.py | 54 +
.../python/relax/test_tvmscript_parser_op_grad.py | 142 +
.../python/relax/test_tvmscript_parser_op_image.py | 54 +
.../python/relax/test_tvmscript_parser_op_index.py | 82 +
.../test_tvmscript_parser_op_linear_algebra.py | 96 +
.../relax/test_tvmscript_parser_op_manipulate.py | 407 +++
tests/python/relax/test_tvmscript_parser_op_nn.py | 350 +++
.../relax/test_tvmscript_parser_op_search.py | 105 +
tests/python/relax/test_tvmscript_parser_op_set.py | 68 +
.../relax/test_tvmscript_parser_op_statistical.py | 189 ++
tests/python/relax/test_tvmscript_printer_relax.py | 533 ++++
tests/python/relax/test_utils.py | 107 +
tests/python/relax/test_vm_build.py | 910 ++++++
tests/python/relax/test_vm_codegen_only.py | 335 +++
tests/python/relax/test_vm_codegen_tir.py | 224 ++
tests/python/relax/test_vm_cuda_graph.py | 108 +
tests/python/relax/test_vm_execbuilder.py | 262 ++
tests/python/relax/test_vm_instrument.py | 87 +
tests/python/relax/test_vm_profiler.py | 130 +
.../relay/collage/demo_collage_partitioner.py | 2 +-
tests/python/relay/test_any.py | 32 +-
tests/python/relay/test_op_grad_level2.py | 2 +-
tests/python/relay/test_op_level2.py | 10 +-
tests/python/relay/test_op_level5.py | 8 +-
.../test_pass_fake_quantization_to_integer.py | 14 -
tests/python/relay/test_pass_merge_composite.py | 47 +-
tests/python/relay/test_vm.py | 10 +-
tests/python/topi/python/test_topi_einsum.py | 52 +-
tests/python/topi/python/test_topi_group_norm.py | 3 +-
tests/python/topi/python/test_topi_layer_norm.py | 3 +-
tests/python/topi/python/test_topi_reduce.py | 1 +
tests/python/topi/python/test_topi_transform.py | 57 +
tests/python/topi/python/test_topi_vision.py | 78 +-
.../unittest/test_arith_canonical_simplify.py | 36 -
tests/python/unittest/test_arith_deduce_bound.py | 10 +-
tests/python/unittest/test_arith_detect_cse.py | 6 +-
.../python/unittest/test_arith_iter_affine_map.py | 131 +-
.../python/unittest/test_arith_rewrite_simplify.py | 68 +-
tests/python/unittest/test_arith_simplify.py | 31 -
.../unittest/test_arith_solve_linear_inequality.py | 15 -
.../python/unittest/test_autotvm_xgboost_model.py | 4 +-
.../unittest/test_meta_schedule_cost_model.py | 26 -
...schedule_feature_extractor_per_store_feature.py | 22 -
.../test_meta_schedule_postproc_rewrite_layout.py | 12 +-
...meta_schedule_postproc_rewrite_unbound_block.py | 58 +-
...meta_schedule_schedule_cuda_layout_transform.py | 1 +
...chedule_schedule_rule_cross_thread_reduction.py | 14 +-
.../test_meta_schedule_schedule_rule_mlt.py | 100 +-
.../test_meta_schedule_schedule_rule_mlt_intrin.py | 151 +-
.../test_meta_schedule_schedule_rule_mlt_tc.py | 118 +-
.../unittest/test_meta_schedule_space_cpu.py | 2492 ++++++++--------
.../unittest/test_meta_schedule_space_cuda.py | 1018 ++++---
.../test_meta_schedule_space_cuda_async.py | 115 +-
.../test_meta_schedule_space_cuda_winograd.py | 402 +--
.../unittest/test_meta_schedule_trace_apply.py | 58 +-
tests/python/unittest/test_object_path.py | 10 -
tests/python/unittest/test_target_codegen_llvm.py | 20 -
.../unittest/test_tir_reorder_block_iter_var.py | 86 -
.../unittest/test_tir_schedule_compute_at.py | 41 -
tests/python/unittest/test_tir_schedule_merge.py | 273 --
.../python/unittest/test_tir_schedule_reduction.py | 54 -
.../python/unittest/test_tir_schedule_utilities.py | 59 -
.../unittest/test_tir_structural_equal_hash.py | 35 +-
.../test_tir_transform_compact_buffer_region.py | 85 +-
.../unittest/test_tir_transform_flatten_buffer.py | 45 -
...test_tir_transform_force_narrow_index_to_i32.py | 241 ++
.../python/unittest/test_tir_transform_helpers.py | 30 +-
.../test_tir_transform_inject_ptx_async_copy.py | 537 +---
.../test_tir_transform_split_host_device.py | 25 -
.../test_tir_transform_unify_thread_binding.py | 25 -
.../test_transform_default_gpu_schedule.py | 455 +++
tests/python/unittest/test_tvmscript_printer_ir.py | 17 +-
tests/python/unittest/test_tvmscript_roundtrip.py | 81 +-
.../python/unittest/test_tvmscript_syntax_sugar.py | 25 -
tests/scripts/request_hook/request_hook.py | 1 +
tests/scripts/task_config_build_cpu.sh | 2 +-
tests/scripts/task_lint.sh | 4 +-
tests/scripts/unity/README | 2 +
tests/scripts/unity/task_extra_lint.sh | 23 +
tests/scripts/unity/task_python_relax.sh | 37 +
tests/scripts/unity/task_python_relax_gpuonly.sh | 25 +
version.py | 2 +-
vta/scripts/tune_resnet.py | 24 +-
vta/tutorials/autotvm/tune_alu_vta.py | 24 +-
vta/tutorials/autotvm/tune_relay_vta.py | 24 +-
web/.gitignore | 1 +
web/apps/browser/rpc_plugin.html | 19 +
web/apps/browser/rpc_server.html | 78 +-
web/emcc/wasm_runtime.cc | 110 +
web/emcc/webgpu_runtime.cc | 34 +-
web/package.json | 2 +-
web/src/rpc_server.ts | 54 +-
web/src/runtime.ts | 517 +++-
web/src/webgpu.ts | 680 ++++-
web/tests/node/test_relax_vm.js | 67 +
web/tests/python/prepare_test_libs.py | 30 +-
web/tests/python/relax_rpc_test.py | 87 +
web/tests/python/webgpu_rpc_test.py | 32 +-
web/tests/python/websock_rpc_test.py | 4 +-
859 files changed, 138984 insertions(+), 9900 deletions(-)
create mode 100644 apps/relax_examples/e2e_auto_tir.py
create mode 100644 apps/relax_examples/mlp.py
create mode 100644 apps/relax_examples/nn_module.py
create mode 100644 apps/relax_examples/resnet.py
create mode 100644 ci/jenkins/unity_jenkinsfile.groovy
delete mode 100644 docker/install/ubuntu_install_jax.sh
delete mode 100644 docs/reference/api/python/meta_schedule.rst
create mode 100644 include/tvm/ir/global_info.h
create mode 100644 include/tvm/relax/analysis.h
create mode 100644 include/tvm/relax/attrs/create.h
create mode 100644 include/tvm/relax/attrs/datatype.h
create mode 100644 include/tvm/relax/attrs/image.h
create mode 100644 include/tvm/relax/attrs/index.h
create mode 100644 include/tvm/relax/attrs/linear_algebra.h
create mode 100644 include/tvm/relax/attrs/manipulate.h
create mode 100644 include/tvm/relax/attrs/nn.h
create mode 100644 include/tvm/relax/attrs/search.h
create mode 100644 include/tvm/relax/attrs/statistical.h
create mode 100644 include/tvm/relax/backend.h
create mode 100644 include/tvm/relax/binding_rewrite.h
create mode 100644 include/tvm/relax/block_builder.h
create mode 100644 include/tvm/relax/dataflow_matcher.h
create mode 100644 include/tvm/relax/dataflow_pattern.h
create mode 100644 include/tvm/relax/dataflow_pattern_functor.h
create mode 100644 include/tvm/relax/exec_builder.h
create mode 100644 include/tvm/relax/expr.h
create mode 100644 include/tvm/relax/expr_functor.h
create mode 100644 include/tvm/relax/nested_msg.h
create mode 100644 include/tvm/relax/op_attr_types.h
create mode 100644 include/tvm/relax/struct_info.h
create mode 100644 include/tvm/relax/struct_info_functor.h
create mode 100644 include/tvm/relax/tir_pattern.h
create mode 100644 include/tvm/relax/transform.h
create mode 100644 include/tvm/relax/tuning_api.h
create mode 100644 include/tvm/relax/type.h
create mode 100644 include/tvm/relax/utils.h
create mode 100644 include/tvm/runtime/relax_vm/builtin.h
create mode 100644 include/tvm/runtime/relax_vm/bytecode.h
create mode 100644 include/tvm/runtime/relax_vm/executable.h
create mode 100644 include/tvm/runtime/relax_vm/memory_manager.h
create mode 100644 include/tvm/runtime/relax_vm/vm.h
create mode 100644 include/tvm/script/ir_builder/relax/frame.h
create mode 100644 include/tvm/script/ir_builder/relax/ir.h
create mode 100644 python/tvm/contrib/cutlass/attention_operation.py
create mode 100644 python/tvm/contrib/tvmjs.py
create mode 100644 python/tvm/ir/global_info.py
create mode 100644 python/tvm/meta_schedule/relax_integration.py
create mode 100644 python/tvm/relax/__init__.py
create mode 100644 python/tvm/relax/_ffi_api.py
create mode 100644 python/tvm/relax/analysis/__init__.py
create mode 100644 python/tvm/relax/analysis/_ffi_api.py
create mode 100644 python/tvm/relax/analysis/analysis.py
create mode 100644 python/tvm/relax/analysis/estimate_memory_usage.py
create mode 100644 python/tvm/relax/backend/__init__.py
create mode 100644 python/tvm/relax/backend/_ffi_api.py
create mode 100644 python/tvm/relax/backend/contrib/__init__.py
create mode 100644 python/tvm/relax/backend/contrib/cublas.py
create mode 100644 python/tvm/relax/backend/contrib/cutlass.py
create mode 100644 python/tvm/relax/backend/pattern_registry.py
create mode 100644 python/tvm/relax/backend/patterns.py
create mode 100644 python/tvm/relax/backend_tir/__init__.py
create mode 100644 python/tvm/relax/backend_tir/contrib/__init__.py
create mode 100644 python/tvm/relax/backend_tir/contrib/cutlass.py
create mode 100644 python/tvm/relax/backend_tir/pattern.py
create mode 100644 python/tvm/relax/binding_rewrite.py
create mode 100644 python/tvm/relax/block_builder.py
create mode 100644 python/tvm/relax/dpl/__init__.py
create mode 100644 python/tvm/relax/dpl/_ffi.py
create mode 100644 python/tvm/relax/dpl/context.py
create mode 100644 python/tvm/relax/dpl/pattern.py
create mode 100644 python/tvm/relax/dpl/rewrite.py
create mode 100644 python/tvm/relax/exec_builder.py
create mode 100644 python/tvm/relax/expr.py
create mode 100644 python/tvm/relax/expr_functor.py
create mode 100644 python/tvm/relax/frontend/__init__.py
create mode 100644 python/tvm/relax/frontend/common.py
create mode 100644 python/tvm/relax/frontend/torch/__init__.py
create mode 100644 python/tvm/relax/frontend/torch/dynamo.py
create mode 100644 python/tvm/relax/frontend/torch/fx_translator.py
create mode 100644 python/tvm/relax/ir/instrument.py
create mode 100644 python/tvm/relax/op/__init__.py
create mode 100644 python/tvm/relax/op/_ffi_api.py
create mode 100644 python/tvm/relax/op/_op_gradient.py
create mode 100644 python/tvm/relax/op/base.py
create mode 100644 python/tvm/relax/op/binary.py
create mode 100644 python/tvm/relax/op/builtin/__init__.py
create mode 100644 python/tvm/relax/op/builtin/_ffi_api.py
create mode 100644 python/tvm/relax/op/builtin/builtin.py
create mode 100644 python/tvm/relax/op/create.py
create mode 100644 python/tvm/relax/op/datatype.py
create mode 100644 python/tvm/relax/op/grad/__init__.py
create mode 100644 python/tvm/relax/op/grad/_ffi_api.py
create mode 100644 python/tvm/relax/op/grad/grad.py
create mode 100644 python/tvm/relax/op/image/__init__.py
create mode 100644 python/tvm/relax/op/image/_ffi_api.py
create mode 100644 python/tvm/relax/op/image/image.py
create mode 100644 python/tvm/relax/op/index.py
create mode 100644 python/tvm/relax/op/linear_algebra.py
create mode 100644 python/tvm/relax/op/manipulate.py
create mode 100644 python/tvm/relax/op/memory/__init__.py
create mode 100644 python/tvm/relax/op/memory/_ffi_api.py
create mode 100644 python/tvm/relax/op/memory/memory.py
create mode 100644 python/tvm/relax/op/nn/__init__.py
create mode 100644 python/tvm/relax/op/nn/_ffi_api.py
create mode 100644 python/tvm/relax/op/nn/nn.py
create mode 100644 python/tvm/relax/op/op_attrs.py
create mode 100644 python/tvm/relax/op/search.py
create mode 100644 python/tvm/relax/op/set.py
create mode 100644 python/tvm/relax/op/statistical.py
create mode 100644 python/tvm/relax/op/ternary.py
create mode 100644 python/tvm/relax/op/unary.py
create mode 100644 python/tvm/relax/op/vm/__init__.py
create mode 100644 python/tvm/relax/op/vm/_ffi_api.py
create mode 100644 python/tvm/relax/op/vm/vm.py
create mode 100644 python/tvm/relax/pipeline.py
create mode 100644 python/tvm/relax/struct_info.py
create mode 100644 python/tvm/relax/testing/__init__.py
create mode 100644 python/tvm/relax/testing/ast_printer.py
create mode 100644 python/tvm/relax/testing/lib_comparator.py
create mode 100644 python/tvm/relax/testing/matmul.py
create mode 100644 python/tvm/relax/testing/nn.py
create mode 100644 python/tvm/relax/testing/relay_translator.py
create mode 100644 python/tvm/relax/testing/runtime_builtin.py
create mode 100644 python/tvm/relax/testing/transform.py
create mode 100644 python/tvm/relax/testing/vm.py
create mode 100644 python/tvm/relax/transform/__init__.py
create mode 100644 python/tvm/relax/transform/_ffi_api.py
create mode 100644 python/tvm/relax/transform/legalize_ops/__init__.py
create mode 100644 python/tvm/relax/transform/legalize_ops/binary.py
create mode 100644 python/tvm/relax/transform/legalize_ops/common.py
create mode 100644 python/tvm/relax/transform/legalize_ops/creation.py
create mode 100644 python/tvm/relax/transform/legalize_ops/datatype.py
create mode 100644 python/tvm/relax/transform/legalize_ops/grad.py
create mode 100644 python/tvm/relax/transform/legalize_ops/image.py
create mode 100644 python/tvm/relax/transform/legalize_ops/index.py
create mode 100644 python/tvm/relax/transform/legalize_ops/linear_algebra.py
create mode 100644 python/tvm/relax/transform/legalize_ops/manipulate.py
create mode 100644 python/tvm/relax/transform/legalize_ops/nn.py
create mode 100644 python/tvm/relax/transform/legalize_ops/search.py
create mode 100644 python/tvm/relax/transform/legalize_ops/statistical.py
create mode 100644 python/tvm/relax/transform/legalize_ops/unary.py
create mode 100644 python/tvm/relax/transform/transform.py
create mode 100644 python/tvm/relax/transform/tuning_api/__init__.py
create mode 100644 python/tvm/relax/transform/tuning_api/_ffi_api.py
create mode 100644 python/tvm/relax/transform/tuning_api/database.py
create mode 100644 python/tvm/relax/transform/tuning_api/default_functions.py
create mode 100644 python/tvm/relax/transform/tuning_api/primitives.py
create mode 100644 python/tvm/relax/ty.py
create mode 100644 python/tvm/relax/utils.py
create mode 100644 python/tvm/relax/vm_build.py
create mode 100644 python/tvm/runtime/relax_vm.py
create mode 100644 python/tvm/script/ir_builder/relax/__init__.py
create mode 100644 python/tvm/script/ir_builder/relax/_ffi_api.py
create mode 100644 python/tvm/script/ir_builder/relax/frame.py
create mode 100644 python/tvm/script/ir_builder/relax/ir.py
create mode 100644 python/tvm/script/parser/relax/__init__.py
create mode 100644 python/tvm/script/parser/relax/entry.py
create mode 100644 python/tvm/script/parser/relax/parser.py
create mode 100644 python/tvm/script/relax.py
create mode 100644 python/tvm/script/tir.py
delete mode 100644 src/arith/product_normal_form.h
create mode 100644 src/ir/global_info.cc
create mode 100644 src/relax/analysis/analysis.cc
create mode 100644 src/relax/analysis/detect_recursion.cc
create mode 100644 src/relax/analysis/layout_transformation.cc
create mode 100644 src/relax/analysis/shape_analysis.cc
create mode 100644 src/relax/analysis/struct_info_analysis.cc
create mode 100644 src/relax/analysis/tir_op_pattern_kind.cc
create mode 100644 src/relax/analysis/udchain.cc
create mode 100644 src/relax/analysis/var2value.cc
create mode 100644 src/relax/analysis/well_formed.cc
create mode 100644 src/relax/backend/contrib/codegen_json/codegen_json.h
create mode 100644 src/relax/backend/contrib/cublas/codegen.cc
create mode 100644 src/relax/backend/contrib/cutlass/codegen.cc
create mode 100644 src/relax/backend/contrib/dnnl/codegen.cc
create mode 100644 src/relax/backend/contrib/tensorrt/codegen.cc
create mode 100644 src/relax/backend/contrib/utils.cc
create mode 100644 src/relax/backend/contrib/utils.h
create mode 100644 src/relax/backend/pattern_registry.cc
create mode 100644 src/relax/backend/pattern_registry.h
create mode 100644 src/relax/backend/task_extraction.cc
create mode 100644 src/relax/backend/vm/codegen_vm.cc
create mode 100644 src/relax/backend/vm/codegen_vm_tir.cc
create mode 100644 src/relax/backend/vm/exec_builder.cc
create mode 100644 src/relax/backend/vm/vm_builtin_lower.cc
create mode 100644 src/relax/backend/vm/vm_shape_lower.cc
create mode 100644 src/relax/ir/binding_rewrite.cc
create mode 100644 src/relax/ir/block_builder.cc
create mode 100644 src/relax/ir/dataflow_matcher.cc
create mode 100644 src/relax/ir/dataflow_matcher_impl.h
create mode 100644 src/relax/ir/dataflow_pattern.cc
create mode 100644 src/relax/ir/dataflow_pattern_functor.cc
create mode 100644 src/relax/ir/emit_te.cc
create mode 100644 src/relax/ir/emit_te.h
create mode 100644 src/relax/ir/expr.cc
create mode 100644 src/relax/ir/expr_functor.cc
create mode 100644 src/relax/ir/py_expr_functor.cc
create mode 100644 src/relax/ir/struct_info.cc
create mode 100644 src/relax/ir/struct_info_functor.cc
create mode 100644 src/relax/ir/tir_pattern.cc
create mode 100644 src/relax/ir/transform.cc
create mode 100644 src/relax/ir/type.cc
create mode 100644 src/relax/op/image/resize.cc
create mode 100644 src/relax/op/image/resize.h
create mode 100644 src/relax/op/nn/attention.cc
create mode 100644 src/relax/op/nn/attention.h
create mode 100644 src/relax/op/nn/convolution.cc
create mode 100644 src/relax/op/nn/convolution.h
create mode 100644 src/relax/op/nn/nn.cc
create mode 100644 src/relax/op/nn/nn.h
create mode 100644 src/relax/op/nn/pooling.cc
create mode 100644 src/relax/op/nn/pooling.h
create mode 100644 src/relax/op/op.cc
create mode 100644 src/relax/op/op_common.cc
create mode 100644 src/relax/op/op_common.h
create mode 100644 src/relax/op/tensor/binary.cc
create mode 100644 src/relax/op/tensor/binary.h
create mode 100644 src/relax/op/tensor/create.cc
create mode 100644 src/relax/op/tensor/create.h
create mode 100644 src/relax/op/tensor/datatype.cc
create mode 100644 src/relax/op/tensor/datatype.h
create mode 100644 src/relax/op/tensor/grad.cc
create mode 100644 src/relax/op/tensor/grad.h
create mode 100644 src/relax/op/tensor/index.cc
create mode 100644 src/relax/op/tensor/index.h
create mode 100644 src/relax/op/tensor/linear_algebra.cc
create mode 100644 src/relax/op/tensor/linear_algebra.h
create mode 100644 src/relax/op/tensor/manipulate.cc
create mode 100644 src/relax/op/tensor/manipulate.h
create mode 100644 src/relax/op/tensor/search.cc
create mode 100644 src/relax/op/tensor/search.h
create mode 100644 src/relax/op/tensor/set.cc
create mode 100644 src/relax/op/tensor/set.h
create mode 100644 src/relax/op/tensor/statistical.cc
create mode 100644 src/relax/op/tensor/statistical.h
create mode 100644 src/relax/op/tensor/ternary.cc
create mode 100644 src/relax/op/tensor/ternary.h
create mode 100644 src/relax/op/tensor/unary.cc
create mode 100644 src/relax/op/tensor/unary.h
create mode 100644 src/relax/transform/alter_op_impl.cc
create mode 100644 src/relax/transform/annotate_tir_op_pattern.cc
create mode 100644 src/relax/transform/attach_global_symbol.cc
create mode 100644 src/relax/transform/bind_params.cc
create mode 100644 src/relax/transform/call_tir_rewrite.cc
create mode 100644 src/relax/transform/canonicalize_bindings.cc
create mode 100644 src/relax/transform/combine_parallel_matmul.cc
create mode 100644 src/relax/transform/convert_layout.cc
create mode 100644 src/relax/transform/dead_code_elimination.cc
create mode 100644 src/relax/transform/decompose_ops.cc
create mode 100644 src/relax/transform/eliminate_common_subexpr.cc
create mode 100644 src/relax/transform/fold_constant.cc
create mode 100644 src/relax/transform/fuse_ops.cc
create mode 100644 src/relax/transform/fuse_tir.cc
create mode 100644 src/relax/transform/gradient.cc
create mode 100644 src/relax/transform/infer_amp_utils.cc
create mode 100644 src/relax/transform/infer_amp_utils.h
create mode 100644 src/relax/transform/infer_layout_utils.cc
create mode 100644 src/relax/transform/infer_layout_utils.h
create mode 100644 src/relax/transform/lambda_lift.cc
create mode 100644 src/relax/transform/legalize_ops.cc
create mode 100644 src/relax/transform/lift_transform_params.cc
create mode 100644 src/relax/transform/merge_composite_functions.cc
create mode 100644 src/relax/transform/meta_schedule.cc
create mode 100644 src/relax/transform/normalize.cc
create mode 100644 src/relax/transform/rewrite_dataflow_reshape.cc
create mode 100644 src/relax/transform/run_codegen.cc
create mode 100644 src/relax/transform/split_call_tir_by_pattern.cc
create mode 100644 src/relax/transform/static_plan_block_memory.cc
create mode 100644 src/relax/transform/to_mixed_precision.cc
create mode 100644 src/relax/transform/to_non_dataflow.cc
create mode 100644 src/relax/transform/tuning_api/database.cc
create mode 100644 src/relax/transform/tuning_api/primitives.cc
create mode 100644 src/relax/transform/utils.cc
create mode 100644 src/relax/transform/utils.h
create mode 100644 src/relax/utils.cc
create mode 100644 src/runtime/contrib/cublas/cublas_json_runtime.cc
create mode 100644 src/runtime/relax_vm/attention_kv_cache.cc
create mode 100644 src/runtime/relax_vm/builtin.cc
create mode 100644 src/runtime/relax_vm/bytecode.cc
create mode 100644 src/runtime/relax_vm/cuda/cuda_graph_builtin.cc
create mode 100644 src/runtime/relax_vm/executable.cc
create mode 100644 src/runtime/relax_vm/memory_manager.cc
create mode 100644 src/runtime/relax_vm/naive_allocator.h
create mode 100644 src/runtime/relax_vm/pooled_allocator.h
create mode 100644 src/runtime/relax_vm/vm.cc
create mode 100644 src/script/ir_builder/relax/frame.cc
create mode 100644 src/script/ir_builder/relax/ir.cc
create mode 100644 src/script/ir_builder/relax/utils.h
create mode 100644 src/script/printer/relax/binding.cc
create mode 100644 src/script/printer/relax/call.cc
create mode 100644 src/script/printer/relax/expr.cc
create mode 100644 src/script/printer/relax/function.cc
create mode 100644 src/script/printer/relax/region.cc
create mode 100644 src/script/printer/relax/struct_info.cc
create mode 100644 src/script/printer/relax/tir.cc
create mode 100644 src/script/printer/relax/type.cc
create mode 100644 src/script/printer/relax/utils.h
create mode 100644 src/tir/op/runtime.cc
delete mode 100644 src/tir/schedule/primitive/reorder_block_iter_var.cc
create mode 100644 src/tir/transforms/default_gpu_schedule.cc
create mode 100644 src/tir/transforms/force_narrow_index_to_i32.cc
create mode 100644 tests/cpp/nested_msg_test.cc
create mode 100644 tests/python/contrib/test_hexagon/test_relax_integration.py
create mode 100644 tests/python/relax/conftest.py
create mode 100644 tests/python/relax/test_analysis.py
create mode 100644 tests/python/relax/test_analysis_detect_recursion.py
create mode 100644 tests/python/relax/test_analysis_estimate_memory_usage.py
create mode 100644 tests/python/relax/test_analysis_struct_info_analysis.py
create mode 100644 tests/python/relax/test_analysis_suggest_layout_transforms.py
create mode 100644 tests/python/relax/test_analysis_well_formed.py
create mode 100644 tests/python/relax/test_ast_printer.py
create mode 100644 tests/python/relax/test_backend_transform_shape_lower.py
create mode 100644 tests/python/relax/test_binding_rewrite.py
create mode 100644 tests/python/relax/test_blockbuilder_core.py
create mode 100644 tests/python/relax/test_blockbuilder_emit_te.py
create mode 100644 tests/python/relax/test_codegen_cublas.py
create mode 100644 tests/python/relax/test_codegen_cutlass.py
create mode 100644 tests/python/relax/test_codegen_dnnl.py
create mode 100644 tests/python/relax/test_codegen_tensorrt.py
create mode 100644 tests/python/relax/test_codegen_tir_cutlass.py
create mode 100644 tests/python/relax/test_dataflow_pattern.py
create mode 100644 tests/python/relax/test_e2e_op_dynamic.py
create mode 100644 tests/python/relax/test_expr.py
create mode 100644 tests/python/relax/test_expr_args_converter.py
create mode 100644 tests/python/relax/test_expr_functor.py
create mode 100644 tests/python/relax/test_frontend_common.py
create mode 100644 tests/python/relax/test_frontend_dynamo.py
create mode 100644 tests/python/relax/test_frontend_from_fx.py
create mode 100644 tests/python/relax/test_op_binary.py
create mode 100644 tests/python/relax/test_op_create.py
create mode 100644 tests/python/relax/test_op_datatype.py
create mode 100644 tests/python/relax/test_op_grad.py
create mode 100644 tests/python/relax/test_op_gradient_numeric.py
create mode 100644 tests/python/relax/test_op_image.py
create mode 100644 tests/python/relax/test_op_index.py
create mode 100644 tests/python/relax/test_op_linear_algebra.py
create mode 100644 tests/python/relax/test_op_manipulate.py
create mode 100644 tests/python/relax/test_op_misc.py
create mode 100644 tests/python/relax/test_op_nn.py
create mode 100644 tests/python/relax/test_op_nn_convolution.py
create mode 100644 tests/python/relax/test_op_nn_pooling.py
create mode 100644 tests/python/relax/test_op_search.py
create mode 100644 tests/python/relax/test_op_set.py
create mode 100644 tests/python/relax/test_op_statistical.py
create mode 100644 tests/python/relax/test_op_ternary.py
create mode 100644 tests/python/relax/test_op_unary.py
create mode 100644 tests/python/relax/test_pipeline.py
create mode 100644 tests/python/relax/test_relax_operators.py
create mode 100644 tests/python/relax/test_relay_translator.py
create mode 100644 tests/python/relax/test_runtime_builtin.py
create mode 100644 tests/python/relax/test_struct_info.py
create mode 100644 tests/python/relax/test_testing_nn.py
create mode 100644 tests/python/relax/test_transform.py
create mode 100644 tests/python/relax/test_transform_alter_op_impl.py
create mode 100644 tests/python/relax/test_transform_annotate_tir_op_pattern.py
create mode 100644 tests/python/relax/test_transform_attach_global_symbol.py
create mode 100644 tests/python/relax/test_transform_bind_params.py
create mode 100644 tests/python/relax/test_transform_canonicalize_bindings.py
create mode 100644 tests/python/relax/test_transform_codegen_pass.py
create mode 100644 tests/python/relax/test_transform_combine_parallel_matmul.py
create mode 100644 tests/python/relax/test_transform_convert_layout.py
create mode 100644 tests/python/relax/test_transform_cse.py
create mode 100644 tests/python/relax/test_transform_dead_code_elimination.py
create mode 100644 tests/python/relax/test_transform_decompose_ops.py
create mode 100644 tests/python/relax/test_transform_fold_constant.py
create mode 100644 tests/python/relax/test_transform_fuse_ops.py
create mode 100644 tests/python/relax/test_transform_fuse_ops_by_pattern.py
create mode 100644 tests/python/relax/test_transform_fuse_tir.py
create mode 100644 tests/python/relax/test_transform_gradient.py
create mode 100644 tests/python/relax/test_transform_gradient_numeric.py
create mode 100644 tests/python/relax/test_transform_lambda_lift.py
create mode 100644 tests/python/relax/test_transform_legalize_ops.py
create mode 100644 tests/python/relax/test_transform_legalize_ops_binary.py
create mode 100644 tests/python/relax/test_transform_legalize_ops_create_datatype.py
create mode 100644 tests/python/relax/test_transform_legalize_ops_grad.py
create mode 100644 tests/python/relax/test_transform_legalize_ops_image.py
create mode 100644 tests/python/relax/test_transform_legalize_ops_index_linear_algebra.py
create mode 100644 tests/python/relax/test_transform_legalize_ops_manipulate.py
create mode 100644 tests/python/relax/test_transform_legalize_ops_nn.py
create mode 100644 tests/python/relax/test_transform_legalize_ops_search_statistical.py
create mode 100644 tests/python/relax/test_transform_legalize_ops_unary.py
create mode 100644 tests/python/relax/test_transform_lift_transform_params.py
create mode 100644 tests/python/relax/test_transform_merge_composite_functions.py
create mode 100644 tests/python/relax/test_transform_meta_schedule_tuning.py
create mode 100644 tests/python/relax/test_transform_normalize.py
create mode 100644 tests/python/relax/test_transform_rewrite_dataflow_reshape.py
create mode 100644 tests/python/relax/test_transform_static_plan_block_memory.py
create mode 100644 tests/python/relax/test_transform_to_mixed_precision.py
create mode 100644 tests/python/relax/test_tuning_api.py
create mode 100644 tests/python/relax/test_tvmscript_ir_builder.py
create mode 100644 tests/python/relax/test_tvmscript_parser.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_arith_cmp.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_create.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_datatype.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_grad.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_image.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_index.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_linear_algebra.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_manipulate.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_nn.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_search.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_set.py
create mode 100644 tests/python/relax/test_tvmscript_parser_op_statistical.py
create mode 100644 tests/python/relax/test_tvmscript_printer_relax.py
create mode 100644 tests/python/relax/test_utils.py
create mode 100644 tests/python/relax/test_vm_build.py
create mode 100644 tests/python/relax/test_vm_codegen_only.py
create mode 100644 tests/python/relax/test_vm_codegen_tir.py
create mode 100644 tests/python/relax/test_vm_cuda_graph.py
create mode 100644 tests/python/relax/test_vm_execbuilder.py
create mode 100644 tests/python/relax/test_vm_instrument.py
create mode 100644 tests/python/relax/test_vm_profiler.py
delete mode 100644 tests/python/unittest/test_tir_reorder_block_iter_var.py
delete mode 100644 tests/python/unittest/test_tir_schedule_merge.py
create mode 100644 tests/python/unittest/test_tir_transform_force_narrow_index_to_i32.py
create mode 100644 tests/python/unittest/test_transform_default_gpu_schedule.py
create mode 100644 tests/scripts/unity/README
create mode 100755 tests/scripts/unity/task_extra_lint.sh
create mode 100755 tests/scripts/unity/task_python_relax.sh
create mode 100755 tests/scripts/unity/task_python_relax_gpuonly.sh
create mode 100644 web/apps/browser/rpc_plugin.html
create mode 100644 web/tests/node/test_relax_vm.js
create mode 100644 web/tests/python/relax_rpc_test.py