This is an automated email from the ASF dual-hosted git repository.
tqchen pushed a change to branch unity-staging
in repository https://gitbox.apache.org/repos/asf/tvm.git
omit a425bc7a39 [Unity] Pattern-based rewriting for dataflow block (#14446)
omit 646d50dc27 [Unity][Graph matching] Clean up undo stack for parent and
child nodes properly (#14440)
omit cecc5c3ade [Unity][Op][Docs] Update comment for `call_tir_dyn` (#14441)
omit 41230a981f [Unity][Graph matching] Automatically add `used-by`
constraints for `is_op` pattern (#14439)
omit dc7ba6c46c [Unity] Remove non-deterministic behavior from graph
pattern matching (#14417)
omit 9efc5b83a7 [Unity] Minor updates to DataFlowBlockRewrite (#14431)
omit 784733a425 [Unity][Fix] Annotate TIR op pattern could have no stores.
(#14420)
omit ac90c7af01 [Unity] Include constant shapes in the profiler result
(#14428)
omit ab3299c054 [Unity] Handle extern func calls in static memory planning
(#14419)
omit c5335d96f9 [Unity][Fix] Copy over module attrs in FuseTIR (#14418)
omit d93eb5c091 [Unity][Hexagon] Enable Relax VM for Hexagon (#14415)
omit 30db3de0e7 [Unity][Op] Expose scale in `R.nn.attention` and add its
legalize op (#14412)
omit ef4057a433 [Unity] Fix getting shapes for cutlass BYOC kernels (#14411)
omit c69c75407f [Unity][Op] Conv1d (#14388)
omit f45f11a9e5 [Unity][QNN][Hexagon]Support Relax Constants in the QNN
TOPI operations (#14386)
omit 219ed08e12 [Unity][Transform] Common Subexpression Elimination (#14361)
omit 25608f40c6 [Unity][TVMScript] Fix Shape Var occurrence in Tensor
annotation (#14404)
omit 414514c1bf [Unity][Op] Add stop_lift_params (#14368)
omit f6919620c1 [Unity] Support simple dynamic-shape-aware fusion (#14396)
omit 95b6f680b7 [Unity][Transform] SplitCallTIRByPattern and CUTLASS
backend (#14274)
omit 634cfad0dc [Unity] Add missing #include <array> (#14383)
omit 5a2f1ba2c6 [Unity][VM] Add CUDA graph vm builtins (#14371)
omit 0908a43466 [Unity] Also include output dtype in simt MathInstruction
(#14372)
omit e32164a805 [Unity][Fix] Allow scalar layout initialization (#14370)
omit b5b8e206d6 [Unity][TVMScript] Update GlobalVar `checked_type_` when
`emit_te` (#14367)
omit 5afb3ea5c5 [Unity] Add More Ops For FX Translator (#14348)
omit bc391d3429 [Unity][Fix] Infer Layout must support negative axes
(#14365)
omit 77496c33f3 [Unity][Pass] Fix FuseOps error if there is no output of a
given group (#14354)
omit f38171b0cd [Unity][WEB] Support async pipeline creation (#14362)
omit 84dc90d76b [Unity] Add support to append relay op attrs in translator
(#14356)
omit fc8bbbd6b4 [Unity][Transform] Fix AMP tests (#14360)
omit 3d7af30df7 [Unity][Transform] Introduce data-dependent operation of
reshape and its constant folding (#14282)
omit 9b8e003d50 [Unity][Fix] Fix block memory plan to handle bool (#14357)
omit d108639bce [Unity][Transform] AMP out_dtype=float16 testcases (#14358)
omit a5d659099d [Unity][BYOC] Check leaked intermediate variables in
cutlass patterns (#14350)
omit d623140045 [Unity] Support model kwargs in dynamo_capture_subgraph
(#14349)
omit 602fd10694 [Unity][Frontend] FX exp and strided_slice fix (#14338)
omit 029a5e8793 [Unity][BYOC] Update testcases to follow recent changes
(#14339)
omit de8c12ab3c [Unity] Remove Python interface of RemoveUnusedFunction
(#14336)
omit ccb9074907 [Unity][Pass] Reuse prior infra to implement more complete
DCE (#14334)
omit dd742a826a [Unity][Op] Fix Strided Slice Shape Inference (#14324)
omit aa1932492b [Unity][Transform] DefaultSchedule pass (#14266)
omit 24e0fc7c69 [Unity][Lint] Fix cpplint casting (#14333)
omit 3497cca0b5 [Unity][Transform] Automatic Mixed Precision (#14242)
omit 3b731b2eee [Unity][Transform] Simple Dead Code Elimination (#14262)
omit cdb435ccff [Unity][Transform] Automatic Layout Conversion (#14257)
omit 30817d1aef [Unity][TOPI] fp16 LayerNorm & GroupNorm (#14264)
omit 3e66b205d2 [Unity][Contrib] Introduce several features of cutlass
profiler (#14275)
omit 9cba9bfd7a [Unity][Transform] Enhance RewriteDataflowReshape transform
(#14265)
omit 0145fe97a4 [Unity][BYOC] Improve expressiveness of the pattern check
function in FuseOpsByPattern (#14310)
omit db7fdfd5fa [Unity][BYOC] Support matmul + residual block fusion in
CUTLASS BYOC (#14317)
omit 1a7244135f [Unity] Support pattern-based rewriting (#14312)
omit e61576ba4b [Unity][Web] WebGPU explicit max buffer size (#14321)
omit 0f6463fccb [Unity][Op] Enable special dimension value 0 in reshape
(#14311)
omit a9ca0cf0ab [Unity][Pass] Add a pass to alter the TIR implementation of
an operator (#14215)
omit 1a582b9d79 [Unity][DEBUG] Add Instrument (#14302)
omit 0f49776de3 [Unity][Op] Cumsum (#14297)
omit d394b6a89f [Unity] Fix StructInfo Infer for `vm.alloc_tensor` (#14283)
omit df7f510da8 [Unity] Mark tests that need python3.8 compact.
omit 2ce4af3e0c [TVMScript][Unity] Improve PyLint Compatibility (#14276)
omit 97b429a256 [Unity][ci] Use CPU-SMALL instances (#14256)
omit d268b13cac [Unity] Introduce call_dps_packed (#14183)
omit 96cd5b5b4e [Unity] Consider target context for Relay to Relax
conversion (#14269)
omit 08e2a69efc [Unity][Frontend] Import `tanh` and fix `layer_norm`
(#14247)
omit 71899e5529 [Unity][BYOC] Add conv2d and residual block patterns for
Relax cutlass BYOC (#14252)
omit 77695deec6 [Unity] Allow user defined func attrs in emit_te (#14255)
omit 3cb9e263b9 [Unity][Op] Add repeat, tile, conv2d_transpose, avg_pool2d
(#14238)
omit ac82cf8b0c [Unity][Op][Tweak] Improve `StructInfo` inference for
`shape_of` (#14243)
omit 3bddee1524 [Unity][WEB] Improve ndarray cache (#14236)
omit 556b542611 [Unity][WEB] Update text prompts for syntactical
correctness (#14237)
omit fba4b6bc50 [Unity][TVMScript] Fix prim_func lost issue in
relax.emit_te (#14189)
omit 2a32d64ef1 [Unity][TVMScript] Enable Context-Aware Parsing (#14234)
omit 6ca3325a73 [Unity][Bugfix] Do not include `PrimFunc`s in the
dependency graph when checking for recursion (#14228)
omit 80fce8db81 [Unity][Transform] SimplifyNormInference (#14221)
omit 544b0821ae [Unity] Improve implementation of FuseOps (#14229)
omit a6d9601595 [Unity] ensure memory.alloc_tensor/storage roundtrippable
(#14226)
omit a3f40a7635 [Unity][WEB] Simplify WebGPU Codegen per spec (#14225)
omit 4c39c31767 [Unity][Transform] Memory plan across the IRModule (#14220)
omit 6de29c50a2 [Unity][BYOC] Add dynamic shape support to CUTLASS matmul
(#14216)
omit 7a4bdcde3c [Unity][Frontend] from_fx keeps parameters in order (#14214)
omit 45a54f3a38 [Unity][WEB] Improve webgpu codegen options to skip
readonly (#14213)
omit 58e224f8b1 [Unity][Frontend] FX translator supports unwrapping unit
return tuple (#14212)
omit 4920cd26df [Unity][Frontend] Attach imported model weights, deprecate
ImporterOutput (#14211)
omit f7ccc3bc59 [Unity] Introduce Default GPU Schedule Pass (#14182)
omit 03e413ae43 [Unity][Frontend] FX translator support torch.baddbmm
(#14202)
omit 1978e44971 [Unity][TIR][Pass] ForceNarrowIndexToInt32 (#14203)
omit 2c75602cb4 [Unity][Fix] FX translating dtype (#14201)
omit 1896823417 [Unity][Frontend] FX translator returning weights with
`keep_params_as_input` (#14197)
omit 5bafde482d [Unity][Frontend] FX translator supporting more ops (#14196)
omit 012dacec71 [Unity][Op] Legalize `round`, `floor`, `ceil`, `sign`
(#14198)
omit 694da73413 [Unity][Op] Argmax and argmin (#14195)
omit 32049d825b [Unity][Op] Group normalization (#14194)
omit d68bfb97ee [Unity][Transform] LiftTransformParams handling multiple
functions (#14192)
omit 9ade1be9f7 [Unity][WEBGPU] Codegen improvements and WebRuntime (#14187)
omit 6c3a97c71c [Unity][OP] Add an operator for fused multi head attention
(#14150)
omit 031e380c47 [Unity][Analysis] Restore Python bindings for var analyses
(#14180)
omit fb3e269c71 [Unity][Op] Full support of Relax op `power` (#14171)
omit d50be1cdf6 [Unity][BYOC] Add batch matmul support to Relax CUTLASS
BYOC (#14166)
omit 5f4a11a284 [Unity][Analysis] Analysis for detecting recursion in Relax
(#14149)
omit 70e925c8de [Unity] Add bind_constants option to FuseOpsByPattern
(#14151)
omit 96d85b2da5 [Unity][BYOC] Use Relax legalize + CPU build for reference
in tests (#14162)
omit 1d60a6a337 [Unity][Analysis] Checking function return struct info in
well-formed check (#14155)
omit 78af3acde3 [Unity][Pass] Support Symbolic Shape Deduction during
BindParam (#14154)
omit 832c1ba04c [Unity][Debugging] AST printer (#14152)
omit 016b2800a1 [Unity][Pass] Enhance constant folding to fold relax ops by
evaluating them. (#14146)
omit 993c37d3c2 [Unity][Legalize] Fix Scalar Constant Legalization (#14127)
omit 4892b763b9 [Unity] Add callback to FuseOpsByPattern to check match
result is accepted (#14109)
omit 5f5638c05a [Unity][BYOC] Assign group to unused bindings and ignroe
PrimFunc (#14139)
omit a5fbbd573f [Unity][TVMScript] emit_te sugar (#14123)
omit 8c1d87a46c [Unity][BYOC] Add transposed matmul support to Relax
CUTLASS BYOC (#14128)
omit e85a1909db [Unity] Add Global info (#14132)
omit 17d8625a73 [Unity][WEB] Relax vm on web runtime (#14131)
omit 631e483330 [Unity][BlockBuilder] Add `name_hint` argument for `emit`
and `emit_output` (#14126)
omit 81a6438bc7 [Unity][Fix] Fix bug in MergeCompositeFunctions (#14117)
omit de2e70778e [Unity] Update tests again to adapt to latest TVMScript
syntax (#14115)
omit c973eae56c [Unity][BYOC]Add relax backend pattern registry (#14106)
omit 7ac87251d0 [Unity] Remove attributes of relax.print, assert and unique
(#14101)
omit dd00671ae3 [Unity][Layout] Add layout transformation analysis for
PrimFunc (#14066)
omit 35331cdea2 [Unity] Relax Recursive function (#14092)
omit 1ea40509c9 [Unity] Lower `shape_of` to a builtin (#14093)
omit 3e139b0a93 [Unity] Fix typo in the comment (#14096)
omit 111dd1f6f5 [Unity][Relax] Set Shape Function to Be Host Function
(#14090)
omit c728978f51 [Unity] Refactor Relax Build JIT UX (#14088)
omit fa0f49a6a7 [Unity][Fix][Pass] FoldConstant with DCE in dataflow block
(#14087)
omit b5e6048361 [Unity][Analysis] TIR pattern kind analysis for
multi-buffer write block (#14075)
omit cb7e29f7de [Unity][Op] `log_softmax` and `cross_entropy_with_logits`
(#14083)
omit 394f1261a5 [Unity][BYOC] Add DNNL backend (#14082)
omit 1774d2229c [Unity][BYOC] Add CUTLASS backend (#14081)
omit 418eaf0b6b [Unity] Add testcases for `expr_args_converter` (#14080)
omit abdfe98d85 [Unity][Pass] Canonicalize Bindings (#14079)
omit 183e4e1d84 [Unity][BYOC][Pass] RunCodegen and TensorRT (#14078)
omit ac49e71881 [Unity][Transform] Add LiftTransformParams pass (#14069)
omit 575fee9bb3 [Unity][Frontend] Annotate number of non-static input of FX
function (#14067)
omit e6fdfc6075 [Unity][BYOC] Add pass to merge composite functions to
offload large subgraphs (#14062)
omit 5f15d3a5fb [Unity][Pass] Remove Unused Function (#14061)
omit daa3184b29 [Unity][Fix][Pass] Fix FuseOps for lack graph edges (#14058)
omit 3097f6648f [Unity] Relax op: collapse sum (#14059)
omit 9b1948d0ba [Unity][BYOC] Add pattern-based partitioning pass (#14054)
omit b23e18c228 [Unity][VM] Add per-op profiling support (#14053)
omit 8bad813c99 [Unity][TVMScript] Overload `__neg__` for relax expr
(#14045)
omit 80c474fbf1 [Unity][Pass] FuseOps FuseTIR fixes (#14044)
new 8016f30e4b [Unity][Pass] FuseOps FuseTIR fixes (#14044)
new eb660f2377 [Unity][TVMScript] Overload `__neg__` for relax expr
(#14045)
new f02d567791 [Unity][VM] Add per-op profiling support (#14053)
new 5a65b79e6f [Unity][BYOC] Add pattern-based partitioning pass (#14054)
new 71e688c952 [Unity] Relax op: collapse sum (#14059)
new 197d3a8d66 [Unity][Fix][Pass] Fix FuseOps for lack graph edges (#14058)
new 3f18864f78 [Unity][Pass] Remove Unused Function (#14061)
new 2994125e3a [Unity][BYOC] Add pass to merge composite functions to
offload large subgraphs (#14062)
new 9f7a040c0e [Unity][Frontend] Annotate number of non-static input of FX
function (#14067)
new 8615b16565 [Unity][Transform] Add LiftTransformParams pass (#14069)
new a816838a99 [Unity][BYOC][Pass] RunCodegen and TensorRT (#14078)
new 2fbf8ad3ba [Unity][Pass] Canonicalize Bindings (#14079)
new f015a97ac8 [Unity] Add testcases for `expr_args_converter` (#14080)
new 6d3a9b3032 [Unity][BYOC] Add CUTLASS backend (#14081)
new f0399994b5 [Unity][BYOC] Add DNNL backend (#14082)
new 85b6f66508 [Unity][Op] `log_softmax` and `cross_entropy_with_logits`
(#14083)
new fadf377c83 [Unity][Analysis] TIR pattern kind analysis for
multi-buffer write block (#14075)
new 5aa8547300 [Unity][Fix][Pass] FoldConstant with DCE in dataflow block
(#14087)
new 9d6c86680c [Unity] Refactor Relax Build JIT UX (#14088)
new a8fd3ffb41 [Unity][Relax] Set Shape Function to Be Host Function
(#14090)
new 6235c6d652 [Unity] Fix typo in the comment (#14096)
new ea0d012123 [Unity] Lower `shape_of` to a builtin (#14093)
new 5f8e0aa2ad [Unity] Relax Recursive function (#14092)
new 9aa6926f97 [Unity][Layout] Add layout transformation analysis for
PrimFunc (#14066)
new 6a7bdf57ac [Unity] Remove attributes of relax.print, assert and unique
(#14101)
new 42a06d878d [Unity][BYOC]Add relax backend pattern registry (#14106)
new c2c84991ae [Unity] Update tests again to adapt to latest TVMScript
syntax (#14115)
new d537bcd977 [Unity][Fix] Fix bug in MergeCompositeFunctions (#14117)
new 209ee04928 [Unity][BlockBuilder] Add `name_hint` argument for `emit`
and `emit_output` (#14126)
new 44f1bfedb5 [Unity][WEB] Relax vm on web runtime (#14131)
new ff95127cc0 [Unity] Add Global info (#14132)
new 4c27f82564 [Unity][BYOC] Add transposed matmul support to Relax
CUTLASS BYOC (#14128)
new ed777d5097 [Unity][TVMScript] emit_te sugar (#14123)
new 16f81969bd [Unity][BYOC] Assign group to unused bindings and ignroe
PrimFunc (#14139)
new 008fce3082 [Unity] Add callback to FuseOpsByPattern to check match
result is accepted (#14109)
new 959f5b3a16 [Unity][Legalize] Fix Scalar Constant Legalization (#14127)
new 56e8114043 [Unity][Pass] Enhance constant folding to fold relax ops by
evaluating them. (#14146)
new d83d4e52e3 [Unity][Debugging] AST printer (#14152)
new 6ac387cfea [Unity][Pass] Support Symbolic Shape Deduction during
BindParam (#14154)
new a5b0555dc3 [Unity][Analysis] Checking function return struct info in
well-formed check (#14155)
new 250cf734bd [Unity][BYOC] Use Relax legalize + CPU build for reference
in tests (#14162)
new eec815c584 [Unity] Add bind_constants option to FuseOpsByPattern
(#14151)
new d266b3b6ca [Unity][Analysis] Analysis for detecting recursion in Relax
(#14149)
new 0640642ee6 [Unity][BYOC] Add batch matmul support to Relax CUTLASS
BYOC (#14166)
new fa29543f2d [Unity][Op] Full support of Relax op `power` (#14171)
new f3e391ae3a [Unity][Analysis] Restore Python bindings for var analyses
(#14180)
new 59ec211dd1 [Unity][OP] Add an operator for fused multi head attention
(#14150)
new 930df87cb5 [Unity][WEBGPU] Codegen improvements and WebRuntime (#14187)
new 87aea68e60 [Unity][Transform] LiftTransformParams handling multiple
functions (#14192)
new 30f5c5a14f [Unity][Op] Group normalization (#14194)
new a54fcbef49 [Unity][Op] Argmax and argmin (#14195)
new 60a23a5e6f [Unity][Op] Legalize `round`, `floor`, `ceil`, `sign`
(#14198)
new 040dec513d [Unity][Frontend] FX translator supporting more ops (#14196)
new 82de2b24b4 [Unity][Frontend] FX translator returning weights with
`keep_params_as_input` (#14197)
new f4f122589a [Unity][Fix] FX translating dtype (#14201)
new b29a518bee [Unity][TIR][Pass] ForceNarrowIndexToInt32 (#14203)
new 16ca7ded54 [Unity][Frontend] FX translator support torch.baddbmm
(#14202)
new 313faa7292 [Unity] Introduce Default GPU Schedule Pass (#14182)
new a3a593b492 [Unity][Frontend] Attach imported model weights, deprecate
ImporterOutput (#14211)
new d013e834a9 [Unity][Frontend] FX translator supports unwrapping unit
return tuple (#14212)
new 6d2db12fed [Unity][WEB] Improve webgpu codegen options to skip
readonly (#14213)
new 1bb59ff0d5 [Unity][Frontend] from_fx keeps parameters in order (#14214)
new 17837b471c [Unity][BYOC] Add dynamic shape support to CUTLASS matmul
(#14216)
new bbe04cb457 [Unity][Transform] Memory plan across the IRModule (#14220)
new ce22d8edc9 [Unity][WEB] Simplify WebGPU Codegen per spec (#14225)
new 1bf1dd0578 [Unity] ensure memory.alloc_tensor/storage roundtrippable
(#14226)
new d4a4e81623 [Unity] Improve implementation of FuseOps (#14229)
new 451f95554b [Unity][Transform] SimplifyNormInference (#14221)
new ada8675944 [Unity][Bugfix] Do not include `PrimFunc`s in the
dependency graph when checking for recursion (#14228)
new 8bdfac16e0 [Unity][TVMScript] Enable Context-Aware Parsing (#14234)
new 500042f9ae [Unity][TVMScript] Fix prim_func lost issue in
relax.emit_te (#14189)
new 866e2379c5 [Unity][WEB] Update text prompts for syntactical
correctness (#14237)
new 3069fa0dd0 [Unity][WEB] Improve ndarray cache (#14236)
new edaa4529b3 [Unity][Op][Tweak] Improve `StructInfo` inference for
`shape_of` (#14243)
new 02bf2439af [Unity][Op] Add repeat, tile, conv2d_transpose, avg_pool2d
(#14238)
new eeac231d76 [Unity] Allow user defined func attrs in emit_te (#14255)
new 2ca9fa4180 [Unity][BYOC] Add conv2d and residual block patterns for
Relax cutlass BYOC (#14252)
new 3e9c8d327b [Unity][Frontend] Import `tanh` and fix `layer_norm`
(#14247)
new bb9e9d0e55 [Unity] Consider target context for Relay to Relax
conversion (#14269)
new 5963846dbf [Unity] Introduce call_dps_packed (#14183)
new 59cc3219b2 [Unity][ci] Use CPU-SMALL instances (#14256)
new 360f7566ac [TVMScript][Unity] Improve PyLint Compatibility (#14276)
new c513cb9c70 [Unity] Mark tests that need python3.8 compact.
new 72513b7287 [Unity] Fix StructInfo Infer for `vm.alloc_tensor` (#14283)
new ed01b9d90e [Unity][Op] Cumsum (#14297)
new c2b19be187 [Unity][DEBUG] Add Instrument (#14302)
new 79add7bb73 [Unity][Pass] Add a pass to alter the TIR implementation of
an operator (#14215)
new 46e78ed951 [Unity][Op] Enable special dimension value 0 in reshape
(#14311)
new a61db82f98 [Unity][Web] WebGPU explicit max buffer size (#14321)
new 89f54c9d89 [Unity] Support pattern-based rewriting (#14312)
new 0b675407f6 [Unity][BYOC] Support matmul + residual block fusion in
CUTLASS BYOC (#14317)
new 10774dc447 [Unity][BYOC] Improve expressiveness of the pattern check
function in FuseOpsByPattern (#14310)
new 932e702094 [Unity][Transform] Enhance RewriteDataflowReshape transform
(#14265)
new e47cd3634f [Unity][Contrib] Introduce several features of cutlass
profiler (#14275)
new fd34a2f5b6 [Unity][TOPI] fp16 LayerNorm & GroupNorm (#14264)
new 226745f0d7 [Unity][Transform] Automatic Layout Conversion (#14257)
new 543bc300f7 [Unity][Transform] Simple Dead Code Elimination (#14262)
new 41c7761c40 [Unity][Transform] Automatic Mixed Precision (#14242)
new b898379f67 [Unity][Lint] Fix cpplint casting (#14333)
new 90cf347c01 [Unity][Transform] DefaultSchedule pass (#14266)
new e75838d0aa [Unity][Op] Fix Strided Slice Shape Inference (#14324)
new a2e3826fb5 [Unity][Pass] Reuse prior infra to implement more complete
DCE (#14334)
new 98846fdbad [Unity] Remove Python interface of RemoveUnusedFunction
(#14336)
new 8e785f7fba [Unity][BYOC] Update testcases to follow recent changes
(#14339)
new 0e5c55f5a0 [Unity][Frontend] FX exp and strided_slice fix (#14338)
new dae16e88b1 [Unity] Support model kwargs in dynamo_capture_subgraph
(#14349)
new 8b740442ba [Unity][BYOC] Check leaked intermediate variables in
cutlass patterns (#14350)
new 3c6ec6a4ec [Unity][Transform] AMP out_dtype=float16 testcases (#14358)
new b7e56dfc17 [Unity][Fix] Fix block memory plan to handle bool (#14357)
new 6699a75247 [Unity][Transform] Introduce data-dependent operation of
reshape and its constant folding (#14282)
new 367ba1eb1f [Unity][Transform] Fix AMP tests (#14360)
new 0e2c944d0f [Unity] Add support to append relay op attrs in translator
(#14356)
new 0dae4870e3 [Unity][WEB] Support async pipeline creation (#14362)
new 7a21f00ed0 [Unity][Pass] Fix FuseOps error if there is no output of a
given group (#14354)
new 1377c87dab [Unity][Fix] Infer Layout must support negative axes
(#14365)
new a1158ab679 [Unity] Add More Ops For FX Translator (#14348)
new 83583d82fd [Unity][TVMScript] Update GlobalVar `checked_type_` when
`emit_te` (#14367)
new a80d07f5ce [Unity][Fix] Allow scalar layout initialization (#14370)
new bb961ca5ae [Unity] Also include output dtype in simt MathInstruction
(#14372)
new 63921a926d [Unity][VM] Add CUDA graph vm builtins (#14371)
new c43b7042a2 [Unity] Add missing #include <array> (#14383)
new 6540bc2f32 [Unity][Transform] SplitCallTIRByPattern and CUTLASS
backend (#14274)
new 4c88e1ef4c [Unity] Support simple dynamic-shape-aware fusion (#14396)
new af1bf15311 [Unity][Op] Add stop_lift_params (#14368)
new 58b581ee77 [Unity][TVMScript] Fix Shape Var occurrence in Tensor
annotation (#14404)
new a7b9347998 [Unity][Transform] Common Subexpression Elimination (#14361)
new e797baa915 [Unity][QNN][Hexagon]Support Relax Constants in the QNN
TOPI operations (#14386)
new a85acb37a6 [Unity][Op] Conv1d (#14388)
new 9d1b2d5b21 [Unity] Fix getting shapes for cutlass BYOC kernels (#14411)
new 0a6a62f1e6 [Unity][Op] Expose scale in `R.nn.attention` and add its
legalize op (#14412)
new 56d8d208a6 [Unity][Hexagon] Enable Relax VM for Hexagon (#14415)
new 5951059dd8 [Unity][Fix] Copy over module attrs in FuseTIR (#14418)
new 3163763e1a [Unity] Handle extern func calls in static memory planning
(#14419)
new 72231d2c34 [Unity] Include constant shapes in the profiler result
(#14428)
new 053609e64f [Unity][Fix] Annotate TIR op pattern could have no stores.
(#14420)
new 8f567b5477 [Unity] Minor updates to DataFlowBlockRewrite (#14431)
new 40e6ca73b1 [Unity] Remove non-deterministic behavior from graph
pattern matching (#14417)
new 17b8c6cbae [Unity][Graph matching] Automatically add `used-by`
constraints for `is_op` pattern (#14439)
new 228ab429c6 [Unity][Op][Docs] Update comment for `call_tir_dyn` (#14441)
new 821a4fb40f [Unity][Graph matching] Clean up undo stack for parent and
child nodes properly (#14440)
new 6ffad3374e [Unity] Pattern-based rewriting for dataflow block (#14446)
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (a425bc7a39)
\
N -- N -- N refs/heads/unity-staging (6ffad3374e)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
The 141 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
src/relax/transform/fuse_tir.cc | 24 ------------------------
src/runtime/hexagon/hexagon_module.h | 1 -
2 files changed, 25 deletions(-)