This is an automated email from the ASF dual-hosted git repository.
masahi pushed a change to branch ci-docker-staging
in repository https://gitbox.apache.org/repos/asf/tvm.git
discard 370eca4d23 update min sm requirement
discard d956c94968 more tf fix
discard 8ab12931fc more tf fix
discard b6b4d4d6db fix TF tests
discard e3681e9404 trigger more run
discard 93dcb1871a update gpu images
add fc2a9e50af [CODEGEN][METAL] Fix unaligned vector load (#14332)
add c6c89c3a25 [Hexagon] Add concept of DMA groups (#14254)
add 06276846a1 [Docs] Update listed tvmc python dependencies (#14341)
add 36b30974a9 [MetaSchedule] Introducing MemHammer (#14164)
add 7f6da09052 [TIR] Fix Datatype in Lower TVM Builtin (#14347)
add 4819300803 [CI][Lint] Update black (#14346)
add 50b3ae4877 [TIR] [Analysis] Expose IsOutputBlock to python (#14352)
add d4ca123afc [BugFix] Support rewrite_once when the number of callbacks
> 1 (#14344)
new 32f4846602 update gpu images
new 418c63da14 trigger more run
new 16a31ea78c fix TF tests
new b35ed68711 more tf fix
new 09e13d32e9 more tf fix
new ac5cdadcbf update min sm requirement
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (370eca4d23)
\
N -- N -- N refs/heads/ci-docker-staging (ac5cdadcbf)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
The 6 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
docker/Dockerfile.ci_lint | 2 +-
docs/install/from_source.rst | 2 +-
include/tvm/meta_schedule/postproc.h | 3 +-
include/tvm/tir/builtin.h | 37 +-
include/tvm/tir/schedule/schedule.h | 5 +
include/tvm/tir/stmt.h | 31 +
include/tvm/tir/transform.h | 6 +
.../postproc/disallow_async_strided_mem_copy.py | 11 +-
python/tvm/tir/schedule/analysis.py | 19 +
python/tvm/tir/schedule/schedule.py | 24 +
python/tvm/tir/tensor_intrin/hexagon.py | 35 +-
python/tvm/tir/transform/transform.py | 11 +
src/driver/driver_api.cc | 2 +-
.../feature_extractor/per_store_feature.cc | 1 +
.../postproc/disallow_async_strided_mem_copy.cc | 8 +-
src/meta_schedule/postproc/verify_gpu_code.cc | 1 +
src/relay/ir/dataflow_matcher.cc | 37 +-
src/runtime/hexagon/hexagon_device_api.cc | 19 +-
src/runtime/hexagon/hexagon_user_dma.cc | 14 +-
src/runtime/hexagon/hexagon_user_dma.h | 31 +-
src/runtime/hexagon/ring_buffer.h | 76 +-
src/target/source/codegen_c.cc | 15 +-
src/target/source/codegen_metal.cc | 17 +-
src/target/source/codegen_metal.h | 4 +-
src/target/source/codegen_opencl.cc | 37 +
src/target/source/codegen_opencl.h | 3 +
src/tir/op/builtin.cc | 6 +
src/tir/schedule/analysis/analysis.cc | 5 +
src/tir/schedule/concrete_schedule.cc | 24 +
src/tir/schedule/concrete_schedule.h | 5 +
src/tir/schedule/primitive.h | 9 +
src/tir/schedule/primitive/read_write_at.cc | 421 ++++++++
src/tir/schedule/schedule.cc | 4 +
src/tir/schedule/traced_schedule.cc | 28 +
src/tir/schedule/traced_schedule.h | 5 +
src/tir/transforms/inject_software_pipeline.cc | 35 +-
src/tir/transforms/lower_async_dma.cc | 172 ++--
src/tir/transforms/lower_tvm_builtin.cc | 43 +-
src/tir/transforms/memhammer_coalesce.cc | 234 +++++
src/tir/transforms/memhammer_intermediate_stage.cc | 444 ++++++++
src/tir/transforms/memhammer_lower_auto_copy.cc | 779 ++++++++++++++
src/tir/transforms/memhammer_rewrite_rule.h | 242 +++++
src/tir/transforms/memhammer_tensorcore_rewrite.cc | 350 +++++++
.../cpp-runtime/hexagon/hexagon_user_dma_tests.cc | 2 +-
tests/cpp-runtime/hexagon/ring_buffer_tests.cc | 203 +++-
.../metaschedule_e2e/test_resnet50_int8.py | 1 -
.../test_hexagon/test_async_dma_pipeline.py | 6 +-
.../test_hexagon/test_software_pipeline_async.py | 1 -
tests/python/relay/test_dataflow_pattern.py | 79 +-
tests/python/unittest/test_target_codegen_metal.py | 25 +
.../python/unittest/test_tir_schedule_analysis.py | 21 +
.../test_tir_transform_lower_tvm_builtin.py | 19 +-
...test_tir_transform_memhammer_lower_auto_copy.py | 1062 ++++++++++++++++++++
53 files changed, 4424 insertions(+), 252 deletions(-)
create mode 100644 src/tir/schedule/primitive/read_write_at.cc
create mode 100644 src/tir/transforms/memhammer_coalesce.cc
create mode 100644 src/tir/transforms/memhammer_intermediate_stage.cc
create mode 100644 src/tir/transforms/memhammer_lower_auto_copy.cc
create mode 100644 src/tir/transforms/memhammer_rewrite_rule.h
create mode 100644 src/tir/transforms/memhammer_tensorcore_rewrite.cc
create mode 100644
tests/python/unittest/test_tir_transform_memhammer_lower_auto_copy.py