JackWeiw opened a new pull request, #15986: URL: https://github.com/apache/tvm/pull/15986
`compact_buffer_region` PASS modify shared buffer stride[0] to `T.int64(72) * T.min((n + T.int64(63)) // T.int64(64) * T.int64(64), T.int64(96))` and stride[1] is `T.int64(72)` but in LowerOpaqueBlock PASS it report error: InternalError: Check failed: (is_zero(floormod(buffer->strides[i - 1], buffer->strides[i]))) is false: For more detaied discuss, see [here](https://discuss.tvm.apache.org/t/bug-tir-symbolic-floormod/15826) Another bug occurs in PASS InjectPTXAsyncCopy . that is dst_offset.dtype could be int64, the dtype of PrimExpr(index_factor) would be set to default to int32. cause dtype inconsistent when calling tir::Mul. To reproduce the problem in InjectPTXAsyncCopy, see script [here](https://gist.github.com/JackWeiw/5b80956ab44c0f63d4f434f18f42cc89) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
