yelite opened a new pull request, #14252: URL: https://github.com/apache/tvm/pull/14252
This PR adds cutlass BYOC patterns for conv2d and the conv2d residual block. In addition to the new patterns in the cutlass pattern table, this PR:

1. Removes the assumption about fused function argument order from the conv2d codegen.
2. Restructures how the cutlass pattern table is constructed, making it more readable.

The check logic at https://github.com/apache/tvm/blob/main/python/tvm/relay/op/contrib/cutlass.py#L200 isn't implemented in this PR yet because it requires a non-trivial change to the interface of FuseOpsByPattern. I will send a separate PR for this.

cc @vinx13 @masahi
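To illustrate what removing the argument-order assumption from the conv2d codegen could look like, here is a minimal pure-Python sketch. It is not code from this PR and does not use TVM's API: the `Call` class, `find_call`, and `resolve_conv2d_args` are hypothetical names invented for the example. The idea is that each input's role (data, weight, bias) is derived from which operator consumes it in the fused body, rather than from its position in the fused function's parameter list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Call:
    """Hypothetical stand-in for an operator call in a fused function body."""
    op: str
    args: tuple  # each entry is a parameter name (str) or a nested Call


def find_call(expr, op_name):
    """Depth-first search for the first call to `op_name`; None if absent."""
    if not isinstance(expr, Call):
        return None
    if expr.op == op_name:
        return expr
    for arg in expr.args:
        found = find_call(arg, op_name)
        if found is not None:
            return found
    return None


def resolve_conv2d_args(body):
    """Map roles to inputs by inspecting the body, not the parameter order."""
    conv = find_call(body, "conv2d")
    if conv is None:
        raise ValueError("fused body contains no conv2d call")
    data, weight = conv.args
    roles = {"data": data, "weight": weight}
    add = find_call(body, "add")
    if add is not None:
        # The bias is whichever operand of `add` is not the conv2d result.
        roles["bias"] = next(a for a in add.args if a is not conv)
    return roles


# The same computation, with the bias written on either side of the add.
conv = Call("conv2d", ("x", "w"))
body_a = Call("relu", (Call("add", (conv, "b")),))
body_b = Call("relu", (Call("add", ("b", conv)),))
roles_a = resolve_conv2d_args(body_a)
roles_b = resolve_conv2d_args(body_b)
```

Both `roles_a` and `roles_b` map `data` to `x`, `weight` to `w`, and `bias` to `b`, regardless of how the operands or the function parameters happen to be ordered.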
