yelite opened a new pull request, #14252:
URL: https://github.com/apache/tvm/pull/14252

   This PR adds CUTLASS BYOC patterns for conv2d and the conv2d residual block.
   
   In addition to the new patterns in the cutlass pattern table, this PR:
   1. Removes the assumption about fused function arg order from the conv2d codegen.
   2. Restructures how the cutlass pattern table is constructed, making it more readable.
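
   As a rough illustration of item 1 (not the code in this PR), a codegen can look up each operand by a role recorded when the pattern was matched, instead of assuming that the fused function's parameters arrive in a fixed order. Every name below (`resolve_conv2d_args`, the role strings, the parameter names) is hypothetical and does not correspond to a TVM API.

```python
# Hypothetical sketch: none of these names are TVM APIs.
from typing import Dict, List


def resolve_conv2d_args(params: List[str], roles: Dict[str, str]) -> Dict[str, int]:
    """Map each operand role to its position in the fused function's parameters.

    params: parameter names of the fused function, in whatever order it was built.
    roles:  role -> parameter name, recorded when the pattern was matched.
    """
    index_of = {name: i for i, name in enumerate(params)}
    missing = [role for role, name in roles.items() if name not in index_of]
    if missing:
        raise ValueError(f"unbound roles: {missing}")
    return {role: index_of[name] for role, name in roles.items()}


# The same roles resolve correctly for two different parameter orders.
roles = {"data": "x", "weight": "w", "bias": "b"}
order_a = resolve_conv2d_args(["x", "w", "b"], roles)
order_b = resolve_conv2d_args(["b", "x", "w"], roles)
```

   With a lookup like this, the codegen would depend only on the recorded roles, so reordering the fused function's parameters would not change which tensor is treated as data, weight, or bias.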
   
   The check logic at 
https://github.com/apache/tvm/blob/main/python/tvm/relay/op/contrib/cutlass.py#L200
 isn't implemented in this PR yet because it requires a non-trivial change to the 
interface of FuseOpsByPattern. I will send a separate PR for it.
   
   cc @vinx13 @masahi 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.