JCBrouwer commented on issue #10223:
URL: https://github.com/apache/tvm/issues/10223#issuecomment-1051938367


   Hello @masahi , sorry for the slow response, I somehow missed the 
notification on this one, thanks for enabling the op!
   
   I took a look at running on your branch and was also getting BAD_PARAMs on 
both the above test case and my full model. After a bit of mucking around I 
noticed [this 
change](https://github.com/apache/tvm/compare/main...masahi:conv2d-transpose-group-cudnn?expand=1#diff-ea7aa778342026f1671b0ec3af0e24e01aa4375532c7744e2274c20f885b89d6R303)
 is incorrect. The argument is the conv_mode which should be left as 1 
(according to the main branch).
   
   Changing that back I'm able to run both the test case and my larger model 
with grouped conv2d_transpose ops on the CUDNN backend :tada:
   
   Sadly I'm still just a few FPS shy of my performance target so I'll have to 
keep on digging for speedups.
   
   RE: support for groups in the regular cuda backend. Do you have a general 
idea of what kind of changes are necessary for that? I'm no expert, but I might 
be able to figure it out if it's just adapting similar code from grouped conv2d 
to work for grouped conv2d_transpose.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to