manishucsd commented on pull request #10185:
URL: https://github.com/apache/tvm/pull/10185#issuecomment-1032821966


   Hi Masa, This is amazing progress. Some questions on the known issues:
   
   - Accuracy alignment with cuDNN
   > Can you share the size that has accuracy issues. Can you repro the 
accuracy issue in profiler? 
   -Figure out workspace memory reuse
   > Both cuDNN and CUTLASS offers similar get_workspace_size(...) API. Thus, I 
believe this part should be similar. 
   -Split-k parameter selection strategy
   > As we discussed in an another thread, we run sweeps to find the best 
split. You can cut down the sweep on k by using a simple analytic model. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to