manishucsd commented on pull request #10185: URL: https://github.com/apache/tvm/pull/10185#issuecomment-1032821966
Hi Masa, This is amazing progress. Some questions on the known issues: - Accuracy alignment with cuDNN > Can you share the size that has accuracy issues. Can you repro the accuracy issue in profiler? -Figure out workspace memory reuse > Both cuDNN and CUTLASS offers similar get_workspace_size(...) API. Thus, I believe this part should be similar. -Split-k parameter selection strategy > As we discussed in an another thread, we run sweeps to find the best split. You can cut down the sweep on k by using a simple analytic model. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
