anirudh2290 commented on issue #14725: Performance Regression on CUDA10
URL: 
https://github.com/apache/incubator-mxnet/issues/14725#issuecomment-485580778
 
 
   Thanks for the useful suggestion @KellenSunderland . @stu1130 obtained the 
logs and didnt see any volta_sgemm_128x64_nt calls . We see 
cublasSgemmEx_internal calls and we tried to grep for configuration of m=128 
n=64 but weren't able to find anything. I am not sure which cublas call maps to 
this call (volta_sgemm_128_64_nt) and we have also asked nvidia for help. Let 
us know if you have any ideas.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to