leezu commented on issue #17521: cmake: don't build PTX and 3.5 arch if cuda 
arch detection fails
URL: https://github.com/apache/incubator-mxnet/pull/17521#issuecomment-582516533
 
 
   > `We don't need 3.5, because code compiled for 3.0 also runs on 3.5 
hardware` - I do not believe that is the case, those archs are not binary 
compatible. I would say that the better choice would be to remove SM3.0 (as 
SM3.5 is still accessible via p2 AWS instance). We do not need PTX for 7.0 if 
we include PTX for 7.5.
   
   Why is this wrong? Binary forward compatibility is explicitly described in 
Nvidia docs 
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#binary-compatibility

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to