Lunderberg commented on PR #16689:
URL: https://github.com/apache/tvm/pull/16689#issuecomment-1998475761

   Thank you for the quick turnaround on the fix, and I like it.  I agree that 
most GPU frameworks are asynchronous by design, and by necessity.  My concern 
was mainly that it was a change in the existing
   
   > The default CUDA sync behavior of copyfromto actually was mainly limited 
to the default stream, so it was some what limited to cuda for a default stream 
setting.
   
   Ah, I had thought that was intentional.  Absent any explicit opt-in, the GPU 
operations would be synchronized on attempting to read, but all sequences of 
GPU operations would be asynchronous.  With the stream parameter, the transfers 
to the GPU would also be async.
   
   I like the change, to have the most common API be synchronous, while all 
internal APIs are asynchronous.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to