mbs-octoml opened a new pull request #9689:
URL: https://github.com/apache/tvm/pull/9689


   This is in support of #9613 which allows memory scopes to flow
   out of already-lowered PrimFuncs into the rest of the Relay
   program. This means scope choices made during lowering can
   be accounted for in the rest of the program, with device_copies
   inserted as required.
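
   To make the idea concrete, here is a toy sketch of how scopes
   flowing out of already-lowered functions could determine where
   copies are needed. It is not TVM's implementation: the `Call`
   class, the `plan_copies` helper, and the scope strings are all
   hypothetical names chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Call:
    """Toy stand-in for a call to an already-lowered function (hypothetical)."""
    name: str
    input_scopes: tuple   # memory scope required for each argument
    output_scope: str     # memory scope the result is produced in
    args: tuple           # names of the calls producing each argument


def plan_copies(calls):
    """Return (producer, consumer, src_scope, dst_scope) for every scope mismatch.

    `calls` is assumed to be in topological (dataflow) order.
    """
    produced = {}  # call name -> scope of its result
    copies = []
    for call in calls:
        for arg, wanted in zip(call.args, call.input_scopes):
            have = produced[arg]
            if have != wanted:
                # A copy is needed only where producer and consumer disagree.
                copies.append((arg, call.name, have, wanted))
        produced[call.name] = call.output_scope
    return copies


# Scope names here are illustrative strings only.
calls = [
    Call("conv", (), "texture", ()),
    Call("relu", ("texture",), "texture", ("conv",)),
    Call("dense", ("global",), "global", ("relu",)),
]
copies = plan_copies(calls)
```

   In this toy program only the relu -> dense edge needs a copy,
   because that is the only place where the producer's scope
   differs from the scope the consumer expects.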
   
   Somewhat more speculatively, we also allow memory scopes to flow
   into PrimFuncs. This is in preparation for when we can split
   lowering into two phases: i) lowering "primitive" fused Relay
   functions to TensorIR in a schedulable form roughly isomorphic
   to TE, and ii) actual scheduling down to traditional TIR. Once
   that split is made, it will be possible to flow memory scopes
   out of one PrimFunc and into another, avoiding device_copies
   that would otherwise be needed only because the memory scopes
   were chosen independently.
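
   As a rough illustration of why flowing scopes into
   not-yet-scheduled functions could save copies, the toy sketch
   below compares independently chosen scopes with scopes inherited
   from the producer. The helpers `resolve_scopes` and
   `count_copies`, the stage tuples, and the scope strings are
   hypothetical and do not correspond to any TVM API.

```python
def resolve_scopes(stages, flow_in):
    """Pick one scope per stage.

    Each stage is (name, default_scope, already_scheduled). With flow_in=True,
    a stage that is not yet scheduled adopts its producer's scope instead of
    its independently chosen default. Stages are assumed to form a simple chain.
    """
    resolved = []
    prev_scope = None
    for name, default_scope, already_scheduled in stages:
        if flow_in and not already_scheduled and prev_scope is not None:
            scope = prev_scope
        else:
            scope = default_scope
        resolved.append((name, scope))
        prev_scope = scope
    return resolved


def count_copies(resolved):
    """Count adjacent stage pairs whose scopes differ (each needs one copy)."""
    return sum(1 for (_, a), (_, b) in zip(resolved, resolved[1:]) if a != b)


# Illustrative chain: conv is already scheduled; add and relu are not.
stages = [
    ("conv", "texture", True),
    ("add", "global", False),
    ("relu", "global", False),
]
independent = count_copies(resolve_scopes(stages, flow_in=False))
flowed = count_copies(resolve_scopes(stages, flow_in=True))
```

   With independent choices the conv -> add edge needs a copy; when
   the unscheduled stages inherit the producer's scope, no copy is
   needed in this example.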
   
   I also suspect we'll want to put our focus on layouts rather
   than memory scopes, but this at least sets up some of the
   machinery.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

