rkimball opened a new pull request #7428:
URL: https://github.com/apache/tvm/pull/7428


   * Added pass AnnotateDevicePlacement which allows for per-Call op placement 
using on_device. This allows for heterogeneous execution of non-external 
compilers such as CPU and GPU.
   * Added unit test for AnnotateDevicePlacement.
   * Moved the on_device definition from an anonymous lambda to a c++ function 
so that it can be called directly from c++ instead of forcing the use of 
PackedFunction calls.
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to