srkreddy1238 commented on PR #13413:
URL: https://github.com/apache/tvm/pull/13413#issuecomment-1323326740

   Thanks for the review.
   
   TVM benchmarks generally evaluate ```run``` call ignoring the 
```set_input``` & ```get_output```. There exist a significant end to end 
performance overhead caused due to input/output (copes and also using different 
input buffer every time affects cache too). This was very evident when I 
benchmarked TVM model over MLPerf android app.
   
   Buffer sharing is well known practice and is supported by most of the edge 
platforms across cores like Camera ISP, GPU, CPU...etc. Motivation here is to 
encourage the runtime backends to support Native Ptr access. This can retain 
the TVM performance numbers at final application level with less overheads.
   
   I am good with packed function also for now until there is more demand to 
expose native buffers to applications via NDArray.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to