srkreddy1238 commented on PR #13413: URL: https://github.com/apache/tvm/pull/13413#issuecomment-1323326740
Thanks for the review.

TVM benchmarks generally time only the ```run``` call and ignore ```set_input``` and ```get_output```. In practice there is a significant end-to-end performance overhead from input/output handling: the copies themselves, plus the cache effects of using a different input buffer on every invocation. This was very evident when I benchmarked a TVM model in the MLPerf Android app.

Buffer sharing is a well-known practice and is supported by most edge platforms across cores such as the camera ISP, GPU, and CPU. The motivation here is to encourage runtime backends to support native pointer access, so that TVM's performance numbers carry over to the final application with less overhead.

I am also fine with a packed function for now, until there is more demand to expose native buffers to applications via NDArray.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at: [email protected]
