[GitHub] [incubator-tvm] yongwww commented on pull request #6108: Fix CUDA Compute Function For `get_valid_counts` and `nms`

GitBox Mon, 14 Sep 2020 18:04:43 -0700


yongwww commented on pull request #6108:
URL: https://github.com/apache/incubator-tvm/pull/6108#issuecomment-692398165



   @lsy643 thanks for sharing the results. What I am wondering is the latency 
of your change vs previous nms gpu version (even the output is not identical), 
and probably the perf number of your change vs TensorFlow baseline. As Leyuan 
mentioned above, the thread related change might cause performance regression, 
performance matters a lot for us, so we would like to see some perf number 
about this. If performance regression does exist, then it should be fixed. 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [incubator-tvm] yongwww commented on pull request #6108: Fix CUDA Compute Function For `get_valid_counts` and `nms`

Reply via email to