trevor-m commented on pull request #8174:
URL: https://github.com/apache/tvm/pull/8174#issuecomment-854235480


   The input shapes are `boxes=(1, 1917, 1, 4)` and `scores=(1, 1917, 91)`.
   
   Here are some more numbers for TF SSD Mobilenet v1.
   | Implementation | Thrust | Latency (ms) |
   | --- | --- | --- |
   | Previous | Yes | 3.41 |
   | Previous | No | 3.85 |
   | New | Yes | 4.56 |
   | New | No |  2.98 ms |
   
   It looks like without thrust this implementation is much faster!
   
   Even if we are unable to improve the thrust performance in this PR, I still 
prefer this new implementation because of the static shapes. The performance is 
still much better than TF anyway which is ~6ms.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to