trevor-m commented on pull request #8174: URL: https://github.com/apache/tvm/pull/8174#issuecomment-854235480
The input shapes are `boxes=(1, 1917, 1, 4)` and `scores=(1, 1917, 91)`. Here are some more numbers for TF SSD Mobilenet v1. | Implementation | Thrust | Latency (ms) | | --- | --- | --- | | Previous | Yes | 3.41 | | Previous | No | 3.85 | | New | Yes | 4.56 | | New | No | 2.98 ms | It looks like without thrust this implementation is much faster! Even if we are unable to improve the thrust performance in this PR, I still prefer this new implementation because of the static shapes. The performance is still much better than TF anyway which is ~6ms. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
