Laurawly commented on pull request #8808:
URL: https://github.com/apache/tvm/pull/8808#issuecomment-903949096


   > Hi, this commit is used to add calibration and tensorrt int8 mode in the 
original src/runtime/contrib/tensorrt. Currently, I tested several models 
including mobilenet_v2 and resnet34, and all of them have greater performance 
gain. Here is my GitHub gist for testing trt int8 result: 
https://gist.github.com/tiandiao123/67adb11ab3d73df8e83a1469707d7db4
   > 
   > Firstly, users need to provide calibration data to do calibration, then 
use the following command line to run your program:
   > 
   > ```
   > TVM_TENSORRT_USE_INT8=1 TENSORRT_NUM_CALI_INT8=10 python test_trt.py
   > ```
   > 
   > , where TVM_TENSORRT_USE_INT8 denotes whether to use trt int8, and 
TENSORRT_NUM_CALI_INT8 denotes how many times do we need to do data calibration 
(we'll do 10-times data calibration in the example).
   
   Please add some performance numbers for the models you tested.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to