[GitHub] [incubator-mxnet] bartekkuncer commented on pull request #21127: [DOC] Add tutotrial about improving accuracy of quantization with oneDNN

GitBox Thu, 25 Aug 2022 05:20:00 -0700


bartekkuncer commented on PR #21127:
URL: https://github.com/apache/incubator-mxnet/pull/21127#issuecomment-1227181142


   > ## Description
   > Added a tutorial showing the advantage of using INC with MXNet for
   > quantization. It shows that INC can find the operators that introduce most
   > of the accuracy loss and exclude them from quantization. This way, the
   > partially quantized model achieves accuracy almost the same as the
   > original floating-point model, but with about a 3x performance improvement
   > over the optimized floating-point model (or 30x over the non-optimized
   > floating-point model).
   > 
   > 
   > ![image](https://user-images.githubusercontent.com/58251767/186166723-9754dabf-1985-4b07-b816-bf2ddc12a0a6.png)
   > 
   > ![image](https://user-images.githubusercontent.com/58251767/186166755-69f52542-1e59-42e9-b857-36a1b1b9288a.png)
   > 
   > ## Checklist
   > ### Essentials
   > * [x]  PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
   > * [x]  Changes are complete (i.e. I finished coding on this PR)
   > * [ ]  All changes have test coverage
   > * [ ]  Code is well-documented
   
   I assume these graphics depict the results for ResNet50 v2? It would be 
nice to add information about the workload to the description.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
