[GitHub] [tvm] Goose-Bomb commented on issue #13545: [Bug][FQ2I] Failed to run FakeQuantizationToInteger on QDQ ONNX model

GitBox Wed, 07 Dec 2022 18:29:32 -0800


Goose-Bomb commented on issue #13545:
URL: https://github.com/apache/tvm/issues/13545#issuecomment-1341893797


   > In the meantime, here is the branch that makes FQ2I work for your model:
   > https://github.com/apache/tvm/compare/main...masahi:fq2i-dense-add-fix?expand=1
   > 
   > I've added a hack to the ONNX importer to convert size-1 scale and zp
   > tensors in your model to scalars. But as I said, you should update the way
   > these params are stored in your model.
   
   I found that ONNXRuntime's quantizer also uses a rank-1, size-1 tensor to
   represent per-tensor scale and zp, so is this something TVM's ONNX frontend
   should take into consideration? Users have no control over how these
   quantizers are implemented.
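
   For illustration, the kind of normalization being discussed could look
   roughly like the sketch below. This is not the code from the linked branch
   and it uses no TVM or ONNX APIs. The helper name squeeze_per_tensor_param
   is hypothetical, and tensors are modeled as plain Python lists, so a
   rank-1, size-1 tensor is assumed to look like [0.02].

```python
from numbers import Number


def squeeze_per_tensor_param(value):
    """Hypothetical helper: turn a rank-1, size-1 list into a scalar.

    Anything else (an existing scalar, or a per-channel list with more
    than one element) is returned unchanged.
    """
    if isinstance(value, list) and len(value) == 1 and isinstance(value[0], Number):
        return value[0]
    return value


# Per-tensor params stored as size-1 lists become scalars.
scale = squeeze_per_tensor_param([0.02])
zero_point = squeeze_per_tensor_param([128])

# Per-channel params (more than one element) are left as they are.
per_channel_scale = squeeze_per_tensor_param([0.1, 0.2])
```

   Only the size-1 case is rewritten, so per-channel quantization parameters
   keep their shape. Whether the frontend should do this for every model is
   the open question above.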


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
