Goose-Bomb commented on issue #13545:
URL: https://github.com/apache/tvm/issues/13545#issuecomment-1341893797

> In the meantime, here is the branch that makes FQ2I work for your model: https://github.com/apache/tvm/compare/main...masahi:fq2i-dense-add-fix?expand=1
>
> I've added a hack to the ONNX importer to convert size-1 scale and zp tensors in your model to scalars. But as I said, you should update the way these params are stored in your model.

I found that ONNXRuntime's quantizer also uses a rank-1, size-1 tensor to represent the per-tensor scale and zp. Is this something TVM's ONNX frontend should take into consideration? Users have no control over how these quantizers are implemented.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at: [email protected]
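
For illustration only, the size-1 to scalar conversion discussed in the comment could look roughly like the NumPy sketch below. The helper name `to_scalar_if_size_one` is hypothetical. It is not taken from the linked branch or from TVM's ONNX frontend, and it only shows the idea of collapsing a rank-1, size-1 scale or zp into a rank-0 value while leaving per-channel parameters untouched.

```python
import numpy as np


def to_scalar_if_size_one(param):
    """Hypothetical helper: collapse a size-1 tensor to a rank-0 array.

    Per-channel parameters (more than one element) are returned unchanged.
    """
    arr = np.asarray(param)
    if arr.ndim > 0 and arr.size == 1:
        # reshape(()) keeps the dtype and yields a rank-0 (scalar) array
        return arr.reshape(())
    return arr


# Per-tensor scale and zp stored as rank-1, size-1 tensors
scale = to_scalar_if_size_one(np.array([0.02], dtype="float32"))
zero_point = to_scalar_if_size_one(np.array([128], dtype="uint8"))

# A per-channel scale must keep its shape
per_channel = to_scalar_if_size_one(np.array([0.1, 0.2], dtype="float32"))
```

Whether a frontend should apply this kind of normalization by default is exactly the open question raised above; the sketch does not settle it.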
