vvchernov commented on PR #13472: URL: https://github.com/apache/tvm/pull/13472#issuecomment-1331017399
Hello @AndrewZhaoLuo! I've done this fix to support quantized distilbert-like model with mixed-precision in QLinearMatMut. The model is supported by ONNX RT. After this fix and fix from #13469 the model is supported by TVM with VirtualMachine. Nevertheless you are right the primitive op with mix-precision does not work on ORT and more over for TVM too. I'm working on it. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
