anijain2305 opened a new pull request #6039: URL: https://github.com/apache/incubator-tvm/pull/6039
MXNet pre-quantized BERT model: https://gluon-nlp.mxnet.io/examples/sentence_embedding/bert.html#Quantize-the-model

Features added in this PR:

* Support for per-tensor quantization for the MXNet Dense operator
* Support for per-channel quantization for the MXNet Dense operator
* Channel-wise support for dequantization
* Support for softmax `use_length` with `axis=-1`
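
To make the difference between the per-tensor and per-channel cases concrete, here is a minimal pure-Python sketch of the arithmetic involved. It is not the code from this PR: the function names `dequantize` and `softmax_with_length` are hypothetical, the weight is assumed to be laid out as one row per output unit, and masked softmax positions are assumed to be written as zeros.

```python
import math


def dequantize(rows, scales, zero_point=0):
    """Dequantize a 2-D integer matrix: real = (q - zero_point) * scale.

    `scales` is either a single number (per-tensor) or a list with one
    scale per row (per-channel; assumes one row per output unit).
    """
    if isinstance(scales, (int, float)):
        # Per-tensor: reuse the same scale for every row.
        scales = [scales] * len(rows)
    if len(scales) != len(rows):
        raise ValueError("expected one scale per row")
    return [[(q - zero_point) * s for q in row] for row, s in zip(rows, scales)]


def softmax_with_length(row, length):
    """Softmax over the first `length` entries of `row` (last-axis case).

    Entries at positions >= length are treated as padding and set to 0.0
    (an assumption of this sketch).
    """
    if not 1 <= length <= len(row):
        raise ValueError("length must be between 1 and len(row)")
    valid = row[:length]
    peak = max(valid)  # subtract the max for numerical stability
    exps = [math.exp(v - peak) for v in valid]
    total = sum(exps)
    return [e / total for e in exps] + [0.0] * (len(row) - length)


weights = [[10, -20], [4, 8]]
per_tensor = dequantize(weights, 0.5)
per_channel = dequantize(weights, [0.5, 0.25])
masked = softmax_with_length([1.0, 1.0, 5.0], 2)
```

In the per-channel call each row gets its own scale, so the second row is scaled by 0.25 instead of 0.5; in the softmax call the third value is ignored because it lies beyond the given length.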
