masahi commented on pull request #9898:
URL: https://github.com/apache/tvm/pull/9898#issuecomment-1013506772


   Ping @Laurawly @Hzfengsy @comaniac, what do you think? My conclusion is that 
cuDNN is broken for NHWC + int8. Our test case at 
https://github.com/apache/tvm/blob/70de68a38f6738255cc71fd9a38964b23f2c5f55/tests/python/contrib/test_cudnn.py#L107
 is also broken: it initializes the input tensors with 
`np.random.uniform(-1, 1)` and all outputs are zero, so the comparison is not 
meaningful.
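
   A minimal sketch of why such inputs are a problem for an int8 test, assuming 
the samples are cast to int8 before being fed to the op (the cast, the shape, 
the seed and the variable names are illustrative assumptions, not taken from the 
test file). Floats in [-1, 1) truncate toward zero, so the int8 tensor ends up 
all zeros. Drawing integers directly is one possible alternative, not 
necessarily the fix used in this PR.

```python
import numpy as np

# Fixed seed and shape are arbitrary choices for this illustration.
rng = np.random.default_rng(0)
shape = (1, 4, 4, 8)  # hypothetical NHWC shape

# Float samples in [-1, 1), analogous to np.random.uniform(-1, 1).
x_float = rng.uniform(-1, 1, size=shape)

# Casting to int8 truncates toward zero, so every sample becomes 0.
x_int8 = x_float.astype(np.int8)
all_zero_after_cast = not x_int8.any()

# Alternative: sample integers over the full int8 range (high is exclusive).
x_rand_int8 = rng.integers(-128, 128, size=shape, dtype=np.int8)
has_nonzero = bool(x_rand_int8.any())

print(all_zero_after_cast, has_nonzero)
```

   With all-zero inputs, any convolution result is zero regardless of layout or 
kernel, so a reference comparison cannot tell a correct kernel from a broken 
one.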
   
   But with this patch, we can at least compile quantized models that use 
`qnn.dense`, such as int8 BERT. 



