masahi commented on pull request #9898: URL: https://github.com/apache/tvm/pull/9898#issuecomment-1013506772
ping @Laurawly @Hzfengsy @comaniac what do you think? My conclusion is that cuDNN on NHWC + int8 is broken. Our test case at https://github.com/apache/tvm/blob/70de68a38f6738255cc71fd9a38964b23f2c5f55/tests/python/contrib/test_cudnn.py#L107 is also broken since it initializes input tensors with `np.random.uniform(-1, 1)` and all outputs are zero. But with this patch, we can at least compile quantized models having `qnn.dense` such as int8 BERT. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
