elvin-n commented on pull request #10239:
URL: https://github.com/apache/tvm/pull/10239#issuecomment-1041485446


   @mbrookhart 
   > agreed that the QAT version of the pass (which you're adding here) should 
probably be a separate pass from the explicit fake quantized model ala tflite 
and TensorRT
   
   As I see we did exactly what we agreed - the transformation itself is 
independent from the current one in opposite to the previous my PR 
(FakeQuantizationRewriter and OptionalFakeQuantizationRewriter). Then we 
agreed, if I am not mistaken, to have the only pass to simplify user's life. 
User will be aware about only one function - FakeQuantizeToInteger that he have 
to call in his python code. On the other hand inside the pass, there will be 
two transformations - current one and new one. If we want to have QAT 
transformation optional, ok, let's add parameter to the FakeQuantizeToInteger 
with default value not to call QAT


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to