Re: [dmlc/tvm] [RFC][Quantization] Support quantized models from TensorflowLite (#2351)

Tianqi Chen Sun, 30 Jun 2019 17:39:04 -0700

re "we might have to create a long sequence of existing Relay ops to 
approximate the FP32 computation".


This is certainly a problem for traditional frameworks, but won't be a problem 
for tvm/relay. Because we has automatic fusion and code generation, the long 
sequence of ops will be fused again into a single fused op. We can generate 
code as efficient, sometimes even more efficient(because we can fuse different 
ops together). So I will always recommend breaking things down to primitive ops 
if possible.
 



-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/dmlc/tvm/issues/2351#issuecomment-507081221

Re: [dmlc/tvm] [RFC][Quantization] Support quantized models from TensorflowLite (#2351)

Reply via email to