I may be misunderstanding something here, but is the thresholding applied to the output? My understanding was that it's usually applied to the weights during quantization.
[ Full content available at: https://github.com/apache/incubator-mxnet/pull/12530 ]
