[GitHub] [tvm] jikechao opened a new issue #7678: [BUG] The accuracy of the compiled model declines.

GitBox Tue, 16 Mar 2021 20:54:13 -0700


jikechao opened a new issue #7678:
URL: https://github.com/apache/tvm/issues/7678



   When the model xception-imagenet_origin.onnx was compiled, the accuracy of 
the compiled model decreases, from 0.78 to 0.77.
   
   12 pictures have a inconsistent prediction results during 300 pictures. 
   The figure is shown below in the form of : **prediction top_1_class(top_1 
class probability)**
   
![image](https://user-images.githubusercontent.com/29506758/111411012-c8081280-8714-11eb-8820-19245adcffef.png)
   
   Besides, for the consistent results of the prediction, the probability value 
of the prediction also changes. I compared the prediction results of model 
before and after compilation using the statement 
`np.testing.assert_allclose(res_onnx, res_tvm, rtol=1e-3, atol=1e-3) `, and the 
outputs are as follows.
   
![image](https://user-images.githubusercontent.com/29506758/111411061-dd7d3c80-8714-11eb-8b34-1458560ad9a8.png)
   
   **Notice** 
   This bug is not same with the issue 
https://github.com/apache/tvm/issues/7563,  because when I change `opt_level` 
or `batch_size` this bug still exists.
   
   **Environment**
   TVM : 0.8.dev0
   ONNX: 1.8.1
   OS: Ubuntu 16.04
   
   **The reproducible script**
   ```
   # -*- encoding=utf-8 -*-
   import keras
   import tvm
   import tvm.relay as relay
   import numpy as np
   from PIL import Image
   from tvm.contrib import graph_runtime
   import onnx
   import onnxruntime as rt
   
   target = 'llvm -mcpu=core-avx2'
   ctx = tvm.cpu(0)
   
   
   def image_resize(x, shape):
       x_return = []
       for x_test in x:
           tmp = np.copy(x_test)
           img = Image.fromarray(tmp.astype('uint8')).convert('RGB')
           img = img.resize(shape, Image.ANTIALIAS)
           x_return.append(np.array(img))
       return np.array(x_return)
   
   
   def getTestData():
       data = np.load("/share_container/data/dataset/imagenet-val-10.npz")
       x, y = data['x_test'], data['y_test']
       input_shape = (299, 299)
       input_precessor = keras.applications.xception.preprocess_input
       x_resize = image_resize(np.copy(x), input_shape)
       x_test = input_precessor(x_resize)
       y_test = keras.utils.to_categorical(y, num_classes=1000)
       return x_test, y_test
   
   
   def compare(model_name):
       batch_size = 10  # test_pic number
       x_test, y_test = getTestData()  # get test images
       x_test = x_test[:batch_size]
       predict_model = onnx.load(model_name)
   
       sess = rt.InferenceSession(model_name)
       input_name = sess.get_inputs()[0].name
       label_name = sess.get_outputs()[0].name
       res_onnx = sess.run([label_name], {input_name: 
x_test.astype(np.float32)})[0]
   
       # ############################ load keras model to relay IRModule  
##################################
       input_shape = (batch_size, 299, 299, 3)
       output_shape = (batch_size, 1000)
   
       shape_dict = {input_name: input_shape}
   
       irmod, params = relay.frontend.from_onnx(predict_model, shape_dict, 
freeze_params=True)
       irmod = relay.transform.DynamicToStatic()(irmod)
   
       # ########################## Compile the RelayIR 
####################################################
       with tvm.transform.PassContext(opt_level=3):
           graph, lib, params = relay.build_module.build(
               irmod, target=target, params=params)
   
       module = graph_runtime.create(graph, lib, ctx)
   
       data = x_test.astype('float32')
       module.set_input(input_name, data)
       module.set_input(**params)
       module.run()
   
       res_tvm = module.get_output(0, tvm.nd.empty(output_shape)).asnumpy()
       # ###########################  calc tvm accuracy   
###############################
       print("consistent prediction result:")
       print("onnx prediction   TVM prediction)
       for i in range(batch_size):
           ori_class = np.argmax(res_onnx[i])
           compile_class = np.argmax(res_tvm[i])
           if ori_class != compile_class:
               print( str(ori_class) + "(" + str(np.max(res_onnx[i])) + ")\t"
                     + str(compile_class) + "(" + str(np.max(res_tvm[i])) + 
")\t " )
   
       np.testing.assert_allclose(res_onnx, res_tvm, rtol=1e-3, atol=1e-3)
   
   
   if __name__ == '__main__':
       model_name = "xception-imagenet_origin.onnx"
       compare(model_name)
   ```
   
   Anyone can access the model and a small dataset(10 pictures) with this link:
   
https://drive.google.com/drive/folders/1YpO_61A5Z0hycyF1Z8tjxhJj8FIgT3DB?usp=sharing
   
   More details : 
https://discuss.tvm.apache.org/t/the-accuracy-of-the-compiled-model-declines-is-it-a-bug/9434
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [tvm] jikechao opened a new issue #7678: [BUG] The accuracy of the compiled model declines.

Reply via email to