giuseros opened a new pull request #6907:
URL: https://github.com/apache/incubator-tvm/pull/6907


   - I refactored the assembly in arm_cpu/tensor_intrin.py to use LLVM+TIR
   - Removed the `interleave` boolean parameter in the intrinsic to switch
   among two different interleaving modes. LLVM will now take care of
   interleaving the instructions
   - Applied the changes accordingly to conv2d_gemm.py to call the right
   instrinsic
   
   Note: I found LLVM very sensible to the choice of the `-mcpu`.
   So, in order to preserve performance, it is important to specify the
   right `-mcpu` when creating the LLVM target


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to