[GitHub] [incubator-tvm] giuseros opened a new pull request #6907: AArch64 base algorithm refactoring in LLVM

GitBox Thu, 12 Nov 2020 11:23:23 -0800


giuseros opened a new pull request #6907:
URL: https://github.com/apache/incubator-tvm/pull/6907



   - Refactored the assembly in arm_cpu/tensor_intrin.py to use LLVM+TIR
   - Removed the `interleave` boolean parameter of the intrinsic, which
   switched between two different interleaving modes. LLVM now takes care
   of interleaving the instructions
   - Updated conv2d_gemm.py accordingly so that it calls the right
   intrinsic
   
   Note: I found LLVM to be very sensitive to the choice of `-mcpu`.
   So, in order to preserve performance, it is important to specify the
   right `-mcpu` when creating the LLVM target
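
As a minimal sketch of the note above, the helper below builds an LLVM target string that always carries an explicit `-mcpu`. The helper name `make_aarch64_target`, the `cortex-a76` CPU, the `+neon` attribute and the `aarch64-linux-gnu` triple are illustrative assumptions, not values taken from this pull request; substitute the CPU of the actual device.

```python
def make_aarch64_target(mcpu, mattr=("+neon",), mtriple="aarch64-linux-gnu"):
    """Build an LLVM target string for AArch64 with an explicit -mcpu.

    Hypothetical helper for illustration; it is not part of TVM.
    """
    if not mcpu:
        # Refuse to build a target without -mcpu, since LLVM's instruction
        # scheduling depends on it.
        raise ValueError("an explicit -mcpu value is required")
    parts = ["llvm", f"-mtriple={mtriple}", f"-mcpu={mcpu}"]
    if mattr:
        parts.append("-mattr=" + ",".join(mattr))
    return " ".join(parts)


# Assumed example CPU; pick the one that matches the device being targeted.
target_str = make_aarch64_target("cortex-a76")
print(target_str)
```

The resulting string can then be passed wherever TVM expects a target, for example as the `target` argument of `relay.build`; the exact entry point for constructing a target object depends on the TVM version in use.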


