manupa-arm commented on a change in pull request #11:
URL: https://github.com/apache/tvm-rfcs/pull/11#discussion_r673856838
##########
File path: rfcs/0011_Arm_Ethos-U_Integration.md
##########
@@ -0,0 +1,233 @@
+ Feature Name: Arm® Ethos™-U Integration
+ Start Date: 2020 May
+ RFC PR: https://github.com/apache/tvm-rfcs/pull/11
+ GitHub Issue: https://github.com/apache/tvm/issues/8482
+
+# Motivation
+
+Arm® Ethos™-U is a series of NPUs that will enable low-cost and highly efficient AI solutions for a wide range of embedded devices. This RFC introduces the port of Ethos-U into the uTVM compilation flow. The process of compilation relies on the multiple levels of abstraction in TVM and a variety of analysis and optimisation passes to produce c output. In the process of compilation, we rely on the many levels of TVM's IR (and the passes) to perform optimizations to create c-sources that can work with current microTVM deployments.
+
+## Scope:
+
+### Ethos™-U55
+
+
+
+Ethos™-U55 is a NPU that is designed to uplift ML performance by working as an offload target for micro-controllers. It can accelerate quantized ML operators such as Convolution2D, Depthwise Convolution, Pooling and Elementwise Operators. For convolution-type operators, Ethos-U55 supports hardware enabled lossless de-compression of weights to increase inference performance and reduce power.
+
+The scope for this RFC is to add support for offloading to the Arm Ethos-U55 NPU. The initial machine learning framework that we use for testing this is TensorFlow Lite. Future RFCs and pull requests will address additional NPUs, such as the Ethos-U65, and other frameworks as the port evolves.
+
+Please refer to Technical Reference Manual (TRM) for more details – https://developer.arm.com/documentation/102420/0200.
+* Reference : https://www.arm.com/products/silicon-ip-cpu/ethos/ethos-u55
+
+# Guide-level explanation
+
+## TVMC User Interface
+```
+tvmc compile my_model.tflite
+--executor=aot
+--output-format=mlf
+--target="ethos-u --accelerator-config=ethos-u55-xxx",c" ---> Model Library Format
+
+# where xxx could be out of possible configuration of the accelerator that can take values : [32, 64, 128, 256]
+```
+
+The users should be able to use the above command to compile to ethos-u55 that would generate Model Library Format(MLF) output.

Review comment:
Ack
