Aristide021 commented on PR #18201:
URL: https://github.com/apache/tvm/pull/18201#issuecomment-3215308112

   > @Aristide021 Thanks for the PR! A couple of points and questions:
   > 
   > 1. Status of GNA vs NPU
   >    
   >    * The upstream GNA repo is archived and marked as not under active 
management. The OpenVINO docs also note that GNA is being discontinued and 
recommend using Intel's NPU as the low-power offload path on newer processors. 
Given that, would it make sense to position this backend as a stepping stone 
toward NPU (and/or clarify the long-term maintenance plan in the README/code 
comments)?
   >    * https://github.com/intel/gna
   >    * 
https://docs.openvino.ai/2023.3/openvino_docs_OV_UG_supported_plugins_GNA.html
   > 2. CI & Software Emulation Mode
   >    
   >    * According to the OpenVINO docs, GNA plugin supports Software 
Emulation Mode (CPU fallback) when GNA HW isn't present. If we enable that in 
tests, we could run E2E coverage in our CI.
   > 
   > I also think this backend can serve as a very good example for codegen in 
Relax. It shows a clean and minimal pattern: partitioning with basic ops, 
handing off to JSON, and keeping the implementation relatively lightweight. 
Adding a short HOWTO or developer note ("Writing a minimal Relax backend") that 
references this code could be very helpful for the community.
   > 
   > cc @tqchen @Hzfengsy @cbalint13
   
   Thanks for the review and the excellent points! You're correct about GNA 
being archived. I designed this backend as a stepping stone toward NPU support 
with OpenVINO runtime integration in mind. The JSON serialization approach 
should make the transition to Intel's current NPU path relatively 
straightforward.
   
   For the CI integration with Software Emulation Mode, I think that's a great 
suggestion. I can add CPU fallback support to enable E2E testing without 
requiring actual GNA hardware.
   
   I'd also be happy to add documentation, positioning this as a foundation for 
NPU backends, and include a developer guide if that would be helpful for the 
community.
   
   I'll go ahead and update the PR description to clarify the NPU migration 
path. My next step will be to add CPU emulation support for testing. Please let 
me know if you have any other suggestions.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to