manupa-arm commented on pull request #9164:
URL: https://github.com/apache/tvm/pull/9164#issuecomment-937658602


   > In 2), if we run legalization on partitioned functions, wouldn't that 
decompose all QNN ops?
   
   I think the constant folding pass is supposed to work in the IRModule (with 
the external function). Therefore, everything in the IRModule will be affected. 
However, we could create IRModules with what is in-scope for the transformation.
   
   > I needed to retain QNN ops all the way until I translated them to the 
external IR, so running legalization had never been my option. I did wish that 
we could selectively lower const-foldable QNN subgraphs only. Maybe I'm missing 
something.
   
   It is about further granularity one would to do further partitioning. Today, 
I think we need to do further partitioning to achieve this. However, whether we 
want to annotations to block constant folding seems like an interesting but an 
orthogonal conversation to this one.
   
   In the scope of changes in this PR, I feel it does the same thing (destroys 
QNN info in the process of constant folding). However, we could control what we 
want to pass into the Constant Folding Pass.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to