mbrookhart commented on pull request #7303:
URL: https://github.com/apache/tvm/pull/7303#issuecomment-763245797


   Yeah, scanning on the non-inner axis will have a cache locality performance 
hit, but I'm honestly not sure if that would be better or worse than the 
overhead from doing a pair of reshape/transpose ops. Reshape and transpose are 
heavily limited by memory bandwidth.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to