AndrewZhaoLuo opened a new pull request, #12935: URL: https://github.com/apache/tvm/pull/12935
This makes layer_norm relay op dispatch to new topi committed in PR #12864. Using TIR CSE elimination pass with FP16 layernorm also necessitates the handling of FP16 type when packing args for CUDA. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
