[PR] [CodeGen][CUDA] Move fast math intrinsic lowering option to PassContext [tvm]

via GitHub Sat, 23 May 2026 19:20:21 -0700


tlopex opened a new pull request, #19596:
URL: https://github.com/apache/tvm/pull/19596


   This updates CUDA fast math intrinsic lowering to use a PassContext option 
instead of a CUDA Target attribute.
   
   The new option is:
   
   ```python
   with tvm.transform.PassContext(config={"tirx.enable_fast_math": True}):
       ...
   ```
   
   When unset or false, CUDA math intrinsics continue to lower to the precise 
CUDA math functions such as expf. When true, tirx.LowerIntrin prioritizes the 
cuda.fastmath.* lowering rules, producing fast math intrinsics such as __expf.
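
The selection behaviour described above can be sketched in plain Python. This is a toy model, not TVM code: the `PRECISE_RULES` and `FAST_RULES` tables and the `lower_intrinsic` helper are hypothetical names introduced here for illustration, and falling back to the precise rule when no fast rule exists is an assumption read from the word "prioritizes".

```python
# Toy model only: these tables are hypothetical and are not TVM's real rule registry.
PRECISE_RULES = {"exp": "expf", "log": "logf", "sqrt": "sqrtf"}
FAST_RULES = {"exp": "__expf", "log": "__logf"}  # deliberately no fast entry for sqrt


def lower_intrinsic(op, config=None):
    """Return the CUDA function name this toy model picks for `op`."""
    config = config or {}
    # An unset option and an explicit False are treated the same: precise lowering.
    fast_math = bool(config.get("tirx.enable_fast_math", False))
    # With fast math on, the fast rule is tried first (assumed fallback otherwise).
    if fast_math and op in FAST_RULES:
        return FAST_RULES[op]
    return PRECISE_RULES[op]


print(lower_intrinsic("exp"))
print(lower_intrinsic("exp", {"tirx.enable_fast_math": True}))
```

In this sketch the first call selects expf and the second selects __expf, mirroring the unset and true cases in the description.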
   
   Changes:
   
   - Register tirx.enable_fast_math as a PassContext option.
   - Read the option in tirx.LowerIntrin and LowerIntrinStmt.
   - Remove enable_fast_math from CUDA Target attributes, target detection, and CUDA target tags.
   - Update CUDA fast math and target tag tests.


