tlopex opened a new pull request, #19596:
URL: https://github.com/apache/tvm/pull/19596
This updates CUDA fast math intrinsic lowering to use a PassContext option
instead of a CUDA Target attribute.
The new option is:
```python
with tvm.transform.PassContext(config={"tirx.enable_fast_math": True}):
...
```
When unset or false, CUDA math intrinsics continue to lower to the precise
CUDA math functions such as expf. When true, tirx.LowerIntrin prioritizes the
cuda.fastmath.* lowering rules, producing fast math intrinsics such as __expf.
Changes:
Register tirx.enable_fast_math as a PassContext option.
Read the option in tirx.LowerIntrin and LowerIntrinStmt.
Remove enable_fast_math from CUDA Target attributes, target detection, and
CUDA target tags.
Update CUDA fast math and target tag tests.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]