================
@@ -94,6 +94,12 @@ static cl::opt<bool> DisableFDivExpand(
cl::ReallyHidden,
cl::init(false));
+static cl::opt<bool> EnableFP32ReciprocalNewtonRaphson(
+ "enable-fp32-recip-newton-raphson", cl::Hidden, cl::init(false),
+ cl::desc("Use Newton-Raphson refinement for 1.0f/x when the denominator "
+ "is a normal float, falling back to the full division sequence "
+ "for denormals/inf/nan/zero."));
----------------
carlobertolli wrote:
Both TargetOption and FunctionAttribute are not accessible in
AMDGPUCodeGenPrepare in getAnalysisUsage.
I need that information at that point because it's one of the conditions under
which the control flow graph gets modified:
if (!ExpandDiv64InIR && !EnableFP32ReciprocalNewtonRaphson)
AU.setPreservesAll();
https://github.com/llvm/llvm-project/pull/194716
_______________________________________________
cfe-commits mailing list
[email protected]
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits