xiaoxuandev commented on code in PR #54676:
URL: https://github.com/apache/spark/pull/54676#discussion_r2935847230


##########
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/mathExpressions.scala:
##########
@@ -418,10 +418,28 @@ case class Cosh(child: Expression) extends 
UnaryMathExpression(math.cosh, "COSH"
   since = "3.0.0",
   group = "math_funcs")
 case class Acosh(child: Expression)
-  extends UnaryMathExpression((x: Double) => StrictMath.log(x + math.sqrt(x * 
x - 1.0)), "ACOSH") {
+  extends UnaryMathExpression((x: Double) => x match {
+    // in case of large values, the square would lead to Infinity; also, - 1 
would be ignored due
+    // to numeric precision. So log(x + sqrt(x * x - 1)) becomes log(2x) = 
log(2) + log(x) for
+    // positive values.
+    case x if x >= Math.sqrt(Double.MaxValue) =>

Review Comment:
   +1,  we should consider use the 2^28 threshold from fdlibm instead of 
sqrt(Double.MaxValue). When x > 2^28, sqrt(x^2 ± 1) equals |x| exactly in 
double precision, so the sqrt is unnecessary in that range. This aligns with 
the reference implementation used by glibc, musl, and OpenJDK's StrictMath.
   
   For acosh specifically, switching to 2^28 skips an unnecessary sqrt call for 
the entire [2^28, ~1.3e154] range with no precision loss, the results are 
bit-for-bit identical.
   
   If we agree on this, I can rebase my fdlibm change on top. 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to