Github user setjet commented on a diff in the pull request:
https://github.com/apache/spark/pull/18113#discussion_r153024799
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/aggregate/typedaggregators.scala
---
@@ -99,3 +94,91 @@ class TypedAverage[IN](val f: IN => Double) extends
Aggregator[IN, (Double, Long
toColumn.asInstanceOf[TypedColumn[IN, java.lang.Double]]
}
}
+
+class TypedMinDouble[IN](val f: IN => Double) extends Aggregator[IN,
Double, Double] {
+ override def zero: Double = Double.PositiveInfinity
+ override def reduce(b: Double, a: IN): Double = math.min(b, f(a))
+ override def merge(b1: Double, b2: Double): Double = math.min(b1, b2)
+ override def finish(reduction: Double): Double = {
+ if (Double.PositiveInfinity == reduction) {
--- End diff --
Ok makes sense. What about the return ```finish``` return type? Leaving
that as a java type would cause the ```this``` and ```toColumnJava``` to be
flipped, creating a ```toColumnScala``` instead.
What about:
```
override def finish(reduction: java.lang.Double): Double = reduction
```
As its on the finish, it shouldn't cause much performance overhead as its
not execution many times. It would also reduce complexity a bit.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]