Github user mgaido91 commented on a diff in the pull request:
https://github.com/apache/spark/pull/22032#discussion_r208599920
--- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/LSH.scala ---
@@ -97,7 +97,8 @@ private[ml] abstract class LSHModel[T <: LSHModel[T]]
override def transform(dataset: Dataset[_]): DataFrame = {
transformSchema(dataset.schema, logging = true)
- val transformUDF = udf(hashFunction, DataTypes.createArrayType(new
VectorUDT))
+ val transformUDF = udf({ v: Vector => hashFunction(v) },
--- End diff --
nit: why not `hashFunction _`?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]