zhengruifeng commented on a change in pull request #26803: [SPARK-30178][ML]
RobustScaler support large numFeatures
URL: https://github.com/apache/spark/pull/26803#discussion_r357004365
##########
File path: mllib/src/main/scala/org/apache/spark/ml/util/MetadataUtils.scala
##########
@@ -41,6 +41,24 @@ private[spark] object MetadataUtils {
}
}
+ /**
+ * Examine a schema to identify the number of features in a vector column.
+ * Returns None if the number of features is not specified.
+ */
+ def getNumFeatures(vectorSchema: StructField): Option[Int] = {
Review comment:
yes, there is still something wrong in that PR, I guess it will take me some
time to resolve it.
I will rebase that PR if this one is merged.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
With regards,
Apache Git Services
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]