Andres Perez created SPARK-2776:
-----------------------------------
Summary: Add normalizeByCol method to mllib.util.MLUtils
Key: SPARK-2776
URL: https://issues.apache.org/jira/browse/SPARK-2776
Project: Spark
Issue Type: New Feature
Reporter: Andres Perez
Priority: Minor
Add the ability to compute the mean and standard deviations of each vector
(LabeledPoint) component and normalize each vector in the RDD, using only RDD
transformations. The result is an RDD of Vectors where each column has a mean
of zero and standard deviation of one.
See https://github.com/apache/spark/pull/1698
--
This message was sent by Atlassian JIRA
(v6.2#6252)