Andres Perez created SPARK-2776:
-----------------------------------

             Summary: Add normalizeByCol method to mllib.util.MLUtils
                 Key: SPARK-2776
                 URL: https://issues.apache.org/jira/browse/SPARK-2776
             Project: Spark
          Issue Type: New Feature
            Reporter: Andres Perez
            Priority: Minor


Add the ability to compute the mean and standard deviations of each vector 
(LabeledPoint) component and normalize each vector in the RDD, using only RDD 
transformations. The result is an RDD of Vectors where each column has a mean 
of zero and standard deviation of one.

See https://github.com/apache/spark/pull/1698



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to