Xiangrui Meng created SPARK-5895:
------------------------------------

             Summary: Add VectorSlicer
                 Key: SPARK-5895
                 URL: https://issues.apache.org/jira/browse/SPARK-5895
             Project: Spark
          Issue Type: Sub-task
          Components: ML
            Reporter: Xiangrui Meng


`VectorSlicer` takes a vector column and output a vector column with a subset 
of features.

{code}
val vs = new VectorSlicer()
  .setInputCol("user")
  .setSelectedFeatures("age", "salary")
  .setOutputCol("usefulUserFeatures")
{code}

We should allow specifying selected features by indices and by names. It should 
preserve the output names.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to