[ https://issues.apache.org/jira/browse/SPARK-7290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16014039#comment-16014039 ]
Nick Pentreath commented on SPARK-7290: --------------------------------------- Is this still desired? Seems it perhaps doesn't add that much over existing functionality. We should perhaps rather focus on things like adding multi-col support for {{StringIndexer}} and {{OneHotEncoder}} to enhance efficiency. > Add StringVectorizer > -------------------- > > Key: SPARK-7290 > URL: https://issues.apache.org/jira/browse/SPARK-7290 > Project: Spark > Issue Type: New Feature > Components: ML > Reporter: Xiangrui Meng > > `StringVectorizer` maps a String column into a vector column with binary > values. The output should be the same as chaining `StringIndexer` and > `OneHotEncoder`. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org