[ https://issues.apache.org/jira/browse/SPARK-11215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16591667#comment-16591667 ]
Barry Becker commented on SPARK-11215:
--------------------------------------

Is the main motivation for this feature performance? Can you give a rough estimate of how much performance might improve when this feature is applied to a few hundred string-valued columns?

> Add multiple columns support to StringIndexer
> ---------------------------------------------
>
>                 Key: SPARK-11215
>                 URL: https://issues.apache.org/jira/browse/SPARK-11215
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>            Reporter: Yanbo Liang
>            Assignee: Yanbo Liang
>            Priority: Major
>
> Add multiple columns support to StringIndexer, then users can transform
> multiple input columns to multiple output columns simultaneously. See
> discussion SPARK-8418.