[ 
https://issues.apache.org/jira/browse/SPARK-14623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15243621#comment-15243621
 ] 

Joseph K. Bradley commented on SPARK-14623:
-------------------------------------------

[~hujiayin] Thanks for this.  However, this looks like it duplicates the 
functionality of StringIndexer + OneHotEncoder.  How is this different, other 
than combining them into one class?

> add label binarizer 
> --------------------
>
>                 Key: SPARK-14623
>                 URL: https://issues.apache.org/jira/browse/SPARK-14623
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>            Reporter: hujiayin
>            Priority: Minor
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> It relates to https://issues.apache.org/jira/browse/SPARK-7445
> Map each label to a 0/1 indicator vector, one column per distinct label.
> For example,
> Input:
> "yellow,green,red,green,0"
> The labels: "0, green, red, yellow"
> Output:
> 0, 0, 0, 1
> 0, 1, 0, 0
> 0, 0, 1, 0
> 0, 1, 0, 0
> 1, 0, 0, 0
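
For reference, the mapping in the quoted example can be reproduced with a minimal plain-Python sketch. This is not Spark code and not the proposed implementation: `binarize_labels` is a hypothetical name, and sorting the distinct labels lexicographically is an assumption inferred from the label order shown in the example (StringIndexer orders labels by frequency by default, so its column order could differ). The two steps loosely mirror indexing followed by one-hot encoding.

```python
def binarize_labels(values):
    """One-hot encode string values against their sorted distinct labels."""
    # Step 1 (index): assign each distinct label a column position.
    # Assumption: labels are ordered lexicographically, as in the example.
    labels = sorted(set(values))
    index = {label: i for i, label in enumerate(labels)}

    # Step 2 (encode): one row per input value, with a single 1 in the
    # column that belongs to that value's label.
    rows = [
        [1 if index[value] == col else 0 for col in range(len(labels))]
        for value in values
    ]
    return labels, rows


labels, rows = binarize_labels("yellow,green,red,green,0".split(","))
print(labels)
for row in rows:
    print(", ".join(str(cell) for cell in row))
```

With the input from the issue, the rows come out in input order, one per value, against the labels "0, green, red, yellow".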



