[
https://issues.apache.org/jira/browse/SPARK-14623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15243621#comment-15243621
]
Joseph K. Bradley commented on SPARK-14623:
-------------------------------------------
[~hujiayin] Thanks for this. However, this looks like it duplicates the
functionality of StringIndexer + OneHotEncoder. How is this different, other
than putting them into 1 class?
> add label binarizer
> --------------------
>
> Key: SPARK-14623
> URL: https://issues.apache.org/jira/browse/SPARK-14623
> Project: Spark
> Issue Type: Improvement
> Components: ML
> Reporter: hujiayin
> Priority: Minor
> Original Estimate: 1h
> Remaining Estimate: 1h
>
> It relates to https://issues.apache.org/jira/browse/SPARK-7445
> Map the labels to 0/1.
> For example,
> Input:
> "yellow,green,red,green,0"
> The labels: "0, green, red, yellow"
> Output:
> 0, 0, 0, 1
> 0, 1, 0, 0
> 0, 0, 1, 0
> 0, 1, 0, 0
> 1, 0 ,0, 0
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]