[
https://issues.apache.org/jira/browse/SPARK-11349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-11349:
----------------------------------
Assignee: Yanbo Liang
> Support transform string label for RFormula
> -------------------------------------------
>
> Key: SPARK-11349
> URL: https://issues.apache.org/jira/browse/SPARK-11349
> Project: Spark
> Issue Type: Sub-task
> Components: ML
> Reporter: Yanbo Liang
> Assignee: Yanbo Liang
>
> Currently RFormula can only handle label with NumericType or BinaryType (cast
> it to DoubleType as the label of Linear Regression training), we should also
> support label of StringType which is needed for Logistic Regression (glm with
> family = "binomial").
> For label of StringType, we should use StringIndexer to transform it to
> 0-based index.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]