[ 
https://issues.apache.org/jira/browse/SPARK-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14077994#comment-14077994
 ] 

Xiangrui Meng commented on SPARK-2206:
--------------------------------------

[~manishamde] I'm thinking about adding a Dataset class that carries metadata 
(DatasetInfo). Given a classification dataset, we could then either infer the 
number of classes automatically or, if the user supplies it, validate it 
against the data. This approach is general, not specific to decision trees. I 
changed the target version to 1.2.0 and will ping you after we add the Dataset 
class.
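
To make the infer-or-validate behavior concrete, here is a minimal sketch in 
plain Python. It is not the proposed Dataset/DatasetInfo API and does not use 
Spark; the function name resolve_num_classes and its parameters are 
hypothetical, and it assumes labels are encoded as 0, 1, ..., k-1 (possibly 
stored as floats).

```python
from typing import Iterable, Optional


def resolve_num_classes(labels: Iterable[float],
                        num_classes: Optional[int] = None) -> int:
    """Infer the class count from labels, or validate a user-supplied count.

    Hypothetical helper for illustration only.
    Assumption: labels are encoded as 0, 1, ..., k-1.
    """
    distinct = set()
    for label in labels:
        # Reject anything that is not a non-negative whole number.
        if label < 0 or label != int(label):
            raise ValueError(f"label {label!r} is not a non-negative integer")
        distinct.add(int(label))
    if not distinct:
        raise ValueError("cannot infer the number of classes from an empty dataset")

    # With 0-based labels, the largest label implies at least this many classes.
    inferred = max(distinct) + 1
    if num_classes is None:
        return inferred
    if num_classes < inferred:
        raise ValueError(
            f"num_classes={num_classes} is too small: "
            f"the data contains label {inferred - 1}"
        )
    # A larger user-supplied value is accepted, since some classes may simply
    # be absent from this sample.
    return num_classes
```

Calling resolve_num_classes(labels) infers the count, while 
resolve_num_classes(labels, num_classes=k) only checks that k is large enough 
for the labels actually present. Accepting a larger k than observed is a 
design choice in this sketch, not something the issue specifies.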

> Automatically infer the number of classification classes in multiclass 
> classification
> -------------------------------------------------------------------------------------
>
>                 Key: SPARK-2206
>                 URL: https://issues.apache.org/jira/browse/SPARK-2206
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>    Affects Versions: 1.0.0
>            Reporter: Manish Amde
>            Assignee: Manish Amde
>
> Currently, the user needs to specify the numClassesForClassification 
> parameter explicitly during multiclass classification for decision trees. 
> This feature will automatically infer this information (and possibly class 
> histograms) from the training data.


