[ https://issues.apache.org/jira/browse/SPARK-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14077994#comment-14077994 ]
Xiangrui Meng commented on SPARK-2206:
--------------------------------------

[~manishamde] I'm thinking about adding a Dataset class with Metadata (DatasetInfo). Given a classification dataset, we can either infer the number of classes automatically or validate the number of classes if it is given by the user. This is a general method, not specific to decision trees. I changed the target version to 1.2.0 and I will ping you after we add the Dataset class.

> Automatically infer the number of classification classes in multiclass classification
> -------------------------------------------------------------------------------------
>
>                 Key: SPARK-2206
>                 URL: https://issues.apache.org/jira/browse/SPARK-2206
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>    Affects Versions: 1.0.0
>            Reporter: Manish Amde
>            Assignee: Manish Amde
>
> Currently, the user needs to specify the numClassesForClassification parameter explicitly during multiclass classification for decision trees. This feature will automatically infer this information (and possibly class histograms) from the training data.

--
This message was sent by Atlassian JIRA
(v6.2#6252)
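
For illustration only, the infer-or-validate behavior discussed in the comment could look roughly like the sketch below. It is plain Python, not Spark code and not the proposed Dataset/DatasetInfo API: the function name `infer_num_classes` and its return shape are hypothetical. It assumes labels are non-negative whole numbers in {0, ..., numClasses - 1}, so the inferred count is the largest label plus one, and it also builds the class histogram mentioned in the issue description.

```python
from collections import Counter
from typing import Dict, Iterable, Optional, Tuple


def infer_num_classes(
    labels: Iterable[float],
    num_classes: Optional[int] = None,
) -> Tuple[int, Dict[int, int]]:
    """Hypothetical helper: infer or validate the number of classes.

    Assumes labels are non-negative whole numbers (0.0, 1.0, ...).
    Returns (number of classes, histogram of label -> count).
    """
    histogram: Counter = Counter()
    for label in labels:
        # Reject labels that are not whole numbers or are negative.
        if not float(label).is_integer() or label < 0:
            raise ValueError(f"invalid class label: {label!r}")
        histogram[int(label)] += 1

    if not histogram:
        raise ValueError("cannot infer the number of classes from an empty dataset")

    # Assumption: classes are indexed from 0, so the count is max label + 1.
    inferred = max(histogram) + 1

    if num_classes is None:
        # Nothing supplied by the user: use the inferred value.
        return inferred, dict(histogram)

    # A user-supplied value must be large enough to cover every observed label.
    if inferred > num_classes:
        raise ValueError(
            f"labels imply at least {inferred} classes, but num_classes={num_classes}"
        )
    return num_classes, dict(histogram)
```

Calling `infer_num_classes(labels)` infers the count from the data, while `infer_num_classes(labels, num_classes=k)` only checks that the supplied `k` is consistent with the observed labels. A user-supplied value larger than the inferred one is accepted here, on the assumption that some classes may simply be absent from the training data.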