[ 
https://issues.apache.org/jira/browse/SPARK-12773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen resolved SPARK-12773.
-------------------------------
          Resolution: Invalid
    Target Version/s:   (was: 1.5.2)

Please ask questions at [email protected]

> Impurity and Sample details for each node of a decision tree
> ------------------------------------------------------------
>
>                 Key: SPARK-12773
>                 URL: https://issues.apache.org/jira/browse/SPARK-12773
>             Project: Spark
>          Issue Type: Question
>          Components: ML, MLlib
>    Affects Versions: 1.5.2
>            Reporter: Rahul Tanwani
>
> I want to understand whether each node in the decision tree calculates or 
> stores the number of samples that satisfy the split criteria. 
> Looking at the code, I found some information about the impurity statistics 
> but nothing about the sample counts, whereas scikit-learn exposes both of 
> these metrics. This information may help in cases where multiple 
> decision rules (multiple leaf nodes) yield the same prediction and we want 
> to make relative comparisons of the decision paths.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
