[
https://issues.apache.org/jira/browse/SPARK-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492892#comment-14492892
]
Joseph K. Bradley commented on SPARK-6884:
------------------------------------------
Is this not a duplicate of [SPARK-3727]? Perhaps the best way to split up the
work will be to make a subtask for trees, and a separate subtask for ensembles.
I'll go ahead and do that.
> random forest predict probabilities functionality (like in sklearn)
> -------------------------------------------------------------------
>
> Key: SPARK-6884
> URL: https://issues.apache.org/jira/browse/SPARK-6884
> Project: Spark
> Issue Type: New Feature
> Components: MLlib
> Affects Versions: 1.3.0
> Environment: cross-platform
> Reporter: Max Kaznady
> Labels: prediction, probability, randomforest, tree
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> Currently, there is no way to extract the class probabilities from the
> RandomForest classifier. I implemented a probability predictor by counting
> votes from individual trees and adding up their votes for "1" and then
> dividing by the total number of votes.
> I opened this ticked to keep track of changes. Will update once I push my
> code to master.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]