Julian King created SPARK-23730:
-----------------------------------

             Summary: Save and expose "in bag" tracking for random forest model
                 Key: SPARK-23730
                 URL: https://issues.apache.org/jira/browse/SPARK-23730
             Project: Spark
          Issue Type: Improvement
          Components: ML
    Affects Versions: 2.3.0
            Reporter: Julian King


In a random forest model, it is often useful to keep track of which samples 
ended up in each bootstrap replicate (and how many times each one was drawn). 
For instance, the R randomForest package provides this through the option 
keep.inbag=TRUE.

Similar functionality in Spark ML's random forest would be helpful.
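
To make the request concrete, here is a minimal sketch in plain Python of the kind of "in bag" record meant above. It is not Spark code and does not reflect how Spark ML samples internally; the helper name inbag_counts and its parameters are hypothetical, chosen only for illustration. For each tree it draws a bootstrap replicate and records how many times each sample index was drawn.

```python
import random
from collections import Counter


def inbag_counts(n_samples, n_trees, seed=0):
    """Hypothetical helper: one row per tree, row[i] = times sample i was drawn."""
    rng = random.Random(seed)
    table = []
    for _ in range(n_trees):
        # Bootstrap replicate: draw n_samples indices with replacement.
        draws = Counter(rng.randrange(n_samples) for _ in range(n_samples))
        table.append([draws.get(i, 0) for i in range(n_samples)])
    return table


counts = inbag_counts(n_samples=6, n_trees=3)
for tree_id, row in enumerate(counts):
    # Samples with a count of zero are "out of bag" for this tree.
    out_of_bag = [i for i, c in enumerate(row) if c == 0]
    print(f"tree {tree_id}: in-bag counts {row}, out-of-bag {out_of_bag}")
```

With such a table saved alongside the model, a user could tell which trees never saw a given sample, which is what out-of-bag error estimates rely on.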



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
