Each tree is built using a random sample with replacement from the provided 
training data. The data not in the sample is used to calculate the out-of-bag 
score. The “bag” is the sampled data.

The “random” refers to several features of the algorithm, including random 
sampling of features

So for each tree
                Get a random sample of the training data
                For I to n_estimators:
                                Build a tree – this involves a random sample of 
features and thresholds for each feature in the sample at each node.
                                Use the rest of the training data, not in the 
sample, to calculate the out-of-bag score

Random Forest already incorporates “random features”.

https://github.com/glouppe/phd-thesis

__________________________________________________________________________________________
Dale Smith | Macy's Systems and Technology | IFS eCommerce | Data Science
770-658-5176 | 5985 State Bridge Road, Johns Creek, GA 30097 | 
[email protected]

From: scikit-learn 
[mailto:[email protected]] On Behalf Of ??
Sent: Tuesday, September 13, 2016 4:16 AM
To: [email protected]
Subject: [scikit-learn] is RandomForest random samples or random features?

⚠ EXT MSG:
I have read the Guide of sklearn's RandomForest :

"""
In random forests (see 
RandomForestClassifier<http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html#sklearn.ensemble.RandomForestClassifier>
 and 
RandomForestRegressor<http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestRegressor.html#sklearn.ensemble.RandomForestRegressor>
 classes), each tree in the ensemble is built from a sample drawn with 
replacement (i.e., a bootstrap sample) from the training set.
"""
But I prefer RandomForest as :
"""
features ("attributes", "predictors", "independent variables") are randomly 
sampled
"""
is RandomForest random samples or random features? where can I find a features 
random version of RandomForest?
thx.
* This is an EXTERNAL EMAIL. Stop and think before clicking a link or opening 
attachments.
_______________________________________________
scikit-learn mailing list
[email protected]
https://mail.python.org/mailman/listinfo/scikit-learn

Reply via email to