This is explained here

http://scikit-learn.org/stable/modules/ensemble.html#random-forests:

"In addition, when splitting a node during the construction of the tree, the 
split that is chosen is no longer the best split among all features. Instead, 
the split that is picked is the best split among a random subset of the 
features."

and the "best split" (in the decision trees) among the random feature subset is 
based on maximizing information gain or equivalently minimizing child node 
impurity as described here: 
http://scikit-learn.org/stable/modules/tree.html#mathematical-formulation



Looking at this, I have a question though ...

In the docs 
(http://scikit-learn.org/stable/modules/tree.html#mathematical-formulation) it 
says

"Select the parameters that minimises the impurity"

and

"Recurse for subsets Q_left and Q_right until the maximum allowable depth is 
reached"

So but this is basically not the whole definition, right? There should be 
condition that if the weighted average of the child node impurities for any 
given feature is not smaller than the parent node impurity, the tree growing 
algorithm would terminate, right?

Best,
Sebastian

> On Oct 2, 2018, at 10:49 AM, Guillaume Lemaître <g.lemaitr...@gmail.com> 
> wrote:
> 
> In Random Forest, the best split for each feature is selected. The
> Extra Randomized Trees will make a random split instead.
> On Tue, 2 Oct 2018 at 17:43, Michael Reupold
> <michael_reup...@trimble.com> wrote:
>> 
>> Hello all,
>> I currently struggle to find information what or which specific split 
>> Methods are used on the RandomForestClassifier. Is it a random selection? A 
>> median? The best of a set of methods?
>> 
>> Kind regards
>> 
>> Michael Reupold
>> 
>> _______________________________________________
>> scikit-learn mailing list
>> scikit-learn@python.org
>> https://mail.python.org/mailman/listinfo/scikit-learn
> 
> 
> 
> -- 
> Guillaume Lemaitre
> INRIA Saclay - Parietal team
> Center for Data Science Paris-Saclay
> https://glemaitre.github.io/
> _______________________________________________
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn

_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

Reply via email to