Re: [Scikit-learn-general] algorithm used to train the tree with option 'best'

2014-09-12 Thread Gilles Louppe
Hi Luca, The best strategy consists in finding the best threshold, that is the one that maximizes impurity decrease, when trying to partition a node into a left and right nodes. By contrast, random does not look for the best split and simply draw the discretization threshold at random. For

Re: [Scikit-learn-general] algorithm used to train the tree with option 'best'

2014-09-12 Thread Gilles Louppe
Yes, exactly. Le 12 sept. 2014 18:31, Luca Puggini lucapug...@gmail.com a écrit : Hey thanks a lot, so basically in random Forest the split is done like in the algorithm described in your thesis except that the search is not done on all the variables but only on a random subset of them?