Re: [Scikit-learn-general] random forests - number of samples

2015-03-11 Thread Andreas Mueller
By default bootstrap=True, so each tree is trained on a bootstrap sample. That means the number of samples drawn is the same as the original number of samples, but because the draws are made with replacement, only about 2/3 (roughly 63%) of the distinct data points end up in the sample; the remaining draws are duplicates. For efficiency, the samples are actually represented using sample weights.
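To make the numbers concrete, here is a minimal sketch of a bootstrap draw using only the Python standard library. It is an illustration of the idea, not scikit-learn's internal code; the function name bootstrap_counts and the choice of 10,000 samples are assumptions made for this example. The per-sample counts play the role of the sample weights mentioned above: a weight of 0 means the point was never drawn for this tree.

```python
import random
from collections import Counter


def bootstrap_counts(n_samples, seed=0):
    """Draw n_samples indices with replacement and count how often each appears.

    Hypothetical helper for illustration; not part of scikit-learn.
    """
    rng = random.Random(seed)
    drawn = [rng.randrange(n_samples) for _ in range(n_samples)]
    # Each count acts like a per-sample weight; indices never drawn are absent (weight 0).
    return Counter(drawn)


n_samples = 10_000
counts = bootstrap_counts(n_samples)

total_drawn = sum(counts.values())          # equals n_samples by construction
unique_fraction = len(counts) / n_samples   # share of distinct points actually used

print("samples drawn:", total_drawn)
print("fraction of distinct points used:", round(unique_fraction, 3))
```

The expected fraction of distinct points is 1 - (1 - 1/n)^n, which approaches 1 - 1/e (about 0.632) as n grows, so the printed fraction should land close to that value.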

[Scikit-learn-general] random forests - number of samples

2015-03-11 Thread Luca Puggini
this can help

> Message: 4
> Date: Wed, 11 Mar 2015 15:22:29 +
> From: "Pagliari, Roberto"
> Subject: [Scikit-learn-general] random forests - number of samples
> To: "scikit-learn-general@lists.sourceforge.net"
> Message-ID: < >

[Scikit-learn-general] random forests - number of samples

2015-03-11 Thread Pagliari, Roberto
How many samples does a single tree of a random forest use? Or does it use all samples?