Hi Peter. I only skimmed your mail, but I understood you said that the problem is the use of a boolean mask. Wouldn't it be possible to do the subsampling explicitly before training the tree if the sample_fraction is low? Or is the complexity of applying the sample mask higher than training the tree?
Also: would it be possible to speed this up using the recently introduced sample weights? That helped for the random forests, right? Best, Andy ------------------------------------------------------------------------------ Master Visual Studio, SharePoint, SQL, ASP.NET, C# 2012, HTML5, CSS, MVC, Windows 8 Apps, JavaScript and much more. Keep your skills current with LearnDevNow - 3,200 step-by-step video tutorials by Microsoft MVPs and experts. SALE $99.99 this month only -- learn more at: http://p.sf.net/sfu/learnmore_122412 _______________________________________________ Scikit-learn-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
