2012/11/28 Andreas Mueller <[email protected]>: > Am 28.11.2012 16:46, schrieb Mathieu Blondel: > > > > On Thu, Nov 29, 2012 at 12:33 AM, Andreas Mueller <[email protected]> > wrote: >> >> Do you see where the "sometimes 100x" comes from? >> Not from what he demonstrates, right? >> > scikit-learn is really bad when n_jobs=10. I would be interested in knowing > if the performance gains are mostly coming from the fact that wiseRF is > written in C++ or if they had to use algorithmic improvements. > > Why should C++ be any faster than Cython?
amongst others: template metaprogramming -> see http://lingpipe-blog.com/2011/07/01/why-is-c-so-fast/ if the input data is float64 you need to take conversion to float32 into account; furthermore sklearn will convert to fortran layout -> this will give a huge penalty in memory consumption. > Templating number of bins in leafs? > > Maybe they learned a model to pick good default values for the forest for a > dataset ;) in terms of algorithms and split point evaluation: different strategies are more appropriate for different feature types (lots vs. few split points); > > ------------------------------------------------------------------------------ > Keep yourself connected to Go Parallel: > INSIGHTS What's next for parallel hardware, programming and related areas? > Interviews and blogs by thought leaders keep you ahead of the curve. > http://goparallel.sourceforge.net > _______________________________________________ > Scikit-learn-general mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/scikit-learn-general > -- Peter Prettenhofer ------------------------------------------------------------------------------ Keep yourself connected to Go Parallel: INSIGHTS What's next for parallel hardware, programming and related areas? Interviews and blogs by thought leaders keep you ahead of the curve. http://goparallel.sourceforge.net _______________________________________________ Scikit-learn-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
