> > - In tree-based Not handling categorical variables as such hurts us a lot > There's a PR to fix that, it still needs a bit of love: > https://github.com/scikit-learn/scikit-learn/pull/4899 >
This is a conversation moved from https://github.com/scikit-learn/scikit-learn/pull/4899 . In the light of the comment above and comments in the PR, I was wondering what changes are needed to make RandomForestClassifier competitive with xgboost and H20 at http://datascience.la/benchmarking-random-forest-implementations/ . Raphael ------------------------------------------------------------------------------ Transform Data into Opportunity. Accelerate data analysis in your applications with Intel Data Analytics Acceleration Library. Click to learn more. http://pubads.g.doubleclick.net/gampad/clk?id=278785351&iu=/4140 _______________________________________________ Scikit-learn-general mailing list Scikit-learn-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/scikit-learn-general