Re: [scikit-learn] How to get the most important features from a RF efficiently

2016-07-21 Thread Raphael C
The problem was that I had a loop like for i in xrange(len(clf.feature_importances_)): print clf.feature_importances_[i] which recomputes the feature importance array in every step. Obvious in hindsight. Raphael On 21 July 2016 at 16:22, Raphael C wrote: > I have a set of feature vectors

[scikit-learn] How to get the most important features from a RF efficiently

2016-07-21 Thread Raphael C
I have a set of feature vectors associated with binary class labels, each of which has about 40,000 features. I can train a random forest classifier in sklearn which works well. I would however like to see the most important features. I tried simply printing out forest.feature_importances_ but thi