The best_score_ is the mean score of the best parameter averaged over folds.
best_estimator_ is the estimator with these parameters retrained on the whole training set.
If this is not clear from the docstring, what alternative would you propose?

Best,
Andy

On 07/25/2014 09:32 PM, Pagliari, Roberto wrote:

I'm not sure if I understand the documentation of gridSearchCV:

best_estimator_

        

estimator

        

Estimator that was chosen by the search, i.e. estimator which gave highest score (or smallest loss if specified) on the left out data.

best_score_

        

float

        

Score of best_estimator on the left out data.

Shouldn't they be the best estimator and best score averaged over the k partition (when using kfold for cross validation)?

If not, is there a way to set this constraint?

Thank you,



------------------------------------------------------------------------------
Want fast and easy access to all the code in your enterprise? Index and
search up to 200,000 lines of code with a free copy of Black Duck
Code Sight - the same software that powers the world's largest code
search on Ohloh, the Black Duck Open Hub! Try it now.
http://p.sf.net/sfu/bds


_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

------------------------------------------------------------------------------
Want fast and easy access to all the code in your enterprise? Index and
search up to 200,000 lines of code with a free copy of Black Duck
Code Sight - the same software that powers the world's largest code
search on Ohloh, the Black Duck Open Hub! Try it now.
http://p.sf.net/sfu/bds
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to