[Scikit-learn-general] RFC (also by users) on interpreting 1d X

Andreas Mueller Fri, 01 May 2015 10:28:12 -0700

Hi all.
A quick questions on future API.

What should happen if a user passes an X with shape (N,), in other wordsX.ndim == 1?


This is unfortunately not really consistent in scikit-learn right now.
Three things are possible:
1) Raise an error
2) N = n_features, that is X contains a single sample
3) N = n_samples, that is X has a single feature

I would think it should be N=n_samples. Gael thinks (iirc) we shouldraise an error.In the code, we currently take N=n_features in predict,decision_function, predict_proba and transform, basically everywhere.This is in part due to using ``check_array`` everywhere, which used thebackward-compatible (but odd) behavior of np.atleast2d.


In ``fit``it looks like all estimators assume |N=n_features|, apart from

DictionaryLearning, MinMaxScaler, StandardScaler, which assume|N=n_samples.|

See https://github.com/scikit-learn/scikit-learn/pull/4511 for morediscussion

Obviously any change we make would mean a deprecation cycle, which willmean warning in 0.17 and 0.18 when someone gives a 1-dim X that we'llchange something soon, and then actually change it in 0.19 (1.0?).


Andy

------------------------------------------------------------------------------
One dashboard for servers and applications across Physical-Virtual-Cloud 
Widest out-of-the-box monitoring support with 50+ applications
Performance metrics, stats and reports that give you Actionable Insights
Deep dive visibility with transaction tracing using APM Insight.
http://ad.doubleclick.net/ddm/clk/290420510;117567292;y

_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

[Scikit-learn-general] RFC (also by users) on interpreting 1d X

Reply via email to