Hi all, first of all, I think that having more feature selection capabilities in scikit-learn would be nice, especially, an algorithm from the wrapper category that also regards dependence/interaction between features.
Regarding the SequentialFeatureSelection class... We actually decided to simplify this a little bit (compared to the mlxtend variant) and only include the "simple" or "regular" forward and backward selection, and not the floating variants. So, we probably don't want to go overboard and have too many comprehensive algos in a core package such as sklearn, but focus on the main ones whereas we can delegate others (e.g., genetic algorithms, which may implementation-wise rely on an external GP package?) to contrib projects? Anyway, regarding the PR ... I didn't mean to drag in on for that long, but between PR and review, other things always came up and I never got around adding the docs -- I actually forgot at some point then. I think the current state is that the implementation is more or less ok and just needs some polishing maybe. Primarily, what's missing though are the docs and more comprehensive unit tests. This is something I can do in the next few days or weeks (now that I am aware of it) but I also wouldn't mind if someone else works on it. So, let me know if you like to work on the PR, and otherwise, I will make a note for next weekend to look into adding the docs. In any case though, I would appreciate feedback regarding the current implementation. Best, Sebastian > On Sep 3, 2018, at 7:50 AM, Guillaume LemaƮtre <g.lemaitr...@gmail.com> wrote: > > I would add that Sequential Forward Selection is on the way to be > ported by Sebastian (@rabst) > to scikit-learn: > > https://github.com/scikit-learn/scikit-learn/pull/8684 > > However, I am sure that Sebastian would be grateful if you wish to > take over the PR and to move it forward. > But Sebastian is probably going to comment himself ;) > > Cheers, > On Mon, 3 Sep 2018 at 13:35, Oliver Tomic <oliverto...@zoho.com> wrote: >> >> Hi Shuki and Yaniv, >> >> the sequential forward selection algorithm is already implemented in the >> mlxtend python package, which is complimentary to scikit learn. >> https://rasbt.github.io/mlxtend/user_guide/feature_selection/SequentialFeatureSelector/ >> >> best wishes >> Oliver >> >> >> >> >> ---- On Mon, 03 Sep 2018 13:17:30 +0200 Shuki Cohen <shok...@gmail.com> >> wrote ---- >> >> >> >> On Mon, Sep 3, 2018 at 1:21 PM Shuki Cohen <shok...@gmail.com> wrote: >> >> Hi all, >> >> Me and a friend of mine found lack of feature selection functionalities in >> Scikit-learn and we thought to contribute in order to answer this need. More >> specifically, we want to add: >> 1. Sequential Forward Selection algorithm >> 2. Multivariate Feature Selection >> to the Scikit-learn code base, and this mail is to get your approval that >> such a project has good chances to be added to the next version. >> >> Thanks in advance >> Shuki & Yaniv >> >> _______________________________________________ >> scikit-learn mailing list >> scikit-learn@python.org >> https://mail.python.org/mailman/listinfo/scikit-learn >> >> >> >> _______________________________________________ >> scikit-learn mailing list >> scikit-learn@python.org >> https://mail.python.org/mailman/listinfo/scikit-learn > > > > -- > Guillaume Lemaitre > INRIA Saclay - Parietal team > Center for Data Science Paris-Saclay > https://glemaitre.github.io/ > _______________________________________________ > scikit-learn mailing list > scikit-learn@python.org > https://mail.python.org/mailman/listinfo/scikit-learn _______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn