That is a common way to do it, though not the default behavior of LinearSVC IIRC.

On 01/29/2015 12:54 PM, Sebastian Raschka wrote:
A naive but related question:

doesn't the l1 norm allow for 0-coefficients? That would be one way to get rid of the "not so useful" features.

On Jan 29, 2015, at 11:55 AM, Pagliari, Roberto <rpagli...@appcomsci.com <mailto:rpagli...@appcomsci.com>> wrote:

When using a feature selection algorithm in a pipeline, for example

clf=Pipeline([

  ('feature_selection', LinearSVC(penalty="l1")),

  ('classification', RandomForestClassifier())

])

clf.fit(X, y)

or even a random forest, for that matter, how does sklearn know how many features to keep?
Thank you,
------------------------------------------------------------------------------
Dive into the World of Parallel Programming. The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now.http://goparallel.sourceforge.net/_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net <mailto:Scikit-learn-general@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general



------------------------------------------------------------------------------
Dive into the World of Parallel Programming. The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net/


_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

------------------------------------------------------------------------------
Dive into the World of Parallel Programming. The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to