FYI in the context of transforming only certain fields, you might also be interested in this addition to FeatureUnion, which will handle parts of dicts or dataframes: https://github.com/scikit-learn/scikit-learn/pull/3886
It's still waiting for reviews, though... On 03/03/2015 05:46 PM, Jason Wolosonovich wrote: > Andreas, > > Thank you very much for the response, your explanation makes sense. Pandas > has the get_dummies() method that I've used (and then dropped one of each of > the categorical indicators to prevent multicolinearity) but I'll check out > One-Hot Encoder for that purpose as well. > > Sebastian, > > Thank you as well for an excellent suggestion on creating a custom class that > houses custom preprocessing routines, I hadn't thought of that but it makes > perfect sense and provides the functionality I'm looking for while still > being able to take advantage of all the great things scikit-learn does. > > -Jason > > -----Original Message----- > From: scikit-learn-general-requ...@lists.sourceforge.net > [mailto:scikit-learn-general-requ...@lists.sourceforge.net] > Sent: Tuesday, March 03, 2015 3:38 AM > To: scikit-learn-general@lists.sourceforge.net > Subject: Scikit-learn-general Digest, Vol 62, Issue 4 > > Send Scikit-learn-general mailing list submissions to > scikit-learn-general@lists.sourceforge.net > > To subscribe or unsubscribe via the World Wide Web, visit > https://lists.sourceforge.net/lists/listinfo/scikit-learn-general > or, via email, send a message with subject or body 'help' to > scikit-learn-general-requ...@lists.sourceforge.net > > You can reach the person managing the list at > scikit-learn-general-ow...@lists.sourceforge.net > > When replying, please edit your Subject line so it is more specific than "Re: > Contents of Scikit-learn-general digest..." > > ------------------------------------------------------------------------------ > Dive into the World of Parallel Programming The Go Parallel Website, sponsored > by Intel and developed in partnership with Slashdot Media, is your hub for all > things parallel software development, from weekly thought leadership blogs to > news, videos, case studies, tutorials and more. Take a look and join the > conversation now. http://goparallel.sourceforge.net/ > _______________________________________________ > Scikit-learn-general mailing list > Scikit-learn-general@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/scikit-learn-general ------------------------------------------------------------------------------ Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/ _______________________________________________ Scikit-learn-general mailing list Scikit-learn-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/scikit-learn-general