Andreas,

Thank you very much for the response; your explanation makes sense. Pandas has
the get_dummies() function, which I've used (dropping one indicator column per
categorical variable to prevent multicollinearity), but I'll check out
OneHotEncoder for that purpose as well.
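
For anyone following along, here is a minimal sketch of the get_dummies()
approach described above. The toy DataFrame and its column names are made up
for illustration, and the choice of which level to drop is arbitrary.

```python
import pandas as pd

# Hypothetical toy data; the column names are assumptions for illustration.
df = pd.DataFrame({
    "color": ["red", "green", "blue", "green"],
    "size": [1.0, 2.5, 3.0, 0.5],
})

# Build one indicator column per category level.
dummies = pd.get_dummies(df["color"], prefix="color")

# Drop one level by hand so the remaining indicators are not perfectly
# collinear with an intercept term.
dummies = dummies.drop("color_blue", axis=1)

# Replace the original categorical column with its indicators.
encoded = pd.concat([df.drop("color", axis=1), dummies], axis=1)
print(encoded)
```

The dropped level ("color_blue" here) becomes the baseline that the remaining
indicators are measured against. Newer pandas releases also accept
drop_first=True in get_dummies(), which drops the first level automatically.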

Sebastian,

Thank you as well for the excellent suggestion of creating a custom class that
houses custom preprocessing routines. I hadn't thought of that, but it makes
perfect sense and provides the functionality I'm looking for while still
letting me take advantage of all the great things scikit-learn does.
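
As a rough sketch of what such a class might look like (this is my own guess
at the idea, not Sebastian's code): the LogTransformer name, its columns
parameter, and the toy data are all hypothetical. It relies only on the
BaseEstimator/TransformerMixin base classes and Pipeline from scikit-learn.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import Pipeline


class LogTransformer(BaseEstimator, TransformerMixin):
    """Hypothetical preprocessing step: apply log1p to selected columns."""

    def __init__(self, columns=None):
        # Store constructor arguments unchanged so get_params/clone work.
        self.columns = columns

    def fit(self, X, y=None):
        # Nothing to learn for this stateless transformation.
        return self

    def transform(self, X):
        # np.array copies by default, so the caller's data is not modified.
        X = np.array(X, dtype=float)
        cols = range(X.shape[1]) if self.columns is None else self.columns
        for c in cols:
            X[:, c] = np.log1p(X[:, c])
        return X


# Toy data, made up for illustration.
X = np.array([[0.0, 1.0], [1.0, 2.0], [3.0, 4.0], [7.0, 8.0]])
y = np.array([1.0, 2.0, 3.0, 4.0])

# The custom step slots into a Pipeline like any built-in transformer.
pipe = Pipeline([
    ("log", LogTransformer(columns=[0])),
    ("reg", LinearRegression()),
])
pipe.fit(X, y)
preds = pipe.predict(X)
```

Because the class implements fit() and transform(), it should also work with
cross-validation and grid search in the same way as the built-in steps.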

-Jason

-----Original Message-----
From: [email protected] 
[mailto:[email protected]] 
Sent: Tuesday, March 03, 2015 3:38 AM
To: [email protected]
Subject: Scikit-learn-general Digest, Vol 62, Issue 4

