2013/7/28 Ross Boucher <[email protected]>: > Interesting, I've been using DictVectorizer (and one hot coded categorical > data) with Random Forests and getting decent results. Is this just > coincidental, and will I see better results if I combine the categorical > data into a single column?
The thing is that dense one-hot arrays may take up huge amounts of memory, not that they won't get good results with RFs (they can, or so I've been told). -- Lars Buitinck Scientific programmer, ILPS University of Amsterdam ------------------------------------------------------------------------------ See everything from the browser to the database with AppDynamics Get end-to-end visibility with application monitoring from AppDynamics Isolate bottlenecks and diagnose root cause in seconds. Start your free trial of AppDynamics Pro today! http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk _______________________________________________ Scikit-learn-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
