2013/7/28 Ross Boucher <[email protected]>:
> Interesting, I've been using DictVectorizer (and one hot coded categorical
> data) with Random Forests and getting decent results. Is this just
> coincidental, and will I see better results if I combine the categorical
> data into a single column?

The thing is that dense one-hot arrays may take up huge amounts of
memory, not that they won't get good results with RFs (they can, or so
I've been told).

-- 
Lars Buitinck
Scientific programmer, ILPS
University of Amsterdam

------------------------------------------------------------------------------
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to