2011/9/30 Jacob VanderPlas <[email protected]>: > Hi all, > I spent a half hour last night trying to understand the text feature > extractors in sklearn.feature_extraction.text. I frankly got nowhere: > it is woefully under-documented, both in doc-strings and the online > documentation. Is there anybody who has a familiarity with these > routines and would be willing to spend some time on the docs? That > would be a huge contribution to the usability of scikit-learn. Thanks > Jake
I agree and I am very sorry for that. In the short term the best source of documentation is the tutorial: http://scikit-learn.github.com/scikit-learn-tutorial/working_with_text_data.html It's still on my todo list to work on simplifying the current API and documenting it correctly in the reference documentation. -- Olivier http://twitter.com/ogrisel - http://github.com/ogrisel ------------------------------------------------------------------------------ All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2dcopy2 _______________________________________________ Scikit-learn-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
