2014-08-23 20:41 GMT+02:00 Gael Varoquaux <gael.varoqu...@normalesup.org>: > Interesting discussion. Of course, the danger here is that it might be > borderline for the scope of scikit-learn. In case somebody is going to > docstringdo a PR on these topics, I would advise to work on the docstring > and narrative documentation to explain well why this can be useful > not only for text analysis, but also information retrieval, which is a > wider topic.
I was just implementing tf-chi2 today (I have a text classification task to improve anyway), so I might send a PR somewhere over the next week to at least establish the API. Supervised term weighting is pretty big, with hundreds of citations for the major papers. I don't think we should be selling anything as "you can IR with this", though -- practical IR is closer to databases than to machine learning in terms of technology. ------------------------------------------------------------------------------ Slashdot TV. Video for Nerds. Stuff that matters. http://tv.slashdot.org/ _______________________________________________ Scikit-learn-general mailing list Scikit-learn-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/scikit-learn-general