2014-08-23 21:25 GMT+02:00 Lars Buitinck <larsm...@gmail.com>:
> I was just implementing tf-chi2 today (I have a text classification
> task to improve anyway), so I might send a PR somewhere over the next
> week to at least establish the API. Supervised term weighting is
> pretty big, with hundreds of citations for the major papers.

Later than I hoped, and not a PR but a Gist with unfinished work:

https://gist.github.com/larsmans/239fecd3fc6b49e50da9

This stalled because (1) it didn't work on my dataset, so I had to
come up with something different, and (2) it requires some work to
integrate with sklearn.feature_selection. I had to modify the way chi²
test values are computed.

I'm still interested in this, and I hope others are too, so your
feedback is welcome. Please use the mailing list, because I don't get
a ping for comments on Gists.

_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general