2014-08-23 21:25 GMT+02:00 Lars Buitinck <larsm...@gmail.com>:
> I was just implementing tf-chi2 today (I have a text classification
> task to improve anyway), so I might send a PR somewhere over the next
> week to at least establish the API. Supervised term weighting is
> pretty big, with hundreds of citations for the major papers.

Later than I hoped and not a PR but a Gist with unfinished work:

https://gist.github.com/larsmans/239fecd3fc6b49e50da9

This stuff got stalled because (1) it didn't work on my dataset, so I
had to come up with something different, and (2) it requires some work
to integrate with sklearn.feature_selection. I had to modify the way
chi² test values are computed.

I'm still interested in this, an I hope others are, so your feedback
is welcome. Please use the ML, because I don't get a ping for comments
on Gists.

------------------------------------------------------------------------------
Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
http://pubads.g.doubleclick.net/gampad/clk?id=154622311&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to