Re: [Scikit-learn-general] Test coverage is slowly decreasing

2012-07-26 Thread Gael Varoquaux
On Thu, Jul 26, 2012 at 08:01:03AM -0700, Jake Vanderplas wrote: > If we think that it's worth updating our arpack clone and copying the > test suite from scipy, I'd be happy to prepare a PR. It would make our > arpack port more stable and increase our test coverage by ~500 lines, > which is a

Re: [Scikit-learn-general] Test coverage is slowly decreasing

2012-07-26 Thread Olivier Grisel
2012/7/26 Jake Vanderplas : > Hi, > On 07/26/2012 02:37 AM, Gael Varoquaux wrote: > >> Awesome! I just want to thank the team: after I sent this email a lot of >> different people pitched in and we are now with a coverage level that is >> above where we were a couple month ago: >> https://jenkins.

Re: [Scikit-learn-general] Test coverage is slowly decreasing

2012-07-26 Thread Jake Vanderplas
Hi, On 07/26/2012 02:37 AM, Gael Varoquaux wrote: > Awesome! I just want to thank the team: after I sent this email a lot of > different people pitched in and we are now with a coverage level that is > above where we were a couple month ago: > https://jenkins.shiningpanda.com/scikit-learn/job/pyth

Re: [Scikit-learn-general] Test coverage is slowly decreasing

2012-07-26 Thread Andreas Müller
> > On Tue, Jul 24, 2012 at 11:32:19AM +0200, Gael Varoquaux wrote: > > On Sat, Jul 21, 2012 at 12:18:46AM +0200, Olivier Grisel wrote: > > > Recent merges have slowly decreased the test coverage ratio of > > > the > > > code base (from nearly 90% down to 85% now): > > > This email discusses how

Re: [Scikit-learn-general] Test coverage is slowly decreasing

2012-07-26 Thread Gael Varoquaux
On Tue, Jul 24, 2012 at 11:32:19AM +0200, Gael Varoquaux wrote: > On Sat, Jul 21, 2012 at 12:18:46AM +0200, Olivier Grisel wrote: > > Recent merges have slowly decreased the test coverage ratio of the > > code base (from nearly 90% down to 85% now): > This email discusses how to do something about

Re: [Scikit-learn-general] oversampling and undersampling methods

2012-07-26 Thread Olivier Grisel
Have you tried class_weight='auto'? -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will inc

Re: [Scikit-learn-general] oversampling and undersampling methods

2012-07-26 Thread Sheila the angel
May be this will clarify problem - I have unbalanced class data in which I am performing SVM classification. Classification with equal weight for both class gives very low sensitivity etc. (misclassification of low data class). I tried give different class weight but I am not sure how to do it auto

Re: [Scikit-learn-general] [Scikit-learn-commits] [scikit-learn/scikit-learn] 01cf17: DOC: notes on improving code coverage

2012-07-26 Thread Olivier Grisel
2012/7/26 Gael Varoquaux : > On Thu, Jul 26, 2012 at 10:48:48AM +0200, Olivier Grisel wrote: >> Note that jenkins automatically publishes code coverage reports with >> annotated source code: > > Yes, but they don't match what I get on my box. They are computed with `make test-coverage` as well on

Re: [Scikit-learn-general] [Scikit-learn-commits] [scikit-learn/scikit-learn] 01cf17: DOC: notes on improving code coverage

2012-07-26 Thread Gael Varoquaux
On Thu, Jul 26, 2012 at 10:48:48AM +0200, Olivier Grisel wrote: > Note that jenkins automatically publishes code coverage reports with > annotated source code: Yes, but they don't match what I get on my box. -- Live Secur

Re: [Scikit-learn-general] [Scikit-learn-commits] [scikit-learn/scikit-learn] 01cf17: DOC: notes on improving code coverage

2012-07-26 Thread Olivier Grisel
Note that jenkins automatically publishes code coverage reports with annotated source code: https://jenkins.shiningpanda.com/scikit-learn/job/python-2.7-numpy-1.5.1-scipy-0.10.0/lastSuccessfulBuild/artifact/coverage/index.html You can order the modules by "coverage %" or by "missing" and get the

Re: [Scikit-learn-general] LogisticRegression to initiate SGDClassifier

2012-07-26 Thread Olivier Grisel
2012/7/25 Fred Mailhot : > Hi all, > > I've got a text classification problem on which LogisticRegression > consistently outperforms SGDClassifier(loss="log") by a few percentage > points on the smallish [O(10^5) points] datasets I've been using for initial > development/testing. The data set I'll