Re: [Scikit-learn-general] SGDClassifier: multiclass predict_proba() ?

2012-07-10 Thread Emanuele Olivetti
On 07/10/2012 11:08 PM, Olivier Grisel wrote: > 2012/7/10 Andreas Mueller : >> Hi Emanuel. >> Is there a reason not to train multinomial logistic regression >> (other than that it is not finished yet) ? >> I think it would be more straight-forward and any help >> on the multinomial logistic regress

Re: [Scikit-learn-general] SGDClassifier: multiclass predict_proba() ?

2012-07-10 Thread Andreas Mueller
On 07/10/2012 10:08 PM, Olivier Grisel wrote: > 2012/7/10 Andreas Mueller : >> Hi Emanuel. >> Is there a reason not to train multinomial logistic regression >> (other than that it is not finished yet) ? >> I think it would be more straight-forward and any help >> on the multinomial logistic regress

Re: [Scikit-learn-general] SGDClassifier: multiclass predict_proba() ?

2012-07-10 Thread Olivier Grisel
2012/7/10 Andreas Mueller : > Hi Emanuel. > Is there a reason not to train multinomial logistic regression > (other than that it is not finished yet) ? > I think it would be more straight-forward and any help > on the multinomial logistic regression would be great > (I'm very busy at the moment unf

Re: [Scikit-learn-general] Incorporation of extra training examples

2012-07-10 Thread Andreas Mueller
On 07/09/2012 01:32 PM, Philipp Singer wrote: > Am 09.07.2012 13:59, schrieb Vlad Niculae: >> Another (hackish) idea to try would be to keep the labels of the extra >> data bit give it a sample_weight low enough not to override your good >> training data. > That's actually a great and simple idea.

Re: [Scikit-learn-general] SGDClassifier: multiclass predict_proba() ?

2012-07-10 Thread Andreas Mueller
Hi Emanuel. Is there a reason not to train multinomial logistic regression (other than that it is not finished yet) ? I think it would be more straight-forward and any help on the multinomial logistic regression would be great (I'm very busy at the moment unfortunately). Cheers, Andy On 07/09/20

Re: [Scikit-learn-general] Improving multi-core scalability by avoiding useless memory replication

2012-07-10 Thread Peter Prettenhofer
2012/7/10 Lars Buitinck : > 2012/7/10 Olivier Grisel : >> When doing single node multi cpu parallel machine learning (e.g grid >> search, one vs all SGD, random forests), it would be great to avoid >> duplicating memory, especially for the input dataset that is used as a >> readonly resource in mos

Re: [Scikit-learn-general] Improving multi-core scalability by avoiding useless memory replication

2012-07-10 Thread Olivier Grisel
2012/7/10 Olivier Grisel : > 2012/7/10 Lars Buitinck : >> 2012/7/10 Olivier Grisel : >>> When doing single node multi cpu parallel machine learning (e.g grid >>> search, one vs all SGD, random forests), it would be great to avoid >>> duplicating memory, especially for the input dataset that is used

Re: [Scikit-learn-general] Improving multi-core scalability by avoiding useless memory replication

2012-07-10 Thread Olivier Grisel
2012/7/10 Lars Buitinck : > 2012/7/10 Olivier Grisel : >> When doing single node multi cpu parallel machine learning (e.g grid >> search, one vs all SGD, random forests), it would be great to avoid >> duplicating memory, especially for the input dataset that is used as a >> readonly resource in mos

Re: [Scikit-learn-general] Improving multi-core scalability by avoiding useless memory replication

2012-07-10 Thread Lars Buitinck
2012/7/10 Olivier Grisel : > When doing single node multi cpu parallel machine learning (e.g grid > search, one vs all SGD, random forests), it would be great to avoid > duplicating memory, especially for the input dataset that is used as a > readonly resource in most of our common usecases. I may

[Scikit-learn-general] Improving multi-core scalability by avoiding useless memory replication

2012-07-10 Thread Olivier Grisel
Hi all, When doing single node multi cpu parallel machine learning (e.g grid search, one vs all SGD, random forests), it would be great to avoid duplicating memory, especially for the input dataset that is used as a readonly resource in most of our common usecases. This could be done either with

Re: [Scikit-learn-general] congratulations to Peter and to scikit-learn!

2012-07-10 Thread Peter Prettenhofer
Hi Federico, no not yet - I just approached them recently regarding this issue - I let you know as soon as I hear from them. best, Peter 2012/7/10 federico vaggi : > Peter - did you get any updates from Kaggle? If not, is there anything that > we as a community can do to sway them? > > > On Sa

Re: [Scikit-learn-general] Summary of my recent blog post and GSoC progress

2012-07-10 Thread Alexandre Gramfort
> -- > GOAL: Efficiently support multiple regression targets (bidimensional Y) in > all linear models, like ridge regression and orthogonal matching pursuit > currently do. > > STATUS: Pull request under review. can you give the like so everyone can take a look and eventually give a hand to revi

Re: [Scikit-learn-general] Progress and mid-term evaluation: Speedup of coordinate descent for linear models

2012-07-10 Thread iBayer
Exactly, thanks for the clarification. 2012/7/10 Olivier Grisel > Ok so if I understand correctly the sentence: > > > The glmnet implementation is not yet competitive with the current > > implementation. > > Should read: > > """ > The l1+l2 penalized least square regression implemented with > co

Re: [Scikit-learn-general] Summary of my recent blog post and GSoC progress

2012-07-10 Thread Vlad Niculae
As per Gael's request, here is my progress compared to what was initially stated as mid-term goals. Overall the project is behind schedule, but not far, and I am fairly confident about its successful completion. -- GOAL: Set up a running performance benchmark such as speed.pypy.org or Wes McK

[Scikit-learn-general] Progress and mid-term evaluation: Speedup of coordinate descent for linear models

2012-07-10 Thread iBayer
Dear all, since the start of the project I've been in continuous exchange with my mentor (Alexandre Gramfort) via several pull-request comments. There, I've been reporting my status and asked for feedback, when needed. The promptly feedback of Alexandre kept me going and assured me being on the r

Re: [Scikit-learn-general] Progress and mid-term evaluation: Speedup of coordinate descent for linear models

2012-07-10 Thread Olivier Grisel
Ok so if I understand correctly the sentence: > The glmnet implementation is not yet competitive with the current > implementation. Should read: """ The l1+l2 penalized least square regression implemented with coordinate descent and covariance updates not yet competitive with the current impleme

Re: [Scikit-learn-general] Progress and mid-term evaluation: Speedup of coordinate descent for linear models

2012-07-10 Thread iBayer
"Extremely efficient procedures for fitting the entire lasso or elastic-net regularization path for linear regression, logistic and multinomial regression models. The algorithm uses cyclical coordinate descent in a pathwise fashion, as described in the paper: [1] Regularized Paths for Generalize

Re: [Scikit-learn-general] Progress and mid-term evaluation: Speedup of coordinate descent for linear models

2012-07-10 Thread Olivier Grisel
2012/7/10 iBayer : > Dear all, > > since the start of the project I've been in continuous exchange with my > mentor (Alexandre Gramfort) > > via several pull-request comments. There, I've been reporting my status and > asked for feedback, when needed. The promptly feedback of Alexandre kept me > go

[Scikit-learn-general] Euroscipy 2012: early bird registration ending soon

2012-07-10 Thread Emmanuelle Gouillart
Hello, early bird registration for Euroscipy 2012 is soon coming to an end, with the deadline on July 22nd. Don't forget to register soon! Reduced fees are available for academics, students and speakers. Registration takes place online on http://www.euroscipy.org/conference/euroscipy2012. Euroscip