Re: [Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread Andreas Mueller
Hi Matthieu. The release contains metric and non-metric MDS. Details can be found here: http://scikit-learn.org/stable/modules/manifold.html#multidimensional-scaling I don't think the implementation includes classical MDS, as that would be redundant, like you said. Hope that helps, Andy On 09

Re: [Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread Andreas Mueller
On 09/04/2012 11:50 PM, Zach Bastick wrote: Congratulations to all the developers! This should save a lot of time for people trying to find the new features that were in the dev version but weren't in 0.11, like me :) Cheers, Zach One particular improvement (not) just for you: If you use t

Re: [Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread Matthieu Brucher
Excellent work! I have a question on MDS. Is it the classic MDS or something else? (Asking the question as PCA is the classic MDS). It seems to be when the distance matrix is Euclidean? Cheers, Matthieu 2012/9/5 Andreas Mueller > Dear fellow Pythonistas. > I am pleased to announce the release

Re: [Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread bthirion
Congrats ! And again, thanks, Andy, B On 09/05/2012 12:38 AM, Andreas Mueller wrote: Dear fellow Pythonistas. I am pleased to announce the release of scikit-learn 0.12. This release adds several new features, for example multidimensional scaling (MDS), multi-task Lasso and multi-output decision

Re: [Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread Satrajit Ghosh
> > Thanks for all the work, Andy! > +1 cheers, satra -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. D

Re: [Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread Robert Layton
On 5 September 2012 09:08, Olivier Grisel wrote: > 2012/9/5 Lars Buitinck : > > 2012/9/5 Andreas Mueller : > >> I am pleased to announce the release of scikit-learn 0.12. > >> This release adds several new features, for example > >> multidimensional scaling (MDS), multi-task Lasso > >> and multi-

Re: [Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread Olivier Grisel
2012/9/5 Lars Buitinck : > 2012/9/5 Andreas Mueller : >> I am pleased to announce the release of scikit-learn 0.12. >> This release adds several new features, for example >> multidimensional scaling (MDS), multi-task Lasso >> and multi-output decision and regression forests. > > Thanks for all the

Re: [Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread Lars Buitinck
2012/9/5 Andreas Mueller : > I am pleased to announce the release of scikit-learn 0.12. > This release adds several new features, for example > multidimensional scaling (MDS), multi-task Lasso > and multi-output decision and regression forests. Thanks for all the work, Andy! -- Lars Buitinck Sci

Re: [Scikit-learn-general] Whatever happened to moving the website

2012-09-04 Thread Olivier Grisel
2012/9/4 Andreas Mueller : > Hey everybody. > I was just wondering what happened to moving the website. > What was the problem again? > Can we maybe open an issue or something with a todo? The lack of ability to do redirects with github.com There is a bunch of redirects in place on the sourceforg

Re: [Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread Zach Bastick
Congratulations to all the developers! This should save a lot of time for people trying to find the new features that were in the dev version but weren't in 0.11, like me :) Cheers, Zach On 05/09/2012 00:38, Andreas Mueller wrote: Dear fellow Pythonistas. I am pleased to announce the relea

[Scikit-learn-general] ANN: scikit-learn 0.12

2012-09-04 Thread Andreas Mueller
Dear fellow Pythonistas. I am pleased to announce the release of scikit-learn 0.12. This release adds several new features, for example multidimensional scaling (MDS), multi-task Lasso and multi-output decision and regression forests. There has also been a lot of progress in documentation and eas

[Scikit-learn-general] Whatever happened to moving the website

2012-09-04 Thread Andreas Mueller
Hey everybody. I was just wondering what happened to moving the website. What was the problem again? Can we maybe open an issue or something with a todo? Cheers, Andy -- Live Security Virtual Conference Exclusive live eve

Re: [Scikit-learn-general] scale features to specific range

2012-09-04 Thread Andreas Mueller
On 09/04/2012 04:27 PM, Olivier Grisel wrote: > 2012/9/4 Alexandre Gramfort : >>> You can just subtract the minimum and divide by the max-min to get >>> [0, 1] features and then go from there. >>> We could actually add something to do this. >> that would be useful for pipelines so scaling is done o

Re: [Scikit-learn-general] scale features to specific range

2012-09-04 Thread Olivier Grisel
2012/9/4 Alexandre Gramfort : >> You can just subtract the minimum and divide by the max-min to get >> [0, 1] features and then go from there. >> We could actually add something to do this. > > that would be useful for pipelines so scaling is done on training fold > and not on full data. Yes, I wo

Re: [Scikit-learn-general] scale features to specific range

2012-09-04 Thread Alexandre Gramfort
> You can just subtract the minimum and divide by the max-min to get > [0, 1] features and then go from there. > We could actually add something to do this. that would be useful for pipelines so scaling is done on training fold and not on full data. Alex -

Re: [Scikit-learn-general] p values of exactly zero in chi2

2012-09-04 Thread Andreas Mueller
X = np.array([[0, 1], [1, 0]]) observed = X Y = np.array([[0,1], [1,0]]) expected = np.dot(np.atleast_2d(Y.mean(axis=0)).T, np.atleast_2d(X.sum(axis=0))) chisquare(observed, expected) > (array([ 1., 1.]), array([ 0., 0.])) > > It may not be pretty,

Re: [Scikit-learn-general] p values of exactly zero in chi2

2012-09-04 Thread Andreas Mueller
On 09/04/2012 04:07 PM, Olivier Grisel wrote: > 2012/9/4 Andreas Mueller : >> On 09/04/2012 03:43 PM, Olivier Grisel wrote: >>> 2012/9/4 Andreas Mueller : Hi everybody. I'm pretty new to feature selection stuff and I tried to use the chi2 selection. I got a pvalue of exactly zer

Re: [Scikit-learn-general] p values of exactly zero in chi2

2012-09-04 Thread Olivier Grisel
2012/9/4 Andreas Mueller : > On 09/04/2012 03:43 PM, Olivier Grisel wrote: >> 2012/9/4 Andreas Mueller : >>> Hi everybody. >>> I'm pretty new to feature selection stuff and I tried to use the chi2 >>> selection. >>> I got a pvalue of exactly zero on one of the features and one of e-250 >>> on anoth

Re: [Scikit-learn-general] scale features to specific range

2012-09-04 Thread Andreas Mueller
Hi Sheila. I think we don't have a function for this because it seems pretty easy to do yourself. That might be also true for "Scaler" and "Normalizer", though. You can just subtract the minimum and divide by the max-min to get [0, 1] features and then go from there. We could actually add someth

Re: [Scikit-learn-general] p values of exactly zero in chi2

2012-09-04 Thread Lars Buitinck
2012/9/4 Andreas Mueller : > On 09/04/2012 03:23 PM, Lars Buitinck wrote: >> What did the input look like? chi2 expects frequencies, i.e. strictly >> non-negative feature values. >> > The inputs were non-negative, but some >1. That should be perfectly ok, it's designed for the kind of output that

[Scikit-learn-general] scale features to specific range

2012-09-04 Thread Sheila the angel
Hello All, Another short question, How to scale data features to a given range? For example I want to scale my data features to [-1, +1] or [0,100] . sklearn.preprocessing don't have any direct function for this !! Thanks -- Sheila -

Re: [Scikit-learn-general] p values of exactly zero in chi2

2012-09-04 Thread Andreas Mueller
On 09/04/2012 03:43 PM, Olivier Grisel wrote: > 2012/9/4 Andreas Mueller : >> Hi everybody. >> I'm pretty new to feature selection stuff and I tried to use the chi2 >> selection. >> I got a pvalue of exactly zero on one of the features and one of e-250 >> on another one. >> That seems a bit fishy,

Re: [Scikit-learn-general] p values of exactly zero in chi2

2012-09-04 Thread Olivier Grisel
2012/9/4 Andreas Mueller : > Hi everybody. > I'm pretty new to feature selection stuff and I tried to use the chi2 > selection. > I got a pvalue of exactly zero on one of the features and one of e-250 > on another one. > That seems a bit fishy, in particular as they don't seem to correlate > very s

Re: [Scikit-learn-general] Scikit-learn-general Digest, Vol 32, Issue 10

2012-09-04 Thread Andreas Mueller
On 09/04/2012 03:23 PM, Lars Buitinck wrote: > 2012/9/4 : >> Date: Tue, 04 Sep 2012 14:49:44 +0100 >> From: Andreas Mueller >> Subject: [Scikit-learn-general] p values of exactly zero in chi2 >> To: [email protected] >> Message-ID: <[email protected]> >> Co

Re: [Scikit-learn-general] Scikit-learn-general Digest, Vol 32, Issue 10

2012-09-04 Thread Lars Buitinck
2012/9/4 : > Date: Tue, 04 Sep 2012 14:49:44 +0100 > From: Andreas Mueller > Subject: [Scikit-learn-general] p values of exactly zero in chi2 > To: [email protected] > Message-ID: <[email protected]> > Content-Type: text/plain; charset=ISO-8859-1; format=fl

Re: [Scikit-learn-general] Plans on implementing CHAID? (CHi-squared Automatic Interaction Detection)

2012-09-04 Thread Marcos Wolff
Hi Olivier, Ok. I forked scikit-learn on github so I'm aware of pull requests and other activities. If I could contribute to this feature, I would be glad to mantain it with documentations, tests, usage examples, etc. I think it's a very interesting algorithm and it's worth devote time to it. Oh

[Scikit-learn-general] p values of exactly zero in chi2

2012-09-04 Thread Andreas Mueller
Hi everybody. I'm pretty new to feature selection stuff and I tried to use the chi2 selection. I got a pvalue of exactly zero on one of the features and one of e-250 on another one. That seems a bit fishy, in particular as they don't seem to correlate very strongly. Maybe I misunderstood somethi

Re: [Scikit-learn-general] Plans on implementing CHAID? (CHi-squared Automatic Interaction Detection)

2012-09-04 Thread Olivier Grisel
2012/9/4 Marcos Wolff : > Hi, > > I was wondering if there are plans on implementing CHAID techniques for tree > growing > > http://en.wikipedia.org/wiki/CHAID > Gordon Kass 1980's paper: > http://ebookbrowse.com/gdoc.php?id=60655988&url=705c072c97190f9f1c59ac51aa72a258. > > SPSS uses it and: > -it

[Scikit-learn-general] Plans on implementing CHAID? (CHi-squared Automatic Interaction Detection)

2012-09-04 Thread Marcos Wolff
Hi, I was wondering if there are plans on implementing CHAID techniques for tree growing http://en.wikipedia.org/wiki/CHAID Gordon Kass 1980's paper: http://ebookbrowse.com/gdoc.php?id=60655988&url=705c072c97190f9f1c59ac51aa72a258 . SPSS uses it and: -it's very effective for multi-class classifi

Re: [Scikit-learn-general] multiclass classification algorithms in Sklearn

2012-09-04 Thread Olivier Grisel
Indeed, I forgot about naive bayes and LDA. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions

Re: [Scikit-learn-general] multiclass classification algorithms in Sklearn

2012-09-04 Thread Olivier Grisel
2012/9/4 Sheila the angel : > Hello, > I would like to know what are the native multiclass classification > algorithms in Sklearn. > For example SVM are basically binary classification algorithm while Tree > methods are Multiclass algorithm (not sure about this). It all depends on what you call "n

Re: [Scikit-learn-general] multiclass classification algorithms in Sklearn

2012-09-04 Thread Andreas Mueller
Hi Sheila. There is a list here: http://scikit-learn.org/dev/modules/multiclass.html It is not exhaustive, though (maybe it should be). QDA and the other forest-based methods are also multi-class, so are the neighbors based methods. Cheers, Andy On 09/04/2012 09:59 AM, Sheila the angel wrote:

[Scikit-learn-general] multiclass classification algorithms in Sklearn

2012-09-04 Thread Sheila the angel
Hello, I would like to know what are the native multiclass classification algorithms in Sklearn. For example SVM are basically binary classification algorithm while Tree methods are Multiclass algorithm (not sure about this). Do you know any other algorithms? Thanks -- Sheila -