Re: [Scikit-learn-general] [GSoC] Metric Learning

2015-03-26 Thread Artem
Oops, missed Reply all once again. Copying the message Yes, the only need for such similarity learners is to use them in a pipeline. It's especially convenient if one wants to do non-linear metric learning using Kernel PCA trick. Then it'd be just another step in the pipeline. What do you mean

Re: [Scikit-learn-general] [GSoC] Metric Learning

2015-03-26 Thread Mathieu Blondel
Something like this: class SimilarityTransformer(TransformerMixin): def fit(self, X, y): self.X_ = X; return self def transform(self, X): return -euclidean_distances(X, self.X_) On Thu, Mar 26, 2015 at 6:28 PM, Artem barmaley@gmail.com wrote: Yes, the only need for

Re: [Scikit-learn-general] [GSoC] Metric Learning

2015-03-26 Thread Mathieu Blondel
On Thu, Mar 26, 2015 at 5:49 PM, Artem barmaley@gmail.com wrote: 1. Right, forgot to add that parameter. Well, I can apply an RBF kernel to get a similarity matrix from a distance matrix inside transform. 2. Usual transformer returns neither distance, nor similarity, but transforms the

Re: [Scikit-learn-general] [GSoC] Metric Learning

2015-03-26 Thread Artem
Hm, but similarity-based clustering works with inter-data similarities, doesn't it? The result's shape would be like [n_samples_in_transform, n_samples_in_train] which is not what we want. On Thu, Mar 26, 2015 at 12:36 PM, Mathieu Blondel math...@mblondel.org wrote: Something like this: class

Re: [Scikit-learn-general] [GSoC] Metric Learning

2015-03-26 Thread Mathieu Blondel
- Spectral clustering use similarities rather than distances and needs affinity=precomputed (otherwise, it assumes that X is [n_samples, n_features]) - Instead of duplicating each class, you could create a generic transformer that outputs a similarity / distance matrix from X. M. On Thu, Mar 26,

Re: [Scikit-learn-general] [GSoC] Metric Learning

2015-03-26 Thread Artem
1. Right, forgot to add that parameter. Well, I can apply an RBF kernel to get a similarity matrix from a distance matrix inside transform. 2. Usual transformer returns neither distance, nor similarity, but transforms the input space so that usual Euclidean distance acts like the learned

Re: [Scikit-learn-general] GSoC 2015 Proposal: Multiple Metric Learning

2015-03-26 Thread Raghav R V
Hey Gael, I am sorry that I missed this comment of yours - 1. The design of multiple metric support is important and would bring an immense usability gain. But it will also require a framework of its own. I would say that this is to be considered in a second step. Could you expand a little

Re: [Scikit-learn-general] GSoC2015 Hyperparameter Optimization topic

2015-03-26 Thread Andreas Mueller
Hi Matthias. As far as I know, the main goal for TPE was to support tree-structured parameter spaces. I am not sure we want to go there yet because of the more complex API. On non-tree structured spaces, I think TPE performed worse than SMAC and GP. With regard to your code: There might be

Re: [Scikit-learn-general] GSoC2015 Hyperparameter Optimization topic

2015-03-26 Thread Christof Angermueller
, Christof On 20150326 16:08, Andreas Mueller wrote: Hi Matthias. As far as I know, the main goal for TPE was to support tree-structured parameter spaces. I am not sure we want to go there yet because of the more complex API. On non-tree structured spaces, I think TPE performed worse than SMAC and GP

Re: [Scikit-learn-general] Proposal for GSoC: Dimensionality reduction and features selection

2015-03-26 Thread Luca Puggini
Dear all, I have updated the proposal https://docs.google.com/document/d/1nnrAsEfkXpGRlc_PMEeuUUQ1ZcNMfy-7M-dZcIVp2lU/edit?usp=sharing following your advices. I have reduced the number of proposed algorithms and I have tried to explain better why we need them and how we can implement them. The

Re: [Scikit-learn-general] GSoC2015 Hyperparameter Optimization topic

2015-03-26 Thread Andreas Mueller
the current interface to make it easier to optimize any learner? Christof On 20150326 20:02, Christof Angermueller wrote: Hi Andy and others, I revised my proposal (https://docs.google.com/document/d/1bAWdiu6hZ6-FhSOlhgH-7x3weTluxRfouw9op9bHBxs/edit?usp=sharing) and submitted it to melange

Re: [Scikit-learn-general] GSoC2015 Hyperparameter Optimization topic

2015-03-26 Thread Christof Angermueller
interface to make it easier to optimize any learner? Christof On 20150326 20:02, Christof Angermueller wrote: Hi Andy and others, I revised my proposal (https://docs.google.com/document/d/1bAWdiu6hZ6-FhSOlhgH-7x3weTluxRfouw9op9bHBxs/edit?usp=sharing) and submitted it to melange. Can you have

Re: [Scikit-learn-general] GSoC2015 Hyperparameter Optimization topic

2015-03-26 Thread Christof Angermueller
about how to define parameters. Thanks for you suggestions, Christof On 20150326 16:08, Andreas Mueller wrote: Hi Matthias. As far as I know, the main goal for TPE was to support tree-structured parameter spaces. I am not sure we want to go there yet because of the more complex API. On non

Re: [Scikit-learn-general] GSoC2015 Hyperparameter Optimization topic

2015-03-26 Thread Andy
, I can not tell you if I will use ParamSklearn to define hyperparameters. Maybe I will come back to you when I think for carefully about how to define parameters. Thanks for you suggestions, Christof On 20150326 16:08, Andreas Mueller wrote: Hi Matthias. As far as I know, the main goal

Re: [Scikit-learn-general] GSoC2015 Improve GMM

2015-03-26 Thread Wei Xue
Hi, Gaƫl and group I really appreciate your comments. You are right. I'd better to stand on the shoulders of giants rather than build all things from scratch. I went through the 120+ comments on the very very initial PR #116 in 2011, https://github.com/scikit-learn/scikit-learn/pull/116. It is

Re: [Scikit-learn-general] Kernel PCA .fit() Failing Silently

2015-03-26 Thread Gael Varoquaux
It would be nice to do something else instead of crash and burn, but for the moment that's on the user. I think that in recent Python versions segfault can be captured. -- Dive into the World of Parallel Programming

Re: [Scikit-learn-general] Question regarding the list of topics for GSoC 2015

2015-03-26 Thread Vinayak Mehta
Sorry for the late reply, my internet connection failed me. I've seen only cross validation being the problem for semi-supervised learning, on the issue tracker. Would someone else like to discuss about this? So, should I scrap the self-taught learning algorithm from the proposal? Also, I'm

Re: [Scikit-learn-general] [GSoC] Metric Learning

2015-03-26 Thread Artem
Sorry, apparently I clicked reply and my previous message went to Mathieu only. Repeat them here: In case of vector y there's no other way, but to assume transitivity. Which is not general enough, but should work in a classification setting. After all, many of these methods are designed to aid

Re: [Scikit-learn-general] GSoC2015 Hyperparameter Optimization topic

2015-03-26 Thread Matthias Feurer
Dear Christof, dear scikit-learn team, This is a great idea, I highly encourage your idea to integrate Bayesian Optimization into scikit-learn since automatically configuring scikit-learn is quite powerful. It was done by the three winning teams of the first automated machine learning