Re: [Scikit-learn-general] Composite scores in grid_search.BaseSearchCV

2013-03-12 Thread Joel Nothman
Andreas wrote: > Sorry for being terse, I should be working. Yes, me too. I think this is the last commit you'll see from me for a while. > We really can not break backward compatibility. > One possibility is to have the old ``grid_scores_`` be the same as Backwards-compatible ``grid_scores_``

Re: [Scikit-learn-general] CountVectorizer in feature extraction is still slow

2013-03-12 Thread Roman Sinayev
Here's what I got so far: http://pastie.org/6464655 It's about 40% faster. I still need to add the fixed vocabulary option and parallelize.

Re: [Scikit-learn-general] Problem with Ward

2013-03-12 Thread Andreas Mueller
Hi Alex. It should be fully connected. I'll check again. Thanks. Andy On 03/12/2013 04:07 PM, Alexandre Gramfort wrote: > hi Andy, > > is your graph fully connected? i.e. one connected component? if not > you should tell the estimator. > > let me know if it works. > > Alex > > On Tue, Mar 12, 2013

Re: [Scikit-learn-general] Problem with Ward

2013-03-12 Thread Alexandre Gramfort
hi Andy, is your graph fully connected? i.e. one connected component? if not you should tell the estimator. let me know if it works. Alex On Tue, Mar 12, 2013 at 3:16 PM, Andreas Mueller wrote: > Hey everybody. > I have been trying to use Ward with a fixed connectivity matrix today > and ran int
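The check Alex asks about can be run before fitting. Below is a minimal sketch in plain Python, assuming the connectivity is available as a symmetric adjacency mapping. `count_components` is a hypothetical helper written for this illustration, not part of scikit-learn, and the graph is made-up data.

```python
from collections import deque


def count_components(adjacency):
    """Count connected components of an undirected graph.

    `adjacency` maps each node to an iterable of its neighbours
    (a hypothetical input format chosen for this sketch).
    """
    seen = set()
    n_components = 0
    for start in adjacency:
        if start in seen:
            continue
        # A node not reached so far starts a new component.
        n_components += 1
        seen.add(start)
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for neighbour in adjacency[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    queue.append(neighbour)
    return n_components


# Made-up example: nodes 0-1-2 form a chain, nodes 3-4 are a separate pair.
graph = {0: [1], 1: [0, 2], 2: [1], 3: [4], 4: [3]}
print(count_components(graph))
```

A result greater than one means the graph is not fully connected. For a sparse connectivity matrix, `scipy.sparse.csgraph.connected_components` should give the same count without a hand-written traversal.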

[Scikit-learn-general] Problem with Ward

2013-03-12 Thread Andreas Mueller
Hey everybody. I have been trying to use Ward with a fixed connectivity matrix today and ran into some problems: File "/home/VI/staff/amueller/checkout/scikit-learn/sklearn/base.py", line 330, in fit_predict self.fit(X) File "/home/VI/staff/amueller/checkout/scikit-learn/sklearn/clus

Re: [Scikit-learn-general] Composite scores in grid_search.BaseSearchCV

2013-03-12 Thread Andreas Mueller
On 03/12/2013 02:05 PM, Joel Nothman wrote: > Andreas wrote: > >> In the meantime, did you have a look at >> https://github.com/scikit-learn/scikit-learn/pull/1742? > No, I hadn't, but now I've merged that (not a trivial merge), and > changed a couple of things a little. > > https://github.com/jnot

Re: [Scikit-learn-general] Composite scores in grid_search.BaseSearchCV

2013-03-12 Thread Joel Nothman
Andreas wrote: > In the meantime, did you have a look at > https://github.com/scikit-learn/scikit-learn/pull/1742? No, I hadn't, but now I've merged that (not a trivial merge), and changed a couple of things a little. https://github.com/jnothman/scikit-learn/tree/grid_search_more_info (6e71aeaf8

Re: [Scikit-learn-general] Different BernoulliNB coef matrix when using DictVectorizer vs FeatureHasher

2013-03-12 Thread Lars Buitinck
2013/3/12 Raj Arasu : > I am new to the "hashing trick" in general, but should I expect to get the > same coefficient matrix when training a BernoulliNB model using a > DictVectorizer versus a FeatureHasher as feature extractors? I am getting > different coefficient matrixes. No, you will most li
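One general reason the two matrices need not match is that a vocabulary-based extractor and a hashing extractor assign feature names to columns in different ways. The sketch below illustrates that idea in plain Python and does not use scikit-learn. `vocabulary_columns` and `hashed_column` are hypothetical helpers, MD5 stands in for whatever hash function the library uses, and the samples are made-up data.

```python
import hashlib


def vocabulary_columns(samples):
    """Assign columns by sorted feature name, in the spirit of a learned vocabulary."""
    names = sorted({name for sample in samples for name in sample})
    return {name: index for index, name in enumerate(names)}


def hashed_column(name, n_features):
    """Assign a column by hashing the feature name (MD5 is an arbitrary stand-in)."""
    digest = hashlib.md5(name.encode("utf-8")).hexdigest()
    return int(digest, 16) % n_features


# Made-up samples in the dict-of-features format.
samples = [{"cat": 1, "dog": 1}, {"dog": 1, "fish": 1}]

vocab = vocabulary_columns(samples)
hashed = {name: hashed_column(name, n_features=8) for name in vocab}

print(vocab)   # columns follow the sorted feature names
print(hashed)  # columns follow the hash values, in a wider fixed-size space
```

With hashing, the column index depends only on the name and the table size, so the column order and the number of columns generally differ from the vocabulary layout, and two names can collide in one column.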

[Scikit-learn-general] Different BernoulliNB coef matrix when using DictVectorizer vs FeatureHasher

2013-03-12 Thread Raj Arasu
I am new to the "hashing trick" in general, but should I expect to get the same coefficient matrix when training a BernoulliNB model using a DictVectorizer versus a FeatureHasher as feature extractors? I am getting different coefficient matrixes.

Re: [Scikit-learn-general] Composite scores in grid_search.BaseSearchCV

2013-03-12 Thread Andreas Mueller
Hi Joel. Thanks for your input. Thinking about per-fold and per-parameter values is definitely a good idea. I didn't have time to go through your proposal in detail, will try to do asap. In the meantime, did you have a look at https://github.com/scikit-learn/scikit-learn/pull/1742? Cheers, Andy