Re: [Scikit-learn-general] Question about naming a clustering algorithm

2013-09-11 Thread Robert Layton
In the interests of a decision, can I push for renaming to SingleLinkageCluster, and then I'll work with Gael on a solution to either introduce a threshold cut to his implementation, or choose some other path? - Robert On 9 September 2013 20:22, Robert Layton wrote: > I haven't yet compared ag

Re: [Scikit-learn-general] Project Best Practices

2013-09-11 Thread Skipper Seabold
On Wed, Sep 11, 2013 at 10:35 PM, David Reed wrote: > I apologize if this is a tad of topic, but I figured this was the best place > to ask if any. > > When presented with a new research idea or project, I consistently deal with > the issue of saving figures and results out to my current working d

[Scikit-learn-general] Project Best Practices

2013-09-11 Thread David Reed
I apologize if this is a tad of topic, but I figured this was the best place to ask if any. When presented with a new research idea or project, I consistently deal with the issue of saving figures and results out to my current working directory, and after working on that idea long enough I eventua

[Scikit-learn-general] Categorical values and decision tree classifier

2013-09-11 Thread yegle
Hi list, I'm a beginner in Machine Learning and trying to write a classifier using training set containing categorical values. From the document [1] I learned that I need to encode (vectorize) my categorical features in order to be learned by the classifier. So I uses `DictVectorizer` to do th

[Scikit-learn-general] Unlabelled and mislabelled data

2013-09-11 Thread Ark
I am currently using [TfidfVectorizer and SGDClassifier] for document classification with ~3000 categories. (n_samples, n_features) = (14000, 40) In my case the dataset the decision that a particular document belonged to a particular category was based upon human observation (very initial

[Scikit-learn-general] Fwd: Build failed in Jenkins: python-2.6-numpy-1.3.0-scipy-0.7.2 #2167

2013-09-11 Thread Olivier Grisel
The bagging tests are failing under old version of numpy / python. Can someone have a look at it? -- Forwarded message -- From: Date: 2013/9/11 Subject: Build failed in Jenkins: python-2.6-numpy-1.3.0-scipy-0.7.2 #2167 To: [email protected], [email protected]