Re: [Scikit-learn-general] Dropping Python 2.6 compatibility

2016-01-04 Thread Dale Smith
My own opinion, after reading the link in the original message in this thread, is to drop Python 2.6 support. Perhaps Python Weekly and LWN will report on this to increase visibility. As a follow-up, I'd like to point out http://www.snarky.ca/why-python-3-exists Dale Smith, Ph.D.

Re: [Scikit-learn-general] sklearn.preprocessing.normalize does not sum to 1

2015-12-17 Thread Dale Smith
Ryan, did you try passing the arrays, as they are, to np.random.choice? Do you get what you expect? Dale Smith, Ph.D. Data Scientist ​ d. 404.495.7220 x 4008   f. 404.795.7221 Nexidia Corporate | 3565 Piedmont Road, Building Two, Suite 400 | Atlanta, GA 30305 -Original Message

Re: [Scikit-learn-general] sklearn.preprocessing.normalize does not sum to 1

2015-12-17 Thread Dale Smith
completely foreseeable. And perhaps someone on the numpy mailing list could help. Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature%20Images/sig%20logo.png]<http://nexidia.com/> d. 404.495.7220 x 4008 f. 404.795.7221 Nexidia Corporate | 3565 Piedmon

Re: [Scikit-learn-general] Utility of random_state parameter for decision trees

2015-10-16 Thread Dale Smith
I am studying Gilles Louppe's dissertation, which contains the best explanation for various properties of tree methods. If you want to know more, I would start here. http://www.montefiore.ulg.ac.be/~glouppe/pdf/phd-thesis.pdf Dale Smith, Ph.D. Data Scientist ​ d. 404.495.7220 x 400

Re: [Scikit-learn-general] PyCon 2016 scikit-learn tutorial

2015-10-05 Thread Dale Smith
worth having is whether there are any emerging best practices in machine learning, statistics, etc. Since conferences have parallel tracks, it's possible that participants can't get to all talks of interest. Having these talks linked off the scikit-learn web page would be quite va

Re: [Scikit-learn-general] PyCon 2016 scikit-learn tutorial

2015-10-05 Thread Dale Smith
ystem? Why or why not? I haven't seen any of these issues addressed at all, but they are important parts of properly applying machine learning. Dale Smith, Ph.D. Data Scientist ​ d. 404.495.7220 x 4008   f. 404.795.7221 Nexidia Corporate | 3565 Piedmont Road, Building Two,

Re: [Scikit-learn-general] New version of "scipy lecture notes"

2015-09-29 Thread Dale Smith
o a former classmate of mine who is starting to teach statistical learning out of a mathematics department. Dale Smith, Ph.D. Data Scientist ​ d. 404.495.7220 x 4008   f. 404.795.7221 Nexidia Corporate | 3565 Piedmont Road, Building Two, Suite 400 | Atlanta, GA 30305 -Origin

Re: [Scikit-learn-general] GridSearchCV using too many cores?

2015-09-24 Thread Dale Smith
, even with no one else using a 100 gb 24 core Windows box. I can create some reproducible code if anyone has time to work on it. Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature%20Images/sig%20logo.png]<http://nexidia.com/> d. 404.495.7220 x 4008 f. 404.79

Re: [Scikit-learn-general] Persisting models

2015-08-20 Thread Dale Smith
Package sklearn_pmml appeared on github: https://github.com/alex-pirozhenko/sklearn-pmml It's still in the early stages. I have yet to experiment with it, and I don't think it supports pmml import. Dale Smith, Ph.D. Data Scientist ​ d. 404.495.7220 x 4008   f. 404.795.7221 Nexidia

Re: [Scikit-learn-general] Question on the code for Decision Trees

2015-08-13 Thread Dale Smith
Andreas, I tried to compile the package on Windows and didn't succeed. I gave up since I could not get the dependencies to compile. Dale Smith, Ph.D. Data Scientist ​ d. 404.495.7220 x 4008   f. 404.795.7221 Nexidia Corporate | 3565 Piedmont Road, Building Two, Suite 400 | Atlanta, GA

Re: [Scikit-learn-general] Question on the code for Decision Trees

2015-08-13 Thread Dale Smith
sklearn-compiledtrees is not usable on Windows without some work. I didn't have time to get it to work. Dale Smith, Ph.D. Data Scientist ​ d. 404.495.7220 x 4008   f. 404.795.7221 Nexidia Corporate | 3565 Piedmont Road, Building Two, Suite 400 | Atlanta, GA 30305 -Original Me

Re: [Scikit-learn-general] Added sample_weight to RFECV.fit but not sure how to test the change

2015-07-23 Thread Dale Smith
That is very interesting. Thanks. Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature%20Images/sig%20logo.png]<http://nexidia.com/> d. 404.495.7220 x 4008 f. 404.795.7221 Nexidia Corporate | 3565 Piedmont Road, Building Two, Suite 400 | Atlanta, GA

[Scikit-learn-general] [scikit-learn-general] Possible bug in RFECV.fit?

2015-07-22 Thread Dale Smith
0: ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all() Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature%20Images/sig%20logo.png]<http://nexidia.com/> d. 404.495.7220 x 4008 f. 404.795.7221 Nexidia Corp

[Scikit-learn-general] Added sample_weight to RFECV.fit but not sure how to test the change

2015-07-22 Thread Dale Smith
test_metaestimators.py. I also reviewed the Contributing section of the documentation, the wiki, and searched the mailing list archive, but didn’t find anything relevant. Are there any other sources I should review? Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature

[Scikit-learn-general] Added sample_weight to RFECV.fit but not sure how to test the change

2015-07-15 Thread Dale Smith
test_metaestimators.py. I also reviewed the Contributing section of the documentation, the wiki, and searched the mailing list archive, but didn’t find anything relevant. Are there any other sources I should review? Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature

Re: [Scikit-learn-general] Estimators of RAKEL and (Ensemble) Classifier Chain for multilabel proposal

2015-07-10 Thread Dale Smith
-snippets http://scikit-learn.org/stable/developers/index.html But also review Dale Smith, Ph.D. Data Scientist ​ d. 404.495.7220 x 4008   f. 404.795.7221 Nexidia Corporate | 3565 Piedmont Road, Building Two, Suite 400 | Atlanta, GA 30305 -Original Message- From: Al [mailto:alain.pen

[Scikit-learn-general] PDF User's Guide for 0.16.2

2015-07-09 Thread Dale Smith
Hello, when can we expect a PDF version of the User’s Guide for 0.16.2? https://sourceforge.net/projects/scikit-learn/files/documentation/ Thanks very much. Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature%20Images/sig%20logo.png]<http://nexidia.com/&

Re: [Scikit-learn-general] Is it possible to specify the order of spliting in decision tree with scikit-learn?

2015-07-01 Thread Dale Smith
use case? Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature%20Images/sig%20logo.png]<http://nexidia.com/> d. 404.495.7220 x 4008 f. 404.795.7221 Nexidia Corporate | 3565 Piedmont Road, Building Two, Suite 400 | Atlanta, GA 30305 [http://host.msga

Re: [Scikit-learn-general] Library of pre-trained models

2015-07-01 Thread Dale Smith
Apparently so; here is a python/cython implementation. http://rare-technologies.com/deep-learning-with-word2vec-and-gensim/ Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature%20Images/sig%20logo.png]<http://nexidia.com/> d. 404.495.7220 x 4008 f. 404.79

[Scikit-learn-general] Responses to Digests

2015-06-29 Thread Dale Smith
nt. Please read http://www.catb.org/esr/faqs/smart-questions.html and read the user's guide on the scikit-learn web site. Perhaps you can find the answer to your question there. http://scikit-learn.org/stable/user_guide.html Dale Smith, Ph.D. Data Scientist ​ d. 404.495.7220 x 4008   

Re: [Scikit-learn-general] Warm_start on Random Forest Classifiers

2015-06-25 Thread Dale Smith
ees as an evaluation tool is mentioned in the paper http://projecteuclid.org/euclid.ssu/1257431567 Free for download. Dale Smith, Ph.D. Data Scientist ​ [http://host.msgapp.com/Extranet/96621/Signature%20Images/sig%20logo.png]<http://nexidia.com/> d. 404.495.7220 x 4008 f. 404.795.722

[Scikit-learn-general] RandomForestClassifier with warm_start and n_jobs

2015-06-24 Thread Dale Smith
.".format(n_trees)) forest.set_params(n_estimators=n_trees) forest.set_params(n_jobs=10) params = forest.get_params() forest.fit(X, y) error_rate.loc[i] = [n_trees, 1 - forest.oob_score_] sns.lmplot('Number of Trees', 'OOB Error', error_rate).savefig("test_w