Re: [Scikit-learn-general] random forest missing values

2014-09-21 Thread Gilles Louppe
Hi Luca, Missing values are currently not handled in any special way. Best, Gilles On 22 September 2014 07:38, Luca Puggini wrote: > Hi, > I was wondering what approach is used in RandomForestClassifier to dial with > missing values. > Are the samples split in 3 :x>=value, x > Let me know

[Scikit-learn-general] random forest missing values

2014-09-21 Thread Luca Puggini
Hi, I was wondering what approach is used in RandomForestClassifier to dial with missing values. Are the samples split in 3 :x>=value, x-- Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer Achieve PCI DS

[Scikit-learn-general] Issues for 1.0

2014-09-21 Thread Andy
Hi all. I remember that we had a couple of things we wanted to do for 1.0, but I didn't really find a good list. I know Gael mentioned the support for label transformations in pipelines and I want to get GridSearchCV work with CV objects. Is there an actual list somewhere, or are there issues? Sh

Re: [Scikit-learn-general] Sparse Gradient Boosting & Fully Corrective Gradient Boosting

2014-09-21 Thread Mathieu Blondel
On Sun, Sep 21, 2014 at 2:04 AM, Olivier Grisel wrote: > 2014-09-20 8:04 GMT-07:00 Mathieu Blondel : > > > > I recently re-implemented gradient boosting [2]. > > I am interested in your feedback in implementing trees with numba. Is > it easy to reach the speed the scikit-learn cython generated co

Re: [Scikit-learn-general] Sparse Gradient Boosting & Fully Corrective Gradient Boosting

2014-09-21 Thread Mathieu Blondel
On Sun, Sep 21, 2014 at 1:55 AM, Olivier Grisel wrote: > On a related note, here is an implementeation of Logistic Regression > applied to one-hot features obtained from leaf membership info of a > GBRT model: > > > http://nbviewer.ipython.org/github/ogrisel/notebooks/blob/master/sklearn_demos/In

Re: [Scikit-learn-general] Sparse Gradient Boosting & Fully Corrective Gradient Boosting

2014-09-21 Thread Mathieu Blondel
Hi Ken, On Sun, Sep 21, 2014 at 4:16 AM, c TAKES wrote: > > Understandable that scikit-learn wants to focus on more mature algorithms, > so perhaps I'll spend my efforts more on writing a python wrapper for > Johnson and Zhang's implementation of RGF, at least for now. Personally I > do think i