Re: [Scikit-learn-general] Access data arriving at leaf nodes

2014-10-15 Thread Gilles Louppe
-general] Access data arriving at leaf nodes To: scikit-learn-general scikit-learn-general@lists.sourceforge.net Message-ID: CAAkaFLUB_ApLWGosUovxfEoEi34bcw-ePke0TBCKF3NrQpF=u...@mail.gmail.com Content-Type: text/plain; charset=utf-8 What do you mean by all the values that make up a leaf node

Re: [Scikit-learn-general] Access data arriving at leaf nodes

2014-10-15 Thread Olivier Grisel
It can happen when you reach max_depth, or for regression tasks if all the samples in the lead of exact same target value. Hum. I meant: It can happen when you reach max_depth, or if all the samples in the leaf had the exact same target value for regression tasks (or belong to the same class

Re: [Scikit-learn-general] Access data arriving at leaf nodes

2014-10-15 Thread M Asad
= KernelDensity(kernel='gaussian', bandwidth=0.2).fit(leafNodeSamples) Muhammad Date: Wed, 15 Oct 2014 08:21:45 +0200 From: Gilles Louppe g.lou...@gmail.com Subject: Re: [Scikit-learn-general] Access data arriving at leaf nodes To: scikit-learn-general scikit-learn-general@lists.sourceforge.net

[Scikit-learn-general] Access data arriving at leaf nodes

2014-10-14 Thread M Asad
Hi, I am kind of new to scikit, however I have learned a alot of things now. I am using scikit.ensemble.RandomForestRegressor to train on a data and predict using some input samples later. What I am trying to do now is to access the actual values that make up each leaf node. I have managed to

Re: [Scikit-learn-general] Access data arriving at leaf nodes

2014-10-14 Thread Joel Nothman
What do you mean by all the values that make up a leaf node? If you mean all the samples, isn't apply sufficient? On 15 October 2014 06:20, M Asad masad@gmail.com wrote: Hi, I am kind of new to scikit, however I have learned a alot of things now. I am using

Re: [Scikit-learn-general] Access data arriving at leaf nodes

2014-10-14 Thread Olivier Grisel
2014-10-14 15:20 GMT-04:00 M Asad masad@gmail.com: Hi, I am kind of new to scikit, however I have learned a alot of things now. I am using scikit.ensemble.RandomForestRegressor to train on a data and predict using some input samples later. What I am trying to do now is to access the

Re: [Scikit-learn-general] Access data arriving at leaf nodes

2014-10-14 Thread M Asad
Muhammad Date: Wed, 15 Oct 2014 07:59:09 +1100 From: Joel Nothman joel.noth...@gmail.com Subject: Re: [Scikit-learn-general] Access data arriving at leaf nodes To: scikit-learn-general scikit-learn-general@lists.sourceforge.net Message-ID: CAAkaFLUB_ApLWGosUovxfEoEi34bcw-ePke0TBCKF3NrQpF

Re: [Scikit-learn-general] Access data arriving at leaf nodes

2014-10-14 Thread Olivier Grisel
2014-10-14 18:59 GMT-04:00 M Asad masad@gmail.com: I am not sure if there is already a method to get this but I have read docs and there doesnt seem to be any. Please correct me if I am wrong. Actually I am trying to get probability distribution at each leaf node, as done in the book

Re: [Scikit-learn-general] Access data arriving at leaf nodes

2014-10-14 Thread Joel Nothman
in range(0, index.shape[1]): leafVals[j,i] = forestClf.estimators_[i].tree_.value[index[j,i] Many thanks in advance Muhammad Date: Wed, 15 Oct 2014 07:59:09 +1100 From: Joel Nothman joel.noth...@gmail.com Subject: Re: [Scikit-learn-general] Access data arriving at leaf nodes