I'd be happy with adding Poisson loss to more models, though I think it
would be more natural to add it to GLM first, before GBM ;)
If the addition is straightforward, I think it would be a nice
contribution nevertheless.
1) Requiring the user to do np.exp(gbmpoisson.predict(X)) themselves is
not acceptable. This needs to be automatic. It would be best if this
could be done in a minimally intrusive way (see the sketch below, after
point 3).
2) I'm not sure, maybe Peter can comment?
3) I would rather have the contribution sooner, but others might think
differently. Silently ignoring sample weights is not an option, but you
can raise an error if they are provided.
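Something along these lines would count as minimally intrusive for (1).
To be clear, this is only an illustrative sketch: the class and method
names are hypothetical, not the actual gradient_boosting.py internals.

import numpy as np

# Illustrative sketch only -- hypothetical names, not sklearn internals.
class SquaredErrorLoss:
    def inverse_link(self, raw):
        return raw          # identity: raw predictions are already on the target scale

class PoissonLoss:
    def inverse_link(self, raw):
        return np.exp(raw)  # trees are fit in log-space, map back to the mean

def predict(loss, raw_predictions):
    # predict() delegates to the loss object, so the user never calls np.exp
    return loss.inverse_link(raw_predictions)

That way nothing changes for the existing losses, and the exp only
happens when Poisson loss was selected.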
Hth,
Andy
On 07/23/2015 08:52 PM, Peter Rickwood wrote:
Hello sklearn developers,
I'd like the GBM implementation in sklearn to support Poisson loss,
and I'm comfortable writing the code (I have already modified my local
sklearn source and am using Poisson loss GBMs).
The sklearn site says to get in touch via this list before making a
contribution, so is it worth me submitting something along these
lines?
If the answer is yes, some quick questions:
1) The simplest implementation of Poisson loss GBMs is to work in
log-space (i.e. the GBM predicts log(target) rather than target), and
require the user to then take the exponential of those predictions
(the loss itself is sketched after question 3 below). So, you would
need to do something like:
import numpy as np
import sklearn.ensemble

gbmpoisson = sklearn.ensemble.GradientBoostingRegressor(...)
gbmpoisson.fit(X, y)
preds = np.exp(gbmpoisson.predict(X))
I am comfortable making changes to the source for this to work, but
I'm not comfortable changing any of the higher-level interface to deal
automatically with the transform. In other words, other developers
would either need to be OK with the GBM returning log-scale
predictions when "poisson" loss is chosen, or someone would need to
change code in the 'predict' function to automatically do the
transformation if Poisson loss is specified. Is this OK?
2) If I do contribute, can you advise on the best tests to validate
GBM loss functions before they are considered to 'work'?
3) Allowing for weighted samples is in theory easy enough to
implement, but is not something I have implemented yet. Is it better
to contribute code sooner that doesn't handle weighting (i.e. just
ignores sample weights), or to contribute later with code that does?
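For what it's worth, the log-space formulation I'm using for (1) is just
the Poisson negative log-likelihood with a log link. A standalone sketch
(not my actual patch to gradient_boosting.py):

import numpy as np

# Standalone sketch of the log-space Poisson loss; not the actual patch.
def poisson_loss(y, raw_predictions):
    # negative Poisson log-likelihood (up to the log(y!) constant),
    # where raw_predictions = log(predicted mean)
    return np.mean(np.exp(raw_predictions) - y * raw_predictions)

def negative_gradient(y, raw_predictions):
    # pseudo-residuals that each boosting stage fits: y - exp(raw)
    return y - np.exp(raw_predictions)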
Cheers, and thanks for all your work on sklearn. Fantastic tool/library,
Peter