I ran into this several times as well with scikit-learn's implementation of GBM. Look at xgboost if you haven't already (is there anyone out there who hasn't? :) -- it deals with missing values in the predictor space in a very elegant manner.

http://xgboost.readthedocs.io/en/latest/python/python_intro.html

https://arxiv.org/abs/1603.02754
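
For reference, here is a minimal sketch of what that looks like with the xgboost Python API. The toy data and parameters below are mine, not from either link; the point is just that NaN is treated as "missing" by default and each split learns a default direction for it:

# Minimal sketch: XGBoost handles NaN natively, no imputation needed.
import numpy as np
import xgboost as xgb

# Toy data with missing values in the predictor space
X = np.array([[1.0, np.nan],
              [2.0, 3.0],
              [np.nan, 4.0],
              [5.0, 6.0]])
y = np.array([0, 1, 0, 1])

# DMatrix takes a `missing` marker; np.nan is the default
dtrain = xgb.DMatrix(X, label=y, missing=np.nan)

params = {"objective": "binary:logistic", "max_depth": 2}
bst = xgb.train(params, dtrain, num_boost_round=10)

# Prediction also works on rows containing NaN
preds = bst.predict(xgb.DMatrix(X, missing=np.nan))
print(preds)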


Jeff



On 10/13/2016 2:14 PM, Stuart Reynolds wrote:
I'm looking for a decision tree and RF implementation that supports missing data (without imputation) -- ideally in Python, Java/Scala or C++.

It seems that scikit's decision tree algorithm doesn't allow this -- which is disappointing, because it's one of the few methods that should be able to sensibly handle problems with high rates of missingness.

Are there plans to allow missing data in scikit's decision trees?

Also, is there any particular reason why missing values weren't supported originally (e.g., does it integrate poorly with other features)?

Regards
- Stuart


_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn
