Hi Maryam

Currently, no tree based methods have a partial fit method. We are
currently working on expanding the tree module, you can see our checklist
here; https://github.com/scikit-learn/scikit-learn/issues/5212

There are many methods to reduce the dimensionality of data, if you are
using high dimensional data, including Gaussian random projections or LSH
for continuous data, and collapsing samples into higher weight samples for
discrete data. If you provide more information about your use case, I may
be able to be of more help.

Jacob

On Wed, Sep 30, 2015 at 5:12 AM, Maryam Tavakol <maryam.tavakol...@gmail.com
> wrote:

> Dear all,
>
> I am using Gradient Boosting Classifier from scikit-learn for a huge set
> of data. Unfortunately, the method loads the whole data into memory (around
> 45 GBs!). As it is not very easy to modify the code to stream data, is
> there any other way to make it scalable?
>
> Best Regards,
> Maryam Tavakol
>
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Scikit-learn-general mailing list
> Scikit-learn-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
>
------------------------------------------------------------------------------
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to