I have just had a glance of the source for decision forest, is a decision tree must be built against a dataset which must be loaded in memory?
Why can't a tree be built against a dataset resides on the disk as long as we can read it ? Regards, Xiaobo Gu On Wed, Jul 6, 2011 at 11:26 PM, Ted Dunning <[email protected]> wrote: > We really only have random forests. Tree methods are somewhat difficult to > parallelize and with large, sparse data, their advantages are not as > pronounced as with small data sets. > > On Wed, Jul 6, 2011 at 3:28 AM, Xiaobo Gu <[email protected]> wrote: > >> There is also a "Random Forests ", got more confused, can someone >> explain them to me please. >> >> Regards, >> >> Xiaobo Gu >> >> On Wed, Jul 6, 2011 at 6:21 PM, Xiaobo Gu <[email protected]> wrote: >> > And what's the progress of "Partial Implementation" of Decision forest >> > now, is it still in progress? >> > >> > >> > On Wed, Jul 6, 2011 at 6:11 PM, Xiaobo Gu <[email protected]> >> wrote: >> >> Hi, >> >> >> >> I have known the classic decision tree algorithm in traditional tools >> >> such as SPSS, but not so familiar with Decision forest in Mahout, can >> >> we treat them the same? >> >> >> >> Regards, >> >> >> >> Xiaobo Gu >> >> >> > >> >
