My understanding of current Random Forrest has a certain level of improvement for running on Hadoop cluster from data splitting alignment perspective for better balanced CPU utilization. Regards,,, Y.Mandai
iPhoneから送信 On 2013/03/25, at 14:48, Ted Dunning <[email protected]> wrote: > I think that there are some others who could say more. > > On Mon, Mar 25, 2013 at 6:01 AM, Ey-Chih chow <[email protected]> wrote: > >> On Mar 24, 2013, at 1:00 AM, Ted Dunning wrote: >> >>> - random forest, sequential and parallel implementations, new versions >> are being developed, the current version may or may not be useful to you. >>> >> Can you elaborate the usefulness of the current version and features of >> the new versions? Thanks. >> >> Ey-Chih Chow >> >> >> On Mar 24, 2013, at 1:00 AM, Ted Dunning wrote: >> >>> You are correct to suspect that this page is substantially out of date. >>> >>> Currently, Mahout has the following classifiers: >>> >>> - stochastic gradient descent for logistic regression (SGD) with L_1 or >> L_2 regularization, sequential version only. These classifiers can be >> easily extended with other gradients and regularizers which should make >> linear SVM's easy to implement. >>> >>> - naive bayes, sequential and parallel implementations >>> >>> - random forest, sequential and parallel implementations, new versions >> are being developed, the current version may or may not be useful to you. >>> >>> There are a variety of other classifiers which are in various states of >> utility. >>> >>> On Mar 24, 2013, at 4:07 AM, Chidananda Sridhar wrote: >>> >>>> Hi, >>>> >>>> I am doing a class project on classification and want to use Mahout. I >> was >>>> searching for the classification algorithms already implemented in >> Mahout >>>> and came to this page: >>>> https://cwiki.apache.org/confluence/display/MAHOUT/Algorithms >>>> >>>> The webpage says that Online Passive >>>> Aggressive< >> https://cwiki.apache.org/confluence/display/MAHOUT/Online+Passive+Aggressive >>> is >>>> integrated and the rest of the classification algorithms are open or >>>> awaiting commit. Does the webpage have the latest information, or is it >> yet >>>> to be updated? Is "Online Passive Aggressive" the only algorithm I can >> use >>>> for now? On the other hand, I see that most of the clustering algorithms >>>> have been integrated. >>>> >>>> Thanks, >>>> Chidananda >>> >> >>
