Re: [R] Count data in random Forest

Volker Bahn Tue, 06 May 2008 13:28:13 -0700

Hi Birgit,

I'm not sure that I understand your question. I'll try to answeranyways. Regression trees and therefore also RandomForests are invariantto monotonic transformations in the independent variables. There are nodistributional assumptions for the independent variables. The dependentvariable, however, is used to calculate the variances within the twogroups of cases that result from a split. Therefore, it would make senseto have the dependent variable follow the typical distributionalrequirements of least-squares driven models such as homoscedasity,symmetrical distribution etc. For count data a square roottransformation is often appropriate.


HTH

Volker

Birgit Lemcke wrote:

<div class="moz-text-flowed" style="font-family: -moz-fixed">HelloR-user!
I am running R 2.7.0 on a Power Book (Tiger). (I am still R andstatistics beginner)
I try to find the most important variables to divide my dataset asgiven in a categorical variable using randomForest.
Is randomForest() able to deal with count data?
Or is there no difference because only the ranks are used in the trees?

Thanks in advance

Birgit

Birgit Lemcke
Institut für Systematische Botanik
Zollikerstrasse 107
CH-8008 Zürich
Switzerland
Ph: +41 (0)44 634 8351
[EMAIL PROTECTED]

175 Jahre UZH
«staunen.erleben.begreifen. Naturwissenschaft zum Anfassen.»
MNF-Jubiläumsevent für gross und klein.
19. April 2008, 10.00 Uhr bis 02.00 Uhr
Campus Irchel, Winterthurerstrasse 190, 8057 Zürich
Weitere Informationen http://www.175jahre.uzh.ch/naturwissenschaft


</div>


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Count data in random Forest

Reply via email to