t; R> nc <- 14037
>> R> x <- matrix(runif(nr*nc), nr, nc)
>> R> n.na <- round(nr*nc/10)
>> R> x[sample(nr*nc, n.na)] <- NA
>> R> system.time(x.fixed <- na.roughfix(x))
>> user system elapsed
>> 8.44 0.39 8.85
>> R 2.11.1, randomForest 4.5-35
5 AM, Liaw, Andy wrote:
> >
> >> You need to isolate the problem further, or give more detail about your
> >> data. This is what I get:
> >>
> >> R> nr <- 2134
> >> R> nc <- 14037
> >> R> x <- matrix(runif(nr*nc), nr, nc)
<- round(nr*nc/10)
>> R> x[sample(nr*nc, n.na)] <- NA
>> R> system.time(x.fixed <- na.roughfix(x))
>> user system elapsed
>> 8.44 0.39 8.85
>> R 2.11.1, randomForest 4.5-35, Windows XP (32-bit), Thinkpad T61 with 2GB
>> ram.
>>
>
face, not na.roughfix()
>> itself.
>>
>> If that is your case, try doing the imputation beforehand and run
>> randomForest() afterward; e.g.,
>>
>> myroughfixed <- na.roughfix(mybigdata)
>> randomForest(myroughfixed[list.of.predictor.columns],
>> myr
ginal Message-
From: r-help-boun...@r-project.org
[mailto:r-help-boun...@r-project.org]
On Behalf Of Mike Williamson
Sent: Wednesday, June 30, 2010 7:53 PM
To: r-help
Subject: [R] anyone know why package "RandomForest" na.roughfix
is so
gt; randomForest(myroughfixed[list.of.predictor.columns],
> myroughfixed[[myresponse]],...)
>
> HTH,
> Andy
>
> -Original Message-----
> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org]
> On Behalf Of Mike Williamson
> Sent: Wednesday, June 30,
ginal Message-
From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org]
On Behalf Of Mike Williamson
Sent: Wednesday, June 30, 2010 7:53 PM
To: r-help
Subject: [R] anyone know why package "RandomForest" na.roughfix is so
slow??
Hi all,
I am using the package "rand
Use "Rprof" to determine where time is being spent. This might point
out some problems in the code.
On Wed, Jun 30, 2010 at 7:53 PM, Mike Williamson wrote:
> Hi all,
>
> I am using the package "random forest" for random forest predictions. I
> like the package. However, I have fairly large
Hi all,
I am using the package "random forest" for random forest predictions. I
like the package. However, I have fairly large data sets, and it can often
take *hours* just to go through the "na.roughfix" call, which simply goes
through and cleans up any NA values to either the median (numer
9 matches
Mail list logo