Re: [R] anyone know why package "RandomForest" na.roughfix is so slow??

2010-07-02 Thread Liaw, Andy
t; R> nc <- 14037 >> R> x <- matrix(runif(nr*nc), nr, nc) >> R> n.na <- round(nr*nc/10) >> R> x[sample(nr*nc, n.na)] <- NA >> R> system.time(x.fixed <- na.roughfix(x)) >>    user  system elapsed >>    8.44    0.39    8.85 >> R 2.11.1, randomForest 4.5-35

Re: [R] anyone know why package "RandomForest" na.roughfix is so slow??

2010-07-01 Thread Mike Williamson
5 AM, Liaw, Andy wrote: > > > >> You need to isolate the problem further, or give more detail about your > >> data. This is what I get: > >> > >> R> nr <- 2134 > >> R> nc <- 14037 > >> R> x <- matrix(runif(nr*nc), nr, nc)

Re: [R] anyone know why package "RandomForest" na.roughfix is so slow??

2010-07-01 Thread Hadley Wickham
<- round(nr*nc/10) >> R> x[sample(nr*nc, n.na)] <- NA >> R> system.time(x.fixed <- na.roughfix(x)) >>    user  system elapsed >>    8.44    0.39    8.85 >> R 2.11.1, randomForest 4.5-35, Windows XP (32-bit), Thinkpad T61 with 2GB >> ram. >> >

Re: [R] anyone know why package "RandomForest" na.roughfix is so slow??

2010-07-01 Thread Mike Williamson
face, not na.roughfix() >> itself. >> >> If that is your case, try doing the imputation beforehand and run >> randomForest() afterward; e.g., >> >> myroughfixed <- na.roughfix(mybigdata) >> randomForest(myroughfixed[list.of.predictor.columns], >> myr

Re: [R] anyone know why package "RandomForest" na.roughfix is so slow??

2010-07-01 Thread Liaw, Andy
ginal Message- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Mike Williamson Sent: Wednesday, June 30, 2010 7:53 PM To: r-help Subject: [R] anyone know why package "RandomForest" na.roughfix is so

Re: [R] anyone know why package "RandomForest" na.roughfix is so slow??

2010-07-01 Thread Mike Williamson
gt; randomForest(myroughfixed[list.of.predictor.columns], > myroughfixed[[myresponse]],...) > > HTH, > Andy > > -Original Message----- > From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] > On Behalf Of Mike Williamson > Sent: Wednesday, June 30,

Re: [R] anyone know why package "RandomForest" na.roughfix is so slow??

2010-07-01 Thread Liaw, Andy
ginal Message- From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On Behalf Of Mike Williamson Sent: Wednesday, June 30, 2010 7:53 PM To: r-help Subject: [R] anyone know why package "RandomForest" na.roughfix is so slow?? Hi all, I am using the package "rand

Re: [R] anyone know why package "RandomForest" na.roughfix is so slow??

2010-06-30 Thread jim holtman
Use "Rprof" to determine where time is being spent. This might point out some problems in the code. On Wed, Jun 30, 2010 at 7:53 PM, Mike Williamson wrote: > Hi all, > >    I am using the package "random forest" for random forest predictions.  I > like the package.  However, I have fairly large

[R] anyone know why package "RandomForest" na.roughfix is so slow??

2010-06-30 Thread Mike Williamson
Hi all, I am using the package "random forest" for random forest predictions. I like the package. However, I have fairly large data sets, and it can often take *hours* just to go through the "na.roughfix" call, which simply goes through and cleans up any NA values to either the median (numer