I have a data frame with 155,000 rows. One of the columns represents the user id (of which about 10,000 are unique). I am able to isolate 1000 of these user ids (stored in a list) that I want to eliminate from the data set, but I don't know of an efficient way to do this. Certainly this would be slow:
newdf<-df for(i in listofbadusers) { newdf<-subset(tmp,uid!=i) }
What about subsetting? See help("[").
One solution (not saying it is the optimal one):
newdf <- df[!(df$uid %in% listofbadusers), ]
Uwe Ligges
is there a better approach?
I guess I could use the opposite logic and use a list of good users and add their data to the new frame...
thanks, pete
______________________________________________ [EMAIL PROTECTED] mailing list https://www.stat.math.ethz.ch/mailman/listinfo/r-help
______________________________________________ [EMAIL PROTECTED] mailing list https://www.stat.math.ethz.ch/mailman/listinfo/r-help
