Re: [R] how to skip certain rows when reading data
On Thu, 27 Jul 2006, [EMAIL PROTECTED] wrote: Dear all, I am reading the data using read.table. However, there are a few rows I want to skip. How can I do that in an easy way? Suppose I know the row number that I want to skip. Thanks so much! The easy way is to read the whole data frame and using indexing (see `An Introduction to R') to remove the rows you do not want to retain. E.g. to remove rows 17 and 137 mydf - read.table(...)[-c(17, 137), ] -- Brian D. Ripley, [EMAIL PROTECTED] Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 __ R-help@stat.math.ethz.ch mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] how to skip certain rows when reading data
Have a look at readTable() in the R.utils package. It can do quite a few thinks like reading subsets of rows, specify colClasses by column names etc. Implementation was done so that memory usage is as small as possible. Note the note on the help page: WARNING: This method is very much in an alpha stage. Expect it to change.. It should work though. Examples: # Read every forth row df - readTable(pathname, rows=seq(from=1, to=1000, by=4)); # Read only columns 'chromosome' and 'position'. df - readTable(pathname, colClasses=c(chromosome=character, position=double), defColClass=NULL, header=TRUE, sep=\t); # Read 'log2' data chromosome by chromosome chromosome - readTableIndex(pathname, indexColumn=3, header=TRUE, sep=\t) for (cc in unique(chromosome)) { rows - which(chromosome == cc); df - readTable(pathname, rows=rows, colClasses=c(log2=double), defColClass=NULL, header=TRUE, sep=\t); ... } Cheers Henrik On 7/27/06, Prof Brian Ripley [EMAIL PROTECTED] wrote: On Thu, 27 Jul 2006, [EMAIL PROTECTED] wrote: Dear all, I am reading the data using read.table. However, there are a few rows I want to skip. How can I do that in an easy way? Suppose I know the row number that I want to skip. Thanks so much! The easy way is to read the whole data frame and using indexing (see `An Introduction to R') to remove the rows you do not want to retain. E.g. to remove rows 17 and 137 mydf - read.table(...)[-c(17, 137), ] -- Brian D. Ripley, [EMAIL PROTECTED] Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UKFax: +44 1865 272595 __ R-help@stat.math.ethz.ch mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. __ R-help@stat.math.ethz.ch mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
[R] how to skip certain rows when reading data
Dear all, I am reading the data using read.table. However, there are a few rows I want to skip. How can I do that in an easy way? Suppose I know the row number that I want to skip. Thanks so much! __ R-help@stat.math.ethz.ch mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] how to skip certain rows when reading data
Just read them in and throw them away: read.table(myfile.dat, ...whatever...)[-c(10, 12), ] On 7/27/06, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: Dear all, I am reading the data using read.table. However, there are a few rows I want to skip. How can I do that in an easy way? Suppose I know the row number that I want to skip. Thanks so much! __ R-help@stat.math.ethz.ch mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.