Hi,
Try:
 FILELIST <- list.files()
FILELIST 
#[1] "FILE.csv"     "FILE.XXX.csv" "FILE.YYY.xls"

  sub("(.*)\\..*$", "\\1", basename(FILELIST))
#[1] "FILE"     "FILE.XXX" "FILE.YYY"


A.K.


On Wednesday, January 15, 2014 7:35 PM, Fisher Dennis <fis...@plessthan.com> 
wrote:
R 3.0.2
OS X

Colleagues

I am writing code to read a large number of files in a particular folder.  In 
some situations, there may be two versions of the file with different 
extensions, e.g.:
    FILE.csv
    FILE.xls
I extracted the portion before the extension with:
    sub("\\..*$", "", basename(FILELIST))
then used 
    duplicated
to find duplicates.  All was well until I encountered files named:
    FILE.XXX.csv
    FILE.YYY.xls

My regular expression extracted only the “FILE” portion of the text and claimed 
that the filenames (without the extensions) matched.  Can someone provide me 
with the appropriate regular expression to deal with this?  Thanks.

Dennis


Dennis Fisher MD
P < (The "P Less Than" Company)
Phone: 1-866-PLessThan (1-866-753-7784)
Fax: 1-866-PLessThan (1-866-753-7784)
www.PLessThan.com

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to