I have a very long list of strings. Each string actually contains multiple
values separated by a semi-colon. I need to turn each string into a vector
of the values delimited by the semi-colons. I know I can do this very
laboriously by using loops, nchar, and substr, but it is terribly slow. Is
there a basic R function that handles this situation? If not, is there
perhaps a faster way to do it than I currently am, which is to lapply the
following function? Thanks, Mark

#######################################################################################
string.tokenizer.func<-function(string, separator){
  new.vec<- NULL
  newString<- ""
  if(is.null(string)) {new.vec<-""} else {
    for(i in 1:(nchar(string) + 1)){
      if(substr(string, i, i) == separator){
        new.vec<-c(new.vec,newString)
        newString <- ""
      } else {
        newString<-paste(newString, substr(string, i, i), sep="")
      }
    }
    new.vec<-c(new.vec,newString)
  }
  new.vec
}
------------------------------------------------------------
Mark W. Kimpel MD ** Neuroinformatics ** Dept. of Psychiatry
Indiana University School of Medicine

15032 Hunter Court, Westfield, IN 46074

(317) 490-5129 Work, & Mobile & VoiceMail
(317) 399-1219 Home
Skype: mkimpel

******************************************************************

        [[alternative HTML version deleted]]

______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

Reply via email to