Hi all,

I've just released an R package to CRAN that creates pretty looking word
clouds. I think it makes a good minimal example of how to prototype an
algorithm in R, and then bring the performance bottleneck down to c++ to
improve speed.


An example:

>install.packages("wordcloud",repos="http://cran.r-project.org",type="source";)
>library(tm)
>data(crude)
>crude <- tm_map(crude, removePunctuation)
>crude <- tm_map(crude, function(x)removeWords(x,stopwords()))
>tdm <- TermDocumentMatrix(crude)
>m <- as.matrix(tdm)
>v <- sort(rowSums(m),decreasing=TRUE)
>d <- data.frame(word = names(v),freq=v
+ )
>library(wordcloud)
Loading required package: Rcpp
>#using c++ to help layout the words
>system.time(wordcloud(d$word,d$freq,scale=c(8,.1),min.freq=0))
  user  system elapsed
 9.979   0.049   9.878
>#using R code to do the same layout
>system.time(wordcloud(d$word,d$freq,scale=c(8,.1),min.freq=0,use.r.layout=T))
  user  system elapsed
151.919   0.716 146.737

Cheers,
Ian

_______________________________________________
Rcpp-devel mailing list
Rcpp-devel@lists.r-forge.r-project.org
https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/rcpp-devel

Reply via email to