Someone supplied me with an SPSS datafile that caused a buffer overflow and then a crash when reading it in R. Unfortunately I can't supply the dataset at hand and I have a hard time reproducing it with a toy example. But I found at least 2 issues that might be related. I would like to know which of these are expected behavior, and which are bugs. I reproduced it on R 2.14.1 both on Ubuntu Linux and Windows 7...
Below some code. The files that are referenced in the code are available for download onĀ http://www.stat.ucla.edu/~jeroen/spss/ #load library library(foreign) #problem one: long string variable is converted to multiple variables. x <- read.spss("longstring.sav"); summary(x); #4 variables?? #problem two: use.labels does not deal correctly with duplicate labels and generates a bad factor. x <- read.spss("duplicate_labels.sav", use.value.labels=T); ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.