On 03/24/2010 06:08 AM, kathy_BJ wrote:

Thank you so much for help, Jim. But I didn't get any plot after I submit the
job except below, which indicating the number of lines for the two inputs.
What's the problem?

Read 3360 records
Read 3360 records

I know I didn't confuse you.

Okay, let's make up some data and produce a set of boxplots.

file1<-data.frame(
 col1=sample(c(050000000:052000000,220000000:229000000),1000),
 col2=sample(0:270,1000,TRUE),
 col3=runif(1000,50,80),col4=runif(1000,50,80),
 col5=sample(c("RED","GREEN","BLUE"),1000,TRUE))
file2<-data.frame(
 col1=sample(c(050000000:052000000,220000000:229000000),1000),
 col2=sample(0:270,1000,TRUE),
 col3=runif(1000,50,80),col4=runif(1000,50,80),
 col5=sample(c("RED","GREEN","BLUE"),1000,TRUE))
file1$col_group<-cut(file1$col2,breaks=c(0,20,50,70,271),
 right=FALSE)
file2$col_group<-cut(file2$col2,breaks=c(0,20,50,70,271),
 right=FALSE)
reds1<-file1$col5=="RED"
reds2<-file2$col5=="RED"
greens1<-file1$col5=="GREEN"
greens2<-file2$col5=="GREEN"
blues1<-file1$col5=="BLUE"
blues2<-file2$col5=="BLUE"
file1$col43<-file1$col4-file1$col3
file2$col43<-file2$col4-file2$col3
col_group11<-as.numeric(file1$col_group)==1
col_group12<-as.numeric(file1$col_group)==2
col_group13<-as.numeric(file1$col_group)==3
col_group14<-as.numeric(file1$col_group)==4
col_group21<-as.numeric(file2$col_group)==1
col_group22<-as.numeric(file2$col_group)==2
col_group23<-as.numeric(file2$col_group)==3
col_group24<-as.numeric(file2$col_group)==4
x11(width=9)
boxplot(file1$col43[reds1&col_group11],
 file2$col43[reds2&col_group21],
 file1$col43[reds1&col_group12],
 file2$col43[reds2&col_group22],
 file1$col43[reds1&col_group13],
 file2$col43[reds2&col_group23],
 file1$col43[reds1&col_group14],
 file2$col43[reds2&col_group24],
 main="Comparison of RED col4 and col3 differences across col2 groups",
 names=rep("",8))
require(plotrix)
staxlab(at=1:8,
 labels=paste(c("file1","file2"),rep(levels(file1$col_group),each=2)))

This should produce a set of 8 boxplots comparing the medians, etc. of the differences of col4 and col3 between file1 and file 2 for all of the entries with col5="RED" across the levels of col_group. I don't know whether this is what you want, but it might get you started.

Jim

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to