I am working with a longitudinal data set in the long format. This data
set has three observations per grade level per year. Here are the first
10 rows of the data frame:

 

>tenn.dat[1:10,]

 

year  schid type grade gain  se new cohort

6  2001 100005    5     4 33.1 3.5   4      3

7  2002 100005    5     4 33.9 3.9   4      2

8  2003 100005    5     4 32.3 4.2   4      1

10 2001 100005    5     5 22.9 4.0   5      4

11 2002 100005    5     5 25.0 3.4   5      3

12 2003 100005    5     5  7.8 3.8   5      2

18 2001 100010    1     4 34.4 5.9   4      3

19 2002 100010    1     4 27.8 5.6   4      2

20 2003 100010    1     4 34.6 6.8   4      1

22 2001 100010    1     5 21.1 4.8   5      4

 

I need to create a new column in this data frame with the mean gain for
each grade by year and the sd for each grade by year.

 

So, I used tapply as follows:

 

tapply(tenn.dat[,5],tenn.dat[,c(1,4)],mean) and
tapply(tenn.dat[,5],tenn.dat[,c(1,4)],sd)  which produces exactly the
data I would like to attach in column 1 and 2 respectively. 

 

I am having a problem connecting this back with the corresponding rows
in the data frame.

 

If I used only one factor instead of two, I was successful connecting
this with the data frame using:

 

m.gain<-tapply(tenn.dat[,5],tenn.dat[,4],mean)

 

tenn.dat$m.gain<-m.gain[as.character(tenn.dat$grade)]

 

can anyone offer suggestions on a next step?

 

Thanks,

 

Harold

 

 


        [[alternative HTML version deleted]]

______________________________________________
[EMAIL PROTECTED] mailing list
https://www.stat.math.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html

Reply via email to