Re: differences between groups/treatments ?

Donald Burrill Tue, 20 Jun 2000 13:54:07 -0700
On Tue, 20 Jun 2000, Murtagh wrote:

> Firstly, thank you for your comments. Am I right in saying that the two
> (equivalent) options I have are:
> 
> 1.    ANOVA
> 
> Yijk = mew + Ai + Bj + ABij + Eijk
> 
> Ai:   a fixed factor representing the treatments (2 levels)
> Bj:   a fixed factor representing prior perfromance (2 levels)
> ABij: an interaction between Ai and Bj
> Yijk: the score of the kth child who received treatment i and is from 
>               group j 
> Eijk: random error
> 
> I suspect that this model is inapporpriate, as the Eijk term will represent
> between subjects (children) variation, which is not usually included in the
> estimate of random error.

I do not understand this comment.  What source(s) of random error exist 
in this design APART from variation between subjects within cells?  
Between-subjects variation (as residuals from the model) defines the 
standard error-variance term against which the variability in the 
systematic effects is tested.
 
> 2.    MLR
> 
> Y = Bo + B1*X1 + B2*X2 + B3*X3 + E
> 
> X1:   prior performance (0 => weak, 1 => strong)
> X2:   treatment (0 => treament A, 1 => treatment B)
> X3:   treatment*prior performance
                -- hence with the coding shown for X1 and X2,  1 => 
strong prior performance and treatment B, 0 => all other conditions.

And  E = Eijk of the ANOVA model.  B1 is a straightforward function 
(depending on the coding of X1, of course) of the Ai in the ANOVA model, 
B2 of the Bj (and depends on the coding of X2), and B3 of Ai, Bj, and 
ABij. 
 
> I appreciate that prior performance is probably better considered as a
> continuum, rather than a dichotomy.

_I_ would consider it so.  In fact, the first thing I'd do is ask for 
scatterplots of post-performance vs. pre-performance for all the cells 
in the design I was considering.  (In what you've described, that's two 
cells.)  THEN decide whether it appeared to make better sense to divide 
the continuum into two (or more) pieces, or to model it AS a continuum, 
possibly with non-linear functions.

> >> 1.  If there are children of different sexes, you may be able to 
> >> consider a three-way design, although I suspect it would be 
> >> unbalanced, which (I also suspect!) may induce serious difficulties 
> >> for you.

> You mean that there would not be the same numbers in each group? 

                Yes.

> I can't see why this should cause problems, but then that's probably 
> due to my relative ignorance of linear models!

Doesn't cause problems in one-way designs.  But in 2-way designs (let 
alone 3-way, 4-way, ...) unequal  n's  induce association of some kind 
between the design factors.  People who do multiple regression don't have 
much problem with this, it's their normal situation;  but people who try 
to do formal ANOVA design-of-experiments (and are therefore accustomed to 
the notion that the factors are mutually independent (and therefore are 
orthogonal)) are sometimes boggled by (1) the fact that the sums of 
squares for the several sources of variation do not simply add to the 
total sum of squares about the grand mean, or (2) the fact that the 
sums of squares reported depend on the order in which the factors are 
considered.  And many of the standard packages for doing multi-factor 
ANOVA use algorithms that require the design to be balanced. 
 (A GLM -- general linear model -- program does not usually have such 
constraints, and may even produce output patterned after the form of a 
standard balanced ANOVA, but one needs to be aware of (1) and (2) above.) 

> >> 2.  Your Performance information you have chosen to dichotomize,
> >> although it is presumably (quasi-)continuous to start with.  You 
> >> might find out something useful by treating it as a continuous 
> >> predictor rather than as a dichotomy:  in effect carrying out an 
> >> analysis of covariance with pre-treatment reading score as the 
> >> covariate, whether you used an "Analysis of Covariance" program or 
> >> a "Multiple Regression" program or a "General Linear Model" (GLM) 
> >> program to do the arithmetic. 
> 
> Presumably, this could achieved by simply using the pre-treatment score 
> itself (rather than 0 or 1) for the value of X1 in the suggested MLR 
> model above?
                        Right. 
 And if the pre-post relationship should turn out to be detectably 
nonlinear, you can substitute some candidate nonlinear function(s) of X1 
and see if that helps.

There may be nonlinearity to be EXPECTED:  in the nature of a reading 
test, there is a highest possible score (all items right, e.g.) and a 
lowest possible score (no items right, e.g.).  Students who perform well 
pre-treatment cannot have change scores that would put them above the 
highest possible score at post-treatment;  so it would not be surprising 
if (a) change correlates negatively with pre-treatment, (b) post scores 
were censored at the maximum (and negatively skewed), (c) pre scores were 
censored at the minimum (and positively skewed), and/or (d) the post vs. 
pre scatterplot showed curvature at one end or the other (or both).

One way of dealing with nonlinearity, in the absence of strong theory 
that would predict a particular form of nonlinear function, is to divide 
the subjects into groups based on pre-treatment performance (and on the 
observed post-vs.-pre relationship);  the optimum number of groups might 
not be two, however.
                        -- DFB.
 ------------------------------------------------------------------------
 Donald F. Burrill                                 [EMAIL PROTECTED]
 348 Hyde Hall, Plymouth State College,          [EMAIL PROTECTED]
 MSC #29, Plymouth, NH 03264                                 603-535-2597
 184 Nashua Road, Bedford, NH 03110                          603-471-7128  


===========================================================================
This list is open to everyone.  Occasionally, less thoughtful
people send inappropriate messages.  Please DO NOT COMPLAIN TO
THE POSTMASTER about these messages because the postmaster has no
way of controlling them, and excessive complaints will result in
termination of the list.

For information about this list, including information about the
problem of inappropriate messages and information about how to
unsubscribe, please see the web page at
http://jse.stat.ncsu.edu/
===========================================================================
Re: differences between groups/treatments ?

Reply via email to