I am helping a graduate student with her analysis of the diameters of 
cultured mammalian cells and she is looking at the difference between two 
factors:  'days of incubation' and 'initial plating density.'  She does the 
treatments and then measures the diameters of the cells from images 
obtained from a microscope.

She chose as 'Days of Incubation' 2, 5, and 9 days.  For 'Initial Plating 
Density' she chose 10 per cm2 (cm2=centimeter squared), 100/cm2, and 
1000/cm2.

When she collected the data, she did not have equal numbers of data 
(measured diameters) for all levels.  I provide the count of each dataset 
in the table below (the table has more meaning if your nntp client uses a 
fixed pitch font):


 Counts of data (n) for each dataset:

                         Initial Plating Density
               --------------------------------------------
Days of       |   10/cm2      |   100/cm2    |  1000/cm2   |
Incubation    |               |              |             |
--------------+---------------+--------------+-------------+
              |               |              |             |
      2       |    n = 0      |    n = 11    |    n = 0    |
              |               |              |             |
--------------+---------------+--------------+-------------+
              |               |              |             |
      5       |    n = 22     |    n = 38    |    n = 31   |
              |               |              |             |
--------------+---------------+--------------+-------------+
              |               |              |             |
      9       |    n = 19     |    n = 142   |    n = 72   |
              |               |              |             |
--------------+---------------+--------------+-------------+

As you can see, in some cases she did not get any data at all for a 
particular treatment (2-days at 10/ and 1000/cm2), and in other cases, she 
outdid herself in collecting more diameters than she really need (day 9 at 
100/cm2, and probably 1000/cm2).

I cranked through the 2-way ANOVA on my Excel spreadsheet (I did not use 
Microsoft's bundled analytical tool, and I don't have SPSS or other 
handholding tool) and got these:

===============================================================
Source of                                       
Variation             SS      df      MS        F        p
------------      ---------- ----- -------- --------- ---------
Densities                46.937     2   23.48    0.255      0.775
Days                   11.828     2    5.914   0.0642     0.938
Days x Densities           16.688     2    8.344   0.0906     0.913
Error               30196.541   328     92.06
Total             39189.826   334
===============================================================

This is without having tested the assumptions required for ANOVA in 
general.  With the grossly unequal sample sizes, it is clearly essential to 
test for normality and variance homogeneity.  These are my tests for 
variance homogeneity:

===============================================================
Levene's
              SS      df     MS         F          p
            -------  ----- -------- --------  ----------
between      1170.5     6   195.1        5.59      0.000015
within      11450.0   328    34.91
total       12620.6   334
===============================================================

===========================================================================
Bartlett's
             2d-10    5d-10    5d-100   5d-1000   9d-10   9d-100   9d-1000
             ------  -------  -------  --------  ------- -------- ---------
SS[i]         344.10  2421.69  3063.46   714.20  2009.04 18539.90   3104.15
df             10       21       37       30       18      141        71
df * ln(VAR)   35.38    99.70   163.41    95.10    84.87   687.93    268.22
1 / df          0.10     0.05     0.03     0.03     0.06     0.01      0.01

pooled Var        92.06                                                         
LN(pooled Var)     4.52
B                 48.76     (Bartlett's T statistic numerator)
C                  1.03     (Bartlett's T statistic denominator)
T                 47.28     (Bartlett's T statistic)
chi-square [T,3]   p <<<< 0.0001
===========================================================================

These after throwing out one outlier.

I don't have Sokal & Rohlf's stat tables appendix to look up NED values for 
Q-Q plots to check for normality, so got stuck there.

My grad student input her data into SPSS (for Win) and its analysis 
reported that there was a significant difference in the factor for initial 
plating density (something she was hoping for), and no sig diff for 
anything else, but SPSS did not report what "correction" it used.

My question:

What 2-Way ANOVA tool do I use for exploring differences in these 
treatments if the variances between groups is clearly not equal 
(homogeneous)?

I am reading throught Sokal & Rohlf's 3rd Edition and have trekked off on 
possibly doing ANCOVA on the data, and I have yet to see what J. H. Zar's 
4th Edition will have me doing.

Thanks.
.
.
=================================================================
Instructions for joining and leaving this list, remarks about the
problem of INAPPROPRIATE MESSAGES, and archives are available at:
.                  http://jse.stat.ncsu.edu/                    .
=================================================================

Reply via email to