On Mon, 11 Aug 2003 11:13:12 -0400, Rich Ulrich <[EMAIL PROTECTED]>
wrote:

>On Mon, 11 Aug 2003 09:35:19 -0400, Ken Butler
><[EMAIL PROTECTED]> wrote:
>
>> On Mon, 11 Aug 2003 12:07:35 +0200, Piotr Wi¶niewski <[EMAIL PROTECTED]>
>> wrote:
>> 
>> >t-Student test
>> >log-transformed data
>> >------------------------
>> >    n       mean    SD
>> >X1  30       0.26   0.46
>> >X2  6       -0.11   0.17
>> >
>> >p=0.17
>> 
>> Are you using the "pooled" t-test that assumes equal population
>> variances for the two groups?
>> 
>> If you are, this situation, where the smaller group also has the
>> smaller SD, can give very misleading P-values.
>
>That last comment is misleading, too.  It  implies 
>that the *other*  t-test is not  (approximately) 
>just as flawed.  Which it is.

If by "other t-test" you mean where the smaller sample has the larger
of two notably unequal SDs, then I agree (the P-value will then tend
to be too small).

But if you mean the Welch t-test (where equal variances are not
assumed, and the P-value is an approximation based on a choice of df
to make a t-distribution fit well), then my agreement is rather more
qualified. There are two (or more) issues:

- robustness: what happens to P-values when the populations are not
normal? The answer here is probably "bad things" whether you use the
pooled-variance or Welch tests.

- type I error probabilities when the populations are normal. My
understanding is that it's only the pooled test that suffers here,
something sort-of confirmed by a few texts and confirmed rather more
dramatically by a little simulation.

I generated random samples of sizes 30 and 6 from normal distributions
with SDs 0.46 and 0.17 (to echo the figures quoted above) and means
both 0. I first did 1000 runs using the pooled test, and found that:

I rejected at alpha=0.10  0.008 of the time
                    0.05  0.003 of the time
                    0.01  0.000 of the time

The P-values are way too big, and we are rejecting way too
infrequently. This is what I expected.

Compare this with the results of 1000 runs of the Welch test:

I rejected at alpha=0.10  0.094 of the time
                    0.05  0.045 of the time
                    0.01  0.011 of the time

There's no suggestion here of any systematic effect on P-values.

>I notice that the SDs are rather different for the
>log-transformed numbers, above, which does not
>lead one to regard the log fixing the matter.

Indeed. Though the original poster said (if I understand correctly)
that the log-transform made the data in the separate groups look more
normal-shaped, which is one step forward. (How you assess normality
with a sample size of 6 is another question.)

Cheers,
Ken.

-- 
Ken Butler, Lecturer (Statistics)
University of Toronto at Scarborough
butler (at) utsc.utoronto.ca
http://www.utsc.utoronto.ca/~butler
.
.
=================================================================
Instructions for joining and leaving this list, remarks about the
problem of INAPPROPRIATE MESSAGES, and archives are available at:
.                  http://jse.stat.ncsu.edu/                    .
=================================================================

Reply via email to