Gentlemen,

I agree with both of you. Several correspondents had already pointed out that a
logistic regression approach would be more appropriate in this situation.
However, I was trying to steer the discussion in a slightly different direction,
which I suspect may be the subject of Gerhard's interest (and of mine, but
Gerhard can speak for himself). Perhaps I should rephrase the question to read
'What are the consequences of performing an analysis on a discreet dependent
variable as if it were continuous?' And I'm thinking of situations other than
just linear/logistic regression. For example, in an artificial reproduction
situation, the outcome may be clearly discreet (e.g. the number of IVF attempts)
and numerically small enough for the discreteness to be (potentially) important
(most women only undergo relatively few attempts before dropping out/changing
therapy). Not atypically, such studies are analysed using a Cox model, a
continuous outcome model, using the number of attempts as 'survival time'. How
valid are the conclusions from such an analysis? Under what circumstances does
the analysis go up in smoke? etc. etc.

Regards

Joseph

Jerry Dallal wrote:

> Herman Rubin wrote:
> >
> > In article <[EMAIL PROTECTED]>,
> > Joseph McDonnell  <[EMAIL PROTECTED]> wrote:
> > >I may be wrong, but I thought that Gerhard was asking something like "If I
> > >perform a linear regression but with a dichotomous dependent variable, do
> > >I get 'garbage' results?"
> >
> > The results must be at least partly garbage.  We can
> > consider the dichotomous variable to be 0 or 1, and,
> > using expectation, should interpret an answer between
> > 0 and 1 as a probability.
> >
> > However, what meaning can be given to <0 or >1?  The
> > TRUE "linear regression" does not give the conditional
> > expected value of the dependent random variable given
> > the independent random variables, here the probability
> > of 1, as it does in a linear model..
> >
> > >Joseph
>
> (1) Perhaps part of the problem is in the premise.
> The subject header specifies "ANOVA".
> With 2 groups, Student's t test applied to a binary outcome for
> large samples is, for all practical purposes, the square root of
> Pearson's chi-square statistic for homogeneity of proportions.
> I have some simulation results for more than two groups which
> suggests that ANOVA can be liberal in the 2xc case with nominal
> levels of 0.05 being closer to 0.03.
>
> (2) Regression of a 0/1 variable on a continuous predictor is
> Fisher's version of discriminant analysis.  What Professor Rubin
> says about the possiblity of misinterpretation is true, but that
> doesn't invalidate the technique for use as Fisher intended.



=================================================================
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
                  http://jse.stat.ncsu.edu/
=================================================================

Reply via email to