A useful result is in the current Statistics in Medicine:

@Article{buy00r2,
  author =    {Buyse, Marc},
  title =    {$R^2$: {A} useful measure of model performance when
                  predicting a dichotomous outcome},
  journal =   SM,
  year =    2000,
  volume =   19,
  pages =   {271-274},
  note =   {Letter to the Editor regarding {\em Statistics in
                  Medicine} 18:375-384; 1999}
}



Milo Schield wrote:

> QUESTION:  What is the theoretical maximum value of R-sq ** when binary data
> (Y) is obtained from a simple linear model?
>
> The data is binary with Y values taken from a linear model going from 0 to 1
> over the range of X.
>
> The binary sequences of Y values are organized to minimize* the standard
> deviation around the model.
>
> TYPE REGRESSION            DISTRIBUTION OF X VALUES
> a.     OLS                                     linear
> b.     OLS                                     normal [width truncated at 6
> sigma?]
> c.     Logistic                                linear
> d.     Logistic                                normal [width truncated at 6
> sigma?]
>
> Based on some discrete trials, I get the following estimates for R-sq:
> a.  99%
> b.  16%
> c.  96%
> d.  16%
>
> * On the selection of binary Y values.  Suppose the X values are linearly
> distributed from 0 to 1 and the Model is Y=X.  In the discrete case with 100
> points, the first 5 would be all zeros and the last 5 would be all ones.  At
> the center, half the points would be zeroes and the other half would be
> ones.
>
> ** R^2 = (S^2 around mean   -   S^2 around model) / (S^2 around mean)

--
Frank E Harrell Jr
Professor of Biostatistics and Statistics
Division of Biostatistics and Epidemiology
Department of Health Evaluation Sciences
University of Virginia School of Medicine
http://hesweb1.med.virginia.edu/biostat




===========================================================================
  This list is open to everyone. Occasionally, people lacking respect
  for other members of the list send messages that are inappropriate
  or unrelated to the list's discussion topics. Please just delete the
  offensive email.

  For information concerning the list, please see the following web page:
  http://jse.stat.ncsu.edu/
===========================================================================

Reply via email to