Re: [R] Linear regression with a rounded response variable

Charles C. Berry Wed, 21 Oct 2015 11:00:02 -0700

On Wed, 21 Oct 2015, Ravi Varadhan wrote:

Hi, I am dealing with a regression problem where the response variable,time (second) to walk 15 ft, is rounded to the nearest integer. I donot care for the regression coefficients per se, but my main interest isin getting the prediction equation for walking speed, given thepredictors (age, height, sex, etc.), where the predictions will be realnumbers, and not integers. The hope is that these predictions shouldprovide unbiased estimates of the "unrounded" walking speed. Thesesounds like a measurement error problem, where the measurement error isdue to rounding and hence would be uniformly distributed (-0.5, 0.5).

Not the usual "measurement error model" problem, though, where the errorsare in X and not independent of XB.

Look back at the proof of the unbiasedness of least squares under theGauss-Markov setup. The errors in Y need to have expectation zero.

From your description (but see caveat below) this is true of walking

*time*, but not not exactly true of walking *speed* (modulo the usualassumptions if they apply to time). In fact if E(epsilon) = 0 were true ofunrounded time, it would not be true of unrounded speed (and vice versa).

Are there any canonical approaches for handling this type of a problem?

Work out the bias analytically? Parametric bootstrap? Data augmentationand friends?

What is wrong with just doing the standard linear regression?


Well, what do the actual values look like?

If half the subjects have a value of 5 seconds and the rest are splitbetween 4 and 6, your assertion that rounding induces an error ofdunif(epsilon,-0.5,0.5) is surely wrong (more positive errors in the 6second group and more negative errors in the 4 second group under anyplausible model).



HTH,

Chuck

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Linear regression with a rounded response variable

Reply via email to