Re: [R] missing values in logistic regression

Frank E Harrell Jr Fri, 29 Oct 2004 13:09:19 -0700

(Ted Harding) wrote:

On 29-Oct-04 Avril Coghlan wrote:

Dear R help list,

  I am trying to do a logistic regression
where I have a categorical response variable Y
and two numerical predictors X1 and X2. There
are quite a lot of missing values for predictor X2.
eg.,

Y     X1   X2
red   0.6  0.2    *
red   0.5  0.2    *
red   0.5  NA
red   0.5  NA
green 0.2  0.1    *
green 0.1  NA
green 0.1  NA
green 0.05 0.05   *

I am wondering can I combine X1 and X2 in
a logistic regression to predict Y, using
all the data for X1, even though there are NAs in
the X2 data?

Or do I have to take only the cases for which
there is data for both X1 and X2? (marked
with *s above)

I don't know of any R routine directly aimed at logistic regression
with missing values as you describe.

The aregImpute function in the Hmisc package can handle this, using predictive mean matching with weighted multinomial sampling of donor observations' binary covariate values.

. . ..

Ted.


--------------------------------------------------------------------
E-Mail: (Ted Harding) <[EMAIL PROTECTED]>

--
Frank E Harrell Jr   Professor and Chair           School of Medicine
                     Department of Biostatistics   Vanderbilt University

______________________________________________
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html

Re: [R] missing values in logistic regression

Reply via email to