Re: [R] logistic regression with 50 varaibales

2010-06-14 Thread Marc Schwartz
On Jun 13, 2010, at 10:20 PM, array chip wrote: Hi, this is not R technical question per se. I know there are many excellent statisticians in this list, so here my questions: I have dataset with ~1800 observations and 50 independent variables, so there are about 35 samples per variable. Is

Re: [R] logistic regression with 50 varaibales

2010-06-14 Thread Joris Meys
Hi, Marcs explanation is valid to a certain extent, but I don't agree with his conclusion. I'd like to point out the curse of dimensionality(Hughes effect) which starts to play rather quickly. The curse of dimensionality is easily demonstrated looking at the proximity between your datapoints.

Re: [R] logistic regression with 50 varaibales

2010-06-14 Thread Claudia Beleites
Dear all, (this first part of the email I sent to John earlier today, but forgot to put it to the list as well) Dear John, Hi, this is not R technical question per se. I know there are many excellent statisticians in this list, so here my questions: I have dataset with ~1800 observations

Re: [R] logistic regression with 50 varaibales

2010-06-14 Thread Robert A LaBudde
I think the real issue is why the fit is being done. If it is solely to interpolate and condense the dataset, the number of variables is not an important issue. If the issue is developing a model that will capture causality, it is hard to believe that can be accomplished with 50+ variables.

Re: [R] logistic regression with 50 varaibales

2010-06-14 Thread Marc Schwartz
Joris, There are two separate issues here: 1. Can you consider an LR model with 50 covariates? 2. Should you have 50 covariates in your LR model? The answer to 1 is certainly yes, given what I noted below as a general working framework. I have personally been involved with the development

Re: [R] logistic regression with 50 varaibales

2010-06-14 Thread Charles C. Berry
On Mon, 14 Jun 2010, Joris Meys wrote: Hi, Marcs explanation is valid to a certain extent, but I don't agree with his conclusion. I'd like to point out the curse of dimensionality(Hughes effect) which starts to play rather quickly. Ahem! ... minimal, self-contained, reproducible code ...

Re: [R] logistic regression with 50 varaibales

2010-06-14 Thread array chip
correlation, How would LR model behave? Thanks John - Original Message From: Charles C. Berry cbe...@tajo.ucsd.edu To: Joris Meys jorism...@gmail.com Cc: r-help@r-project.org; Marc Schwartz marc_schwa...@me.com Sent: Mon, June 14, 2010 8:32:02 AM Subject: Re: [R] logistic regression

Re: [R] logistic regression with 50 varaibales

2010-06-14 Thread Charles C. Berry
: Charles C. Berry cbe...@tajo.ucsd.edu To: Joris Meys jorism...@gmail.com Cc: r-help@r-project.org; Marc Schwartz marc_schwa...@me.com Sent: Mon, June 14, 2010 8:32:02 AM Subject: Re: [R] logistic regression with 50 varaibales On Mon, 14 Jun 2010, Joris Meys wrote: Hi, Marcs explanation is valid

[R] logistic regression with 50 varaibales

2010-06-13 Thread array chip
Hi, this is not R technical question per se. I know there are many excellent statisticians in this list, so here my questions: I have dataset with ~1800 observations and 50 independent variables, so there are about 35 samples per variable. Is it wise to build a stable multiple logistic model