Hi:

       I think I found an error in plot.lm with the option which=5, of course I 
can be wrong , as usually happen,  but I had work on it for a while and show it 
to some other people that work with R, and so far I don't see what I can be 
interpreting wrong. I also worked over the plot.lm's code and change some lines 
to get what I call "the right plot",  if any body is interested I can send the 
modified code to see what is the problem I think I found and what could be a 
solution.

 

I´m working with R 2.4.0 on windows XP, and here is a reproducible example, 
(this example is just to show the problems in the plot and it doesn't make any 
sense the way I analyzed).

 

set.seed(3)

datos <-data.frame(fac.A=rep(c("bla","Ur2","pel","arb"),each=3),

                       y= c(rnorm(3,sd=0.5),rnorm(9,sd=2)))

model1 <- lm(y~fac.A,data=datos)

plot(model1,which=5)   # plot1 

 

# this plot1 show that level "arb" has less dispersion than the other levels, 

# But if  I do the plot by myself, look:

 

hii <- lm.influence(model1, do.coef = FALSE)$hat

s1 <- sqrt(deviance(model1)/df.residual(model1))

rs <- residuals(model1)/(s1 * sqrt(1 - hii))

 

plot(rs~datos$fac.A)   # plot2

 

# this plot2 show me that level "bla"  is less variable.

# also per and Url have some problems but this give you the idea of what I 
think Is wrong.

 

What I have found in the code, is that for this option (which=5),  the labels 
of the x axis are ordered in a way that the predicted value for the levels are 
increasing, but when it actually do the plot it doesn't keep that order.

 

Thanks for your time (and sorry for my English) .

                                     Gabriela

        [[alternative HTML version deleted]]

______________________________________________
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to