Hello all, can help clarify something?

According to R's lm() doc:

Non-NULL weights can be used to indicate that different observations
have different variances (with the values in weights being inversely
*proportional* to the variances); or equivalently, when the elements
of weights are positive integers w_i, that each response y_i is the
mean of w_i unit-weight observations (including the case that there
are w_i observations equal to y_i and the data have been summarized).

Since the idea here is *proportion*, not equality, shouldn't the vectors of weights x, 2*x give the same result? And yet they don't, standard errors differs:

> summary(lm(c(1,2,3,1,2,3)~c(1,2.1,2.9,1.1,2,3),weight=rep(1,6)))$sigma
[1] 0.07108323
> summary(lm(c(1,2,3,1,2,3)~c(1,2.1,2.9,1.1,2,3),weight=rep(2,6)))$sigma
[1] 0.1005269


So what if I know a-priori, observation A has variance 2 times bigger than observation B? Both weights=c(1,2) and weights=c(2,4) (and so on) represent very well this knowledge, but we get different regression (since sigma is different).


Also, if we do the same thing with a glm() model, than we get a lot of other differences like in the deviance.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to