In the Insurance dataset both Age and Group are ordered factors, so the default
encoding for them is orthogonal polynomial contrasts (contr.poly), assuming that
the user has not changed the default contrasts option.  In the output below, .L
marks the row for the "Linear" piece of the encoding, i.e. the linear contrast
across the ordered levels, .Q is the "Quadratic" piece/contrast, and .C is the
"Cubic" one.  If it is not clear what is meant by linear/quadratic/cubic here,
some background reading on orthogonal polynomials will help.
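
As a small sketch of where the .L/.Q/.C labels come from (assuming MASS is
installed and the contrasts option is still at its default), you can check the
factors and look at the contrast matrix R uses for a 4-level ordered factor:

```r
library(MASS)   # provides the Insurance data frame

# Both variables are ordered factors, so R picks polynomial contrasts for them
is.ordered(Insurance$Age)
is.ordered(Insurance$Group)

# Default contrasts: first entry is for unordered factors, second for ordered
getOption("contrasts")

# Contrast matrix for a 4-level ordered factor; its columns are .L, .Q and .C
cp <- contr.poly(4)
print(round(cp, 3))

# The columns are orthonormal, so this is (numerically) the identity matrix
print(round(crossprod(cp), 10))
```

Each coefficient such as Age.L is the weight on one of these columns, which is
why there are three rows per 4-level factor in the summary.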

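If you read the data in yourself from a .csv file, then Age and Group will not
be ordered factors unless you specifically convert them.  With R's default
settings the encoding will then be treatment (dummy) contrasts rather than
orthogonal polynomials, so the individual coefficients will differ, though the
fitted values and deviance will be the same.  Also check District: if it is
stored as 1-4 in the file it will be read as a number, and needs to be
converted to a factor to reproduce the fit.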
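
Here is a rough sketch of the conversion.  The attached file is not available
here, so this writes Insurance out to a temporary .csv as a stand-in (the names
tmp, ins, ins_un, m1, m2 and m3 are just for illustration) and takes the level
order from the original data:

```r
library(MASS)

# Round-trip the data through a temporary .csv to mimic reading it in yourself
tmp <- tempfile(fileext = ".csv")
write.csv(Insurance, tmp, row.names = FALSE)
ins <- read.csv(tmp)

is.ordered(ins$Age)   # FALSE: the ordering is lost in the file

# Restore the factor structure, using the level order of the original data
ins$District <- factor(ins$District)
ins$Group <- factor(ins$Group, levels = levels(Insurance$Group), ordered = TRUE)
ins$Age   <- factor(ins$Age,   levels = levels(Insurance$Age),   ordered = TRUE)

# Same data, but with Group and Age left as unordered factors
ins_un <- ins
ins_un$Group <- factor(ins_un$Group, ordered = FALSE)
ins_un$Age   <- factor(ins_un$Age,   ordered = FALSE)

f  <- Claims ~ District + Group + Age + offset(log(Holders))
m1 <- glm(f, data = Insurance, family = poisson)   # original data
m2 <- glm(f, data = ins,       family = poisson)   # converted back to ordered
m3 <- glm(f, data = ins_un,    family = poisson)   # treatment contrasts

all.equal(coef(m1), coef(m2))           # same coefficients once converted
all.equal(deviance(m1), deviance(m3))   # different coding, same overall fit
names(coef(m3))                         # no .L/.Q/.C rows here
```

With your own file you would replace the write.csv/read.csv round trip with a
single read.csv call on that file.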

Hope this helps,

-- 
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
greg.s...@imail.org
801.408.8111


> -----Original Message-----
> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-
> project.org] On Behalf Of choonhong ang
> Sent: Monday, February 23, 2009 10:05 AM
> To: r-help@r-project.org
> Subject: [R] Insurance data in library(MASS)
> 
> I have used the insurance data from R library and I have 2 questions:
> I use the following:
> >library(MASS)
> >data(Insurance)
> > m1=glm(Claims ~ District + Group + Age + offset(log(Holders)),data =
> Insurance, family = poisson)
> >summary(m1)
> 
> Call:
> glm(formula = Claims ~ District + Group + Age + offset(log(Holders)),
>     family = poisson, data = Insurance)
> Deviance Residuals:
>      Min        1Q    Median        3Q       Max
> -2.46558  -0.50802  -0.03198   0.55555   1.94026
> Coefficients:
>              Estimate Std. Error z value Pr(>|z|)
> (Intercept) -1.810508   0.032972 -54.910  < 2e-16 ***
> District2    0.025868   0.043016   0.601 0.547597
> District3    0.038524   0.050512   0.763 0.445657
> District4    0.234205   0.061673   3.798 0.000146 ***
> Group.L      0.429708   0.049459   8.688  < 2e-16 ***
> Group.Q      0.004632   0.041988   0.110 0.912150
> Group.C     -0.029294   0.033069  -0.886 0.375696
> Age.L       -0.394432   0.049404  -7.984 1.42e-15 ***
> Age.Q       -0.000355   0.048918  -0.007 0.994210
> Age.C       -0.016737   0.048478  -0.345 0.729910
> ---
> Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> (Dispersion parameter for poisson family taken to be 1)
>     Null deviance: 236.26  on 63  degrees of freedom
> Residual deviance:  51.42  on 54  degrees of freedom
> AIC: 388.74
>  (1) In the result above, what are Group.L, Group.Q, Group.C, Age.L, Age.Q,
> Age.C?
> 
>  (2) When I copy the Insurance data in csv format (as shown in the
> attachment) and run the same procedure, the result shown is different from
> the above result. Why?
