The data set in R is wrong. I've found mistakes on 2 lines in a quick look. 
  
  I don't know if the data is incorrect in the Appendix of Fleming and 
Harrington as well (someone seems to have borrowed my copy), which is where the 
data set appears to have been taken from, given all the "-9" codes in it. 
(Note, 
Tom Fleming originally got the data from me, so I'm fairly confident in calling 
my Mayo version the authoritative one).  I'll make sure this gets fixed.
  
  You can grab a correct data set from our department web page.  Code is below.
  
        Terry Therneau
        
  
pbcurl <- 
"http://mayoresearch.mayo.edu/mayo/research/biostat/upload/therneau_upload/pbc.d
at"

pbc <- read.table(pbcurl, header=F, 
                  col.names=c('id', 'time', 'status', 'trt',  'age', 'sex',
                              'ascites',  'hepato',  'spiders',  'edema',
                              'bili',  'chol',  'albumin',  'copper', 
                              'alk.phos',  'ast',  'trig',  'platelet',
                              'protime',  'stage'),
                  na.strings='.')
pbc$age <- pbc$age/365.25 

newfit <- coxph(Surv(time, status==2) ~ age + edema + log(bili) +
        log(protime) + log(albumin), data=pbc)

newfit
                coef exp(coef) se(coef)     z       p
age           0.0396    1.0404  0.00767  5.16 2.4e-07
edema         0.8963    2.4505  0.27141  3.30 9.6e-04
log(bili)     0.8636    2.3716  0.08294 10.41 0.0e+00
log(protime)  2.3868   10.8791  0.76851  3.11 1.9e-03
log(albumin) -2.5069    0.0815  0.65292 -3.84 1.2e-04

Likelihood ratio test=231  on 5 df, p=0  n=416 (2 observations deleted due to 
missingness)

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to