I've a data set with 60000 rows of data representing 6000+ distinct loans. I 
did a coxph() regression on it (see call below), but a subsequent survfit() 
call on the coxph object is almost certainly wrong. It gives n=6 when it should 
be 
more like 6000+ (I think)

> survfit(resultag)
Call: survfit.coxph(object = resultag)

      n  events  median 0.95LCL 0.95UCL 
      6     489     Inf       2     Inf 

When I reduced the dataset to just 1000 rows, the survfit()
call on the coxph object looks more correct. 

> survfit(resulting)
Call: survfit.coxph(object = resulting)

      n  events  median 0.95LCL 0.95UCL 
    115      15     Inf     Inf     Inf 

Is there a limit to the size of the data set that I read in?
Or am I just doing something silly above?

Thanks much.
Yongchuan

(this is the coxph regression:
resultag <- coxph(Surv(Start,Stop,PrepayDate)~modBalance + 
closingCoupon+lienPosition +originalFICO,table)

______________________________________________
[email protected] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to