[R] coxph strange result

David Winsemius dwinsemius at comcast.net
Fri Nov 26 00:37:19 CET 2010


On Nov 25, 2010, at 5:16 PM, Bond, Stephen wrote:

> The following fit does not make sense to me, please, correct me if I  
> have a logical error.
>
>> moddowsn
> Call:
> coxph(formula = Surv(start, stop, resp) ~ sn + matfac2, data = coxsn1,
>    method = "efron")
>
>
>            coef exp(coef) se(coef)       z       p
> sn2       0.0497     1.051  0.02030   2.450 1.4e-02
> sn3      -0.0532     0.948  0.02038  -2.610 9.0e-03
> sn4      -0.0410     0.960  0.01979  -2.073 3.8e-02
> sn5      -0.0776     0.925  0.01954  -3.973 7.1e-05
> sn6      -0.1133     0.893  0.01839  -6.161 7.2e-10
> sn7      -0.1252     0.882  0.01846  -6.781 1.2e-11
> sn8      -0.1222     0.885  0.01994  -6.130 8.8e-10
> sn9      -0.0507     0.951  0.02047  -2.478 1.3e-02
> sn10     -0.0444     0.957  0.02056  -2.159 3.1e-02
> sn11     -0.0433     0.958  0.02157  -2.008 4.5e-02
> sn12     -0.0114     0.989  0.02037  -0.557 5.8e-01
> matfac22 -0.2599     0.771  0.01727 -15.048 0.0e+00
> matfac25 -0.1804     0.835  0.00924 -19.512 0.0e+00
>
> Likelihood ratio test=651  on 13 df, p=0  n= 253802
>
> This would indicate that in sn6 to sn8 there is less of a chance of  
> an event. ?? do the relative frequencies implied by the following  
> table make any sense??
>
>> table(coxsn1$matfac2,coxsn1$sn,coxsn1$resp)
> , ,  = 0
>
>
>        1     2     3     4     5     6     7     8     9    10     
> 11    12
>  1  3407  3177  3425  3348  3975  3564  3181  3077  2894  2610   
> 3441  3443
>  2   920  1005  1142  1327  1645  1530  1330  1184   964   864    
> 888   860
>  5  9036  9507 10258 11888 16826 15575 13394 12346  9938  9001   
> 8970  8599
>
> , ,  = 1
>
>
>        1     2     3     4     5     6     7     8     9    10     
> 11    12
>  1  1453  1459  1186  1496  1295  1754  1429  1153  1106  1234    
> 965  1532
>  2   312   290   330   390   454   539   479   367   295   276    
> 256   267
>  5  2994  3207  3371  3629  4095  5581  5837  3844  3400  3199   
> 2705  3084
>
> Apparently the frequency of an event is higher in the summer months.

Without knowing how the data was prepared, agreement would seem highly  
speculative on our parts. We haven't been told very much about this  
dataset apart from the cryptic variable names apparently referring to  
calendar months regarding some aspect of the cases.


> I apologize for not being able to disclose the dataset, but think  
> that the table provides enough to address the question.

I disagee. Suppose sn2 cases began observation in February and then  
had events in July or September. Or even supposing that observations  
are confined within a single month. If on average, the durations are  
longer within those months than in others, then you could get lower  
rates without having lower even frequencies.

> Thanks everybody.
>
>
> Stephen B

David Winsemius, MD
West Hartford, CT



More information about the R-help mailing list