[R] singular information matrix in lrm.fit

Gad Abraham gabraham at csse.unimelb.edu.au
Sun Oct 12 05:29:58 CEST 2008


Hi,

I'm trying to do binary logistic regression on 10 covariables, comparing 
glm to lrm from Harrell's Design package. They don't seem to agree on 
whether the data is collinear:

 > library(Design)
 > load(url("http://www.csse.unimelb.edu.au/~gabraham/data.Rdata"))
 > lrm(y ~ X1 + X2 + X3 + X4 + X5 + X6 + X7 + X8 + X9 + X10, data=x)
singular information matrix in lrm.fit (rank= 10 ).  Offending variable(s):
X10
Error in j:(j + params[i] - 1) : NA/NaN argument

If I understand correctly, lrm is complaining about collinearity in the 
data. However, the rank of the matrix is 10:
 > qr(x)$rank
[1] 10

glm doesn't seem to care about the supposed collinearity, but does say 
that the data are perfectly separable:

 > glm(y ~ X1 + X2 + X3 + X4 + X5 + X6 + X7 + X8 + X9 + X10, data=x,
+    family=binomial(), control=glm.control(maxit=50))

Call:  glm(formula = y ~ X1 + X2 + X3 + X4 + X5 + X6 + X7 + X8 + X9 + 
    X10, family = binomial(), data = x, control = glm.control(maxit = 50))

Coefficients:
(Intercept)           X1           X2           X3           X4 
   X5
  -6.921e+03    7.185e-02    4.344e-02   -3.980e-02   -5.362e-02 
-6.387e-03
          X6           X7           X8           X9          X10
   2.455e-01    2.753e-02   -1.848e-01    1.903e-01   -3.187e-02

Degrees of Freedom: 27 Total (i.e. Null);  17 Residual
Null Deviance:      38.82
Residual Deviance: 4.266e-10    AIC: 22
Warning message:
In glm.fit(x = X, y = Y, weights = weights, start = start, etastart = 
etastart,  :
   fitted probabilities numerically 0 or 1 occurred


What's the reason for this discrepancy?

Thanks,
Gad


-- 
Gad Abraham
Dept. CSSE and NICTA
The University of Melbourne
Parkville 3010, Victoria, Australia
email: gabraham at csse.unimelb.edu.au
web: http://www.csse.unimelb.edu.au/~gabraham



More information about the R-help mailing list