[R] clogit and small sample sizes: what to do?

Fri Nov 7 16:54:35 CET 2008

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Conditional logistic regression and small sample sizes: what to do?

Dear R-Gurus,

[I run R 2.8.0 on WinXP. I have no formal training in statistics.]

Please feel free to skip the 'Blah blah part' for questions below.

- ---Blah blah part--------
Why I use clogit (package 'survival')?: I am looking for risk factors
for cases during a recent small "flare-up" of disease in a hospital. To
this end I matched 11 cases (no more of them, sorry!) with 33 controls
based on sex. The next step was to perform conditional logistic
regression to evaluate risk 'exposures' for cases.

Now clogit() (Package 'survival') produces some conflicting results:

Briefly, 10 of 11 cases have a certain exposure, compared to 9 of 33
controls. The odds ratio is significant (p-value = 0.000311) while the
output of clogit() is inconclusive (.95 confidence intervals for risk
factor range from 0 to infinity, conflicting 'likelihood ratio test' and
'Wald test') plus produces following warning message, "Warning message:
In fitter(X, Y, strats, offset, init, control, weights = weights,: Ran
out of iterations and did not converge." Tampering with the data (i.e.
reducing the number of cases with the exposure to 7 of 11) renders the
odds ratio unsignificant (p-value = 0.06681), yet makes clogit() assign
a significant p-value to the exposure in question (0.043). Furthermore,
the warning message is gone after the tampering.

Interestingly reducing (instead of increasing) cases with the risk
factor yields a significant result with clogit().

I believe now, that this behaviour might be due to small sample size.
Now I am looking for an R package to maybe circumvent this limitation of
clogit(). (Alternatively, I could just chuck the project and order a
Margerita. Maybe the risk factor is plainly not significant after all.
But I digress...)

Six years ago, the following message was posted to this list:

   -----old post 2002------------
Dear R folks,
We completed a matched case-control study that was analyzed using
conditional logistic regression. Because of the small sample size we
need to calculate exact confidence intervals. The quick solution is to
purchase LogExact by Cytel. However, we'd like to do this in R. Anyone
have experience with this?

Many thanks,
Tomas
____________________________________
Tomas Aragon, MD, DrPH
   -----old post 2002 end------------

At that time, the answers were not too encouraging. Anyway, I might have
the very same query 2008.

Ok, I tried to do some homework and started to look for packages maybe
implementing different methods. I did run across 'elrm' (Zamar, D.,
McNeney, B. & Graham, J., 2007), but I am unsure whether this package is
appropriate for my original 1:3 matched design.

- ---Blah blah part end--------

- ---Questions, finally!---
So my questions to you are:

1) Is the problem I describe above likely related to small sample sizes?
2) Do you have experience with 'elrm' or packages finding a solution for
the above problem? Is 1:n matching as in clogit() possible?

- ---Questions, finally! end---

Thank you for your time and effort,

Johannes

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAkkUZLsACgkQeixPRMMDSjthxwCgpe0ACd6qrMVCz60yzVfzML2K
EpMAnRnrfbWDrZ29TLFKMCe+bH/YrNZI
=hb1P
-----END PGP SIGNATURE-----