[R] formula () problems

Dr R.K.S. Hankin rksh1 at cam.ac.uk
Wed Dec 9 21:22:44 CET 2009


Hi.

I am having difficulty creating a formula for use with glm()

I have a matrix of an unknown number of columns and wish to estimate a
coefficient for each column, and one for the each product of a column
with another column.

In the case of a five-column matrix this would be:

> x <- matrix(rnorm(100),ncol=5)
> colnames(x) <- letters[1:5]
> z <- rnorm(20)
> lm(z~ -1+(a+b+c+d+e)^2,data=data.frame(x))

Call:
lm(formula = z ~ -1 + (a + b + c + d + e)^2, data = data.frame(x))

Coefficients:
       a b c d e a:b a:c a:d -0.30021 -0.21465 0.12208 0.06308 0.28806 
0.34482 -1.00072 0.48218
     a:e       b:c       b:d       b:e       c:d       c:e       d:e  
 0.28786  -0.46306   0.39844   0.04436   0.32236  -0.09210  -1.06625  

> 

This is what I want: five single terms (a-e) and 5*(5-1)/2=10 (a:b to
d:e) for the cross terms.  If there were 6 columns I would want
(a+b+c+d+e+f)^2 and have 21 (=6+15) terms.

How do I create a formula that does this for an arbitrary number of columns?


thanks

Robin




More information about the R-help mailing list