[R] coded to categorical variables in a large dataset

Richard M. Heiberger rmh at temple.edu
Fri Dec 29 20:11:14 CET 2006


## The main reason for wanting such a coding is to use it in
## a linear model.  Therefore, declare the variable to be a factor
## and use it directly.


tmp <- sample(1:5, 40, replace=TRUE)
tmpf <- factor(tmp)
tmp.y <- rnorm(40)

tmp.aov <- aov(tmp.y ~ tmpf)
summary(tmp.aov)

contrasts(tmpf)

update(tmp.aov, x=TRUE)$x[1:6,]

## If you really want to see the redundant column 1 of
## of the contrasts, that can be done with the statement

contrasts(tmpf)
contrasts(tmpf, how.many=5) <- contr.treatment(5, contrasts=FALSE)
contrasts(tmpf)

tmp2.aov <- aov(tmp.y ~ tmpf)
summary(tmp2.aov)
update(tmp2.aov, x=TRUE)$x[1:6,]



More information about the R-help mailing list