[R] Advice on parsing formulae

Claus Dethlefsen dethlef at math.aau.dk
Thu Dec 16 09:33:46 CET 2004


Thank you for the advice. I have now boiled my problem down to the
following:

How do I create fm2 from fm1 ?

fm1 <-  Y ~ 1 + tvar(x:A) + tvar(z) + u + tvar(B) + tvar(poly(v,3))
fm2 <-  Y ~ 1 + x:A + z + u + B + poly(v, 3)

Thus, how do I simply remove tvar( * ) from a formula? Do I have to write a
function to parse the string and then re-create the formula? Is there an
easy way of doing this?


When my above problem is solved, I can (with the help from Heather Turner
and Chuck Berry) do the following

## define som data
x <- z <- u <- v <- rnorm(5)
A <- B <- factor( rep( c(1,2), c(3,2) ) )

## define my formula fm1 and manually create fm2.
fm1 <-  Y ~ 1 + tvar(x:A) + tvar(z) + u + tvar(B) + tvar(poly(v,3))
fm2 <-  Y ~ 1 + x:A + z + u + B + poly(v, 3)

## extract the term.labels from fm2, make the design matrix and extract
'assign'
term.labels <- unname(sapply(attr(terms(fm2), "term.labels"),
                             removeSpace))
X <- model.matrix(fm2,keep.order=TRUE)
pAssign <- attr(X, "assign")

## Now, extract the tvar-terms from fm1
tvar.terms <- terms( fm1, specials = "tvar",keep.order=TRUE )
idx <- attr(tvar.terms,"specials")$tvar
if (attr(tvar.terms,"intercept")) idx <- idx -1
tvar <- attr(terms(fm2,keep.order=TRUE),"term.labels")[idx]
tvar <- unname( sapply( tvar, removeSpace) )

## Finally, combine the information to get the vector I asked for
tvarAssign <- match(pAssign, sort(match(tvar, term.labels)))
tvarAssign[is.na(tvarAssign)] <- 0

 

> -----Original Message-----
> From: Heather Turner [mailto:Heather.Turner at warwick.ac.uk] 
> Sent: 15. december 2004 11:41
> To: r-help at stat.math.ethz.ch
> Subject: Re: [R] Advice on parsing formulae
> 
> 
> I think this will do what you want:
> 
> # Need this function to remove spaces from term labels later on
> > removeSpace <-  function(string) gsub("[[:space:]]", "", string)
> 
> # Specify which terms are in a "tvar" group
> # (could remove spaces separately)
> > tvar <- unname(sapply(c("x:A", "z", "B", "poly(v,3)"), removeSpace))
> 
> # Use terms to get term labels from formula
> > formula <- Y ~ 1 + x:A + z + u + B + poly(v,3)
> > term.labels <- unname(sapply(attr(terms(formula), 
> "term.labels"), removeSpace))
> > tvar
> [1] "x:A"       "z"         "B"         "poly(v,3)"
> > term.labels
> [1] "z"         "u"         "B"         "poly(v,3)" "x:A"
> 
> # Get assign variable for parameters
> # (You would use first two lines, but I don't have data so 
> defined assign variable myself)
> > #X <- model.matrix(formula)
> > #pAssign <- attr(X, "assign")
> > pAssign <- c(0,1,2,3,4,4,4,5,5)  
> 
> # Define "tvarAssign"
> > tvarAssign <- match(pAssign, sort(match(tvar, term.labels)))
> > tvarAssign[is.na(tvarAssign)] <- 0
> > tvarAssign
> [1] 0 1 0 2 3 3 3 4 4
> 
> HTH
> 
> Heather
> 
> Mrs H Turner
> Research Assistant
> Dept. of Statistics
> University of Warwick
> 
> >>> "Claus Dethlefsen" <dethlef at math.aau.dk> 12/13/04 04:10pm >>>
> Dear list
> 
> I would like to be able to group terms in a formula using a 
> function that I
> will call tvar(), eg. the formula
> 
> Y ~ 1 + tvar(x:A) + tvar(z) + u + tvar(B) + tvar(poly(v,3))
> 
> where x,u and v are numeric and A and B are factors - binary, say.
> 
> As output, I want the model.matrix as if tvar had not been 
> there at all. In
> addition, I would like to have information on the grouping, 
> as a vector as
> long as ncol( model.matrix ) with zeros corresponding to 
> terms outside tvar
> and with an index grouping the terms inside each tvar(). In the (sick)
> example:
> 
> 
> > model.matrix(Y ~ 1 + tvar(x:A) + tvar(z) + u + tvar(B) + 
> tvar(poly(v,3)))
>    (Intercept)     z     u B2 poly(v, 3)1 poly(v, 3)2 poly(v, 
> 3)3  x:A1
> x:A2
> 1            1 -1.55 -1.03  0       0.160      -0.350      
> -0.281  0.66
> 0.00
> 2            1 -1.08  0.55  0      -0.164      -0.211       
> 0.340  0.91
> 0.00
> 3            1  0.29 -0.26  0      -0.236      -0.073       
> 0.311 -1.93
> 0.00
> 4            1 -1.11  0.96  0       0.222      -0.285      
> -0.385 -0.23
> 0.00
> 5            1  0.43 -0.76  1      -0.434       0.515      
> -0.532  0.22
> 0.00
> 
> I would like the vector
> 
> c(0,1,0,2,3,3,3,4,4)
> 
> pointing to the tvar-grouped terms.
> 
> Thus what I would like, looks a bit like the 'assign' attribute of the
> model.matrix() output. I have not figured out a way of doing 
> this in a nice
> way and would like some help, please.
> 
> I hope somebody can help me (or point the manual-pages I should read),
> 
> Best, 
> 
> Claus Dethlefsen
> --------------------------------------------------------------
> Assistant Professor, Claus Dethlefsen, Ph.D.
> mailto:dethlef at math.auc.dk, http://www.math.auc.dk/~dethlef 
> Dpt. of Mathematical Sciences, Aalborg University
> Fr. Bajers Vej 7G, 9220 Aalborg East
> Denmark
> 

-- 
No virus found in this outgoing message.
Checked by AVG Anti-Virus.




More information about the R-help mailing list