[R] Strange scope problem

Uwe Ligges ligges at statistik.uni-dortmund.de
Thu Oct 16 10:52:58 CEST 2003


Angelo Canty wrote:

> Hi,
> 
> I have come across the following problem which seems to be a scoping
> issue but I'm at a loss to see why this is so or to find a good
> workaround.
> 
> Suppose I have a function to get a prediction after model selection
> using the step function.
> 
> step.pred <- function(dat, x0) {
>   fit.model <- step(lm(y~., data=dat), trace=F)
>   predict(fit.model, x0, se.fit=T)
> }
> 
> This function works sometimes for example
> 
> set.seed(1)
> X.1 <- data.frame(x1=rnorm(20), x2=rnorm(20), x3=rnorm(20))
> y.1 <- 5+as.matrix(X.1[,1:2])%*%matrix(c(1,1))+rnorm(20)
> Xy.1 <- data.frame(X.1,y=y.1)
> x0.1 <- data.frame(x1=-1,x2=-1, x3=-1)
> step.pred(Xy.1, x0.1)
> 
> $fit
> [1] 3.359540
> 
> $se.fit
> [1] 0.523629
> 
> $df
> [1] 16
> 
> $residual.scale
> [1] 1.093526
> 
> but most often it crashes as in

It does not crash. Your outdated version reports an error. R-1.8.0 does 
not. So please upgrade!

Uwe Ligges


> set.seed(2)
> X.2 <- data.frame(x1=rnorm(20), x2=rnorm(20), x3=rnorm(20))
> y.2 <- 5+as.matrix(X.2[,1:2])%*%matrix(c(1,1))+rnorm(20)
> Xy.2 <- data.frame(X.2,y=y.2)
> x0.2 <- data.frame(x1=-1,x2=-1, x3=-1)
> step.pred(Xy.2, x0.2)
> Error in model.frame.default(formula = y ~ x1 + x2, data = dat,
> drop.unused.levels = TRUE) : 
>         Object "dat" not found
> 
> The difference seems to be that for the first dataset, step retains
> all three variables whereas for the second it drops one of them.
> 
> 
>>step(lm(y~.,data=Xy.1), trace=F)
> 
> 
> Call:
> lm(formula = y ~ x1 + x2 + x3, data = Xy.1)
> 
> Coefficients:
> (Intercept)           x1           x2           x3  
>      4.8347       0.8937       1.0331      -0.4516  
> 
> 
>>step(lm(y~.,data=Xy.2), trace=F)
> 
> 
> Call:
> lm(formula = y ~ x1 + x2, data = Xy.2)
> 
> Coefficients:
> (Intercept)           x1           x2  
>      5.0802       0.9763       1.1369  
> 
> 
> One possible workaround is to explicitely assign the local variable
> dat in the .GlobalEnv as in
> 
> step.pred1 <- function(dat, x0) {
>   assign("dat",dat, envir=.GlobalEnv)
>   fit.model <- step(lm(y~., data=dat), trace=F)
>   predict(fit.model, x0, se.fit=T)
> }
> 
> I don't like this method since it would overwrite anything else called
> dat in .GlobalEnv.  I realize that I could give it an obscure name but
> the potential for damage still remains.  Am I missing something obvious
> here?  If not, is it possible to work around this problem in such a way
> that .GlobalEnv does not need to be touched?
> 
> In S-Plus I would use 
> assign("dat",dat, frame=1)
> which works but that is not available (AFAIK) in R.  Is there
> something similar that I can use?
> 
> I am using R 1.6.1 for Unix on a Sun Workstation. I know that I need
> to upgrade but our sysadmin doesn't regard it as priority!  
 >
> Thanks for any help you can give for this.
> Angelo
> 
> ------------------------------------------------------------------
> |   Angelo J. Canty                Email: cantya at mcmaster.ca     |
> |   Mathematics and Statistics     Phone: (905) 525-9140 x 27079 |
> |   McMaster University            Fax  : (905) 522-0935         |
> |   1280 Main St. W.                                             |
> |   Hamilton ON L8S 4K1                                          |
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://www.stat.math.ethz.ch/mailman/listinfo/r-help




More information about the R-help mailing list