[R] anova comparisons

Sat Feb 23 19:11:25 CET 2013

On Feb 23, 2013, at 10:06 , Rolf Turner wrote:

>> 
>> What am I missing?
> 
> A basic understanding of the theory of linear models.  This really has little
> to do with R.  Go and read a good intro to linear modelling.
> 
> Insofar as your question has anything to do with R:
> 
> When you do
> 
>    anova(m1, m2, m3, test="F")
> 
> the mean squared error from m3 is used as the denominator of the F statistic.
> 
> When you do
> 
>    anova(m1, m2, test="F")
> 
> the mean squared error from m2 is used as the denominator of the F statistic.

Maybe a bit quick there. It's actually about (computer) conventions for ANOVA tables; the theory would prefer you to do what the poster expects: Test sequentially and update the denominator MS along the route. However, lazy programmers (and before them lazy practising statisticians) decided that it is easier to use the MS from the biggest model throughout.

(There's also a post-hoc argument, that said MS is less susceptible to small departures from the model assumptions, but I'd say that the main reason is convenience.)

Even though it is a common convention, you do need exposure to at least one statistical computer program or one traditional analysis of variance textbook to see it. 

-- 
Peter Dalgaard, Professor,
Center for Statistics, Copenhagen Business School
Solbjerg Plads 3, 2000 Frederiksberg, Denmark
Phone: (+45)38153501
Email: pd.mes at cbs.dk  Priv: PDalgd at gmail.com