[R] problems understanding error term in aov()

Jonas Mandel jonas.mandel at gmail.com
Tue Apr 28 17:01:49 CEST 2009


Hi all,

I'm having some trouble in understanding how to ste the Error() term in  
the aov() function when fitting a hierarchical ANOVA.
I have data concerning the expression of 2 miRNAs in 3 different cell 
lines, with 2 different extraction methods. The data is organized as 
follows :
    Line    Extraction    Target    Expression
1    BC54    miRNA    RNU48    22.48
2    BC54    miRNA    RNU48    22.36
3    BC54    miRNA    221    26.49
4    BC54    miRNA    221    26.42
5    BC54    RNAtot    RNU48    22.95
6    BC54    RNAtot    RNU48    22.98
7    BC54    RNAtot    221    26.35
8    BC54    RNAtot    221    26.43
9    BC108    miRNA    RNU48    24.19
10    BC108    miRNA    RNU48    24.01
11    BC108    miRNA    221    29.94
12    BC108    miRNA    221    29.61
13    BC108    RNAtot    RNU48    22.42
14    BC108    RNAtot    RNU48    22.29
15    BC108    RNAtot    221    26.79
16    BC108    RNAtot    221    26.73
17    BC233    miRNA    RNU48    21.84
18    BC233    miRNA    RNU48    21.69
19    BC233    miRNA    221    25.95
20    BC233    miRNA    221    25.98
21    BC233    RNAtot    RNU48    22.55
22    BC233    RNAtot    RNU48    22.58
23    BC233    RNAtot    221    25.11
24    BC233    RNAtot    221    25.45

For the two miRNA and for each cell line I want to compare the 
expression value between the two extraction methods. As there are no 
random effects I tried to use the aov() function to do so.  I don't know 
which of the two following models is the "good" one :

This one :
 > mir221.aov=aov(Ct~Extraction+Error(Target/Line/Extraction),data=mir221)

with this model the extraction has no significant effect :
 > summary(mir221.aov)
Error: Target
          Df Sum Sq Mean Sq F value Pr(>F)
Residuals  1 99.675  99.675              

Error: Target:Line
          Df  Sum Sq Mean Sq F value Pr(>F)
Residuals  4 16.9779  4.2445              

Error: Target:Line:Extraction
           Df  Sum Sq Mean Sq F value Pr(>F)
Extraction  1  2.8912  2.8912  1.3568 0.2966
Residuals   5 10.6545  2.1309              

Error: Within
          Df   Sum Sq  Mean Sq F value Pr(>F)
Residuals 12 0.164150 0.013679

or this one :
 > mir221.aov=aov(Ct~Extraction+Error(Target/Line),data=mir221)

with this model the extraction has a significant effect:
 > summary(mir221.lm)
Error: Target
          Df Sum Sq Mean Sq F value Pr(>F)
Residuals  1 99.675  99.675              

Error: Target:Line
          Df  Sum Sq Mean Sq F value Pr(>F)
Residuals  4 16.9779  4.2445              

Error: Within
           Df  Sum Sq Mean Sq F value  Pr(>F) 
Extraction  1  2.8912  2.8912  4.5431 0.04794 *
Residuals  17 10.8187  0.6364    

Could you please help me understanding which is the right model for my 
question ? Thanks by advance

Jonas Mandel
U900 - Bioinformatics unit
Institut Curie
Paris




More information about the R-help mailing list