[R] Ancova_non-normality of errors

John Fox jfox at mcmaster.ca
Sun May 4 16:13:11 CEST 2008


Dear Tobias,

Your observation that "When plot [the residuals from?] this model I get a
banana-shape in Normal Q-Q Plot(with open site [side?] pointing downwards),"
suggests that the residuals are negatively skewed, which in turn suggests
that using log(wt) as the response variable may have been ill-advised.
Perhaps simply using wt, or a weaker transformation such as sqrt(wt), would
produce better-behaved residuals.

I hope this helps,
 John 

------------------------------
John Fox, Professor
Department of Sociology
McMaster University
Hamilton, Ontario, Canada
web: socserv.mcmaster.ca/jfox


> -----Original Message-----
> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org]
On
> Behalf Of Tobias Erik Reiners
> Sent: May-04-08 5:56 AM
> To: r-help at r-project.org
> Subject: [R] Ancova_non-normality of errors
> 
> Hello Helpers,
> 
> I have some problems with fitting the model for my data...
> -->my Literatur says (crawley testbook)=
> Non-normality of errors-->I get a banana shape Q-Q plot with opening
> of banana downwards
> 
> Structure of data:
>       origin   wt   pes gender
> 1      wild 5.35 147.0   male
> 2      wild 5.90 148.0   male
> 3      wild 6.00 156.0   male
> 4      wild 7.50 157.0   male
> 5      wild 5.90 148.0   male
> 6      wild 5.95 148.0   male
> 7      wild 8.55 160.5   male
> 8      wild 5.90 148.0   male
> 9      wild 8.45 161.0   male
> 10     wild 4.90 147.0   male
> 11     wild 6.80 153.0   male
> 12     wild 5.75 146.0   male
> 13     wild 8.60 160.0   male
> 14  captive 6.85 159.0   male
> 15  captive 7.00 160.0   male
> 16  captive 6.80 155.0   male
> ..
> ...
> 283    site 4.10 130.4 female
> 284    site 3.55 131.1 female
> 285    site 4.20 135.7 female
> 286    site 3.45 128.0 female
> 287    site 3.65 125.3 female
> 
> The goal of my analysis is to work out what effect the categorial
> factors(origin, gender) on the relation between
> log(wt)~log(pes)(-->Condition, fett ressource), have.
> Does the source(origin) of translocated animals have an affect on
> performance(condition)in the new area?
> I have already a best fit model and it looks quite good (or not?see
below).
> 
> two slopes(gender difference)and 6 intercepts(3origin levels*2gender
levels)
> 
> lm(formula = log(wt) ~ log(pes) + origin + gender + gender:log(pes))
> 
> Residuals:
>       Min       1Q   Median       3Q      Max
> -0.54181 -0.07671  0.01520  0.09474  0.28818
> 
> Coefficients:
>                      Estimate Std. Error t value Pr(>|t|)
> (Intercept)         -7.39879    1.97605  -3.744 0.000219 ***
> log(pes)             1.78020    0.40118   4.437 1.31e-05 ***
> originsite           0.06572    0.01935   3.397 0.000781 ***
> originwild           0.07655    0.03552   2.155 0.032011 *
> gendermale          -9.32418    2.37476  -3.926 0.000109 ***
> log(pes):gendermale  1.90393    0.47933   3.972 9.06e-05 ***
> ---
> Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
> 
> Residual standard error: 0.1433 on 281 degrees of freedom
> Multiple R-Squared: 0.7227,     Adjusted R-squared: 0.7177
> F-statistic: 146.4 on 5 and 281 DF,  p-value: < 2.2e-16
> 
> When plot this model I get a banana-shape in Normal Q-Q Plot(with open
> site pointing downwards) , indicating non-normality of my data....how
> to handle this?
> 
> -->Do I have unbalanced data?
>        captive    site    wild
> n-->     119     149      19
> 
> My problem is that I see that my data is not as good as the
> modelsummary tells.
> Should I include another term in my model formular?
> 
> I think I have to differenciate more, but I don't know
> how.(contrasts?, TukeyHSD?,Akaike Information Criterion? or lme())to
> many different ways out there.
> 
> Cheers,
> Tobi
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list