[R] How to represent the effect of one covariate on regression results?

Abby Spurdle @purd|e@@ @end|ng |rom gm@||@com
Tue Sep 15 05:12:01 CEST 2020


I'm wondering if you want one of these:
(1) Plots of "Main Effects".
(2) "Partial Residual Plots".

Search for them, and you should be able to tell if they're what you want.

But a word of warning:

Many people (including many senior statisticians) misinterpret this
kind of information.
Because, it's always the effect of xj on Y, while holding the other
variables *constant*.
That's not as simple as it sounds, and people have a tendency of
disregarding the importance of the second half of that sentence, in
their final interpretations.


P.S.
John Fox, announced a package with support for Regression Diagnostics,
about 11 days ago:
https://stat.ethz.ch/pipermail/r-help/2020-September/468609.html

I'm not sure how relevant it is to your question, but I just glanced
at the vignette, and it's pretty slick...




On Tue, Sep 15, 2020 at 1:30 AM Ana Marija <sokovic.anamarija using gmail.com> wrote:
>
> Hello,
>
> I was running association analysis using --glm genotypic from:
> https://www.cog-genomics.org/plink/2.0/assoc with these covariates:
> sex,age,PC1,PC2,PC3,PC4,PC5,PC6,PC7,PC8,PC9,PC10,TD,array,HBA1C. The
> result looks like this:
>
>     #CHROM    POS    ID    REF    ALT    A1    TEST    OBS_CT    BETA
>   SE    Z_OR_F_STAT    P    ERRCODE
>     10    135434303    rs11101905    G    A    A    ADD    11863
> -0.110733    0.0986981    -1.12193    0.261891    .
>     10    135434303    rs11101905    G    A    A    DOMDEV    11863
> 0.079797    0.111004    0.718868    0.472222    .
>     10    135434303    rs11101905    G    A    A    sex=Female
> 11863    -0.120404    0.0536069    -2.24605    0.0247006    .
>     10    135434303    rs11101905    G    A    A    age    11863
> 0.00524501    0.00391528    1.33963    0.180367    .
>     10    135434303    rs11101905    G    A    A    PC1    11863
> -0.0191779    0.0166868    -1.14928    0.25044    .
>     10    135434303    rs11101905    G    A    A    PC2    11863
> -0.0269939    0.0173086    -1.55957    0.118863    .
>     10    135434303    rs11101905    G    A    A    PC3    11863
> 0.0115207    0.0168076    0.685448    0.493061    .
>     10    135434303    rs11101905    G    A    A    PC4    11863
> 9.57832e-05    0.0124607    0.0076868    0.993867    .
>     10    135434303    rs11101905    G    A    A    PC5    11863
> -0.00191047    0.00543937    -0.35123    0.725416    .
>     10    135434303    rs11101905    G    A    A    PC6    11863
> -0.0103309    0.0159879    -0.646172    0.518168    .
>     10    135434303    rs11101905    G    A    A    PC7    11863
> 0.00790997    0.0144025    0.549207    0.582863    .
>     10    135434303    rs11101905    G    A    A    PC8    11863
> -0.00205639    0.0142709    -0.144096    0.885424    .
>     10    135434303    rs11101905    G    A    A    PC9    11863
> -0.00873771    0.0057239    -1.52653    0.126878    .
>     10    135434303    rs11101905    G    A    A    PC10    11863
> 0.0116197    0.0123826    0.938388    0.348045    .
>     10    135434303    rs11101905    G    A    A    TD    11863
> -0.670026    0.0962216    -6.96337    3.32228e-12    .
>     10    135434303    rs11101905    G    A    A    array=Biobank
> 11863    0.160666    0.073631    2.18205    0.0291062    .
>     10    135434303    rs11101905    G    A    A    HBA1C    11863
> 0.0265933    0.00168758    15.7583    6.0236e-56    .
>     10    135434303    rs11101905    G    A    A    GENO_2DF    11863
>   NA    NA    0.726514    0.483613    .
>
> This results is shown just for one ID (rs11101905) there is about 2
> million of those in the resulting file.
>
> My question is how do I present/plot the effect of covariate "TD" in
> the example it has "P" equal to 3.32228e-12 for all IDs in the
> resulting file so that I show how much effect covariate "TD" has on
> the analysis. Should I run another regression without covariate "TD"
> and than do scatter plot of P values with and without "TD" covariate
> or there is a better way to do this from the data I already have?
>
> Thanks
> Ana
>
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list