[BioC] minimal number of features tested in edgeR
guest at bioconductor.org
Thu Oct 25 11:23:20 CEST 2012
I have a question regarding the minimal number of genes that we can test in an analysis with edgeR. Let me explain, in a study, edgeR have been used for testing the differential expression of three viruses between two conditions, without considering the counts on other features. That is, the data frame d$counts has only three lines (and 4 columns, as there is two replicates per condition). The library sizes, however, correspond to the total number of tags aligned both on these viruses and on the genes of the host organism. It seems inappropriate to me, as I don't understand how it would be possible to estimate reliably the dispersion from only three features, but maybe I'm wrong... May I have your opinion?
For you, what is the minimal number of features that we can test using edgeR?
Thank you by advance for your help.
-- output of sessionInfo():
R version 2.15.0 (2012-03-30)
Platform: x86_64-pc-linux-gnu (64-bit)
 LC_CTYPE=fr_FR.UTF-8 LC_NUMERIC=C
 LC_TIME=fr_FR.UTF-8 LC_COLLATE=fr_FR.UTF-8
 LC_MONETARY=fr_FR.UTF-8 LC_MESSAGES=fr_FR.UTF-8
 LC_PAPER=C LC_NAME=C
 LC_ADDRESS=C LC_TELEPHONE=C
 LC_MEASUREMENT=fr_FR.UTF-8 LC_IDENTIFICATION=C
attached base packages:
 stats graphics grDevices utils datasets methods base
other attached packages:
 edgeR_2.6.2 limma_3.12.0
loaded via a namespace (and not attached):
 annotate_1.34.0 AnnotationDbi_1.18.0 Biobase_2.16.0
 BiocGenerics_0.2.0 DBI_0.2-5 DESeq_1.8.2
 genefilter_1.38.0 geneplotter_1.34.0 grid_2.15.0
 IRanges_1.14.3 RColorBrewer_1.0-5 RSQLite_0.11.1
 splines_2.15.0 stats4_2.15.0 survival_2.36-14
Sent via the guest posting facility at bioconductor.org.
More information about the Bioconductor