[BioC] library size and fold changes

Gordon K Smyth smyth at wehi.EDU.AU
Wed Dec 7 23:32:52 CET 2011


Dear Bogdan,

Give that library size may affect FDR, but will not affect FC (even might 
increase it slightly), it would seem to me more natural to relax the FDR 
cutoff rather than the FC cutoff.  I would use the same FC cutoff 
regardless of library size.

This is especially so because, once counts get to a certain size, the 
p-value under the negative binomial model depends only on the fold change, 
further increases in count size making little or no difference.  This is 
because the sequencing variability become negligible for large counts, 
after which biological inter-library variability is the only soure of 
variation.

What is a sensible analysis for your current data might of course depend 
on many things, which we don't know from your email

Best wishes
Gordon

---------------------------------------------
Professor Gordon K Smyth,
Bioinformatics Division,
Walter and Eliza Hall Institute of Medical Research,
1G Royal Parade, Parkville, Vic 3052, Australia.
Tel: (03) 9345 2326, Fax (03) 9347 0852,
smyth at wehi.edu.au
http://www.wehi.edu.au
http://www.statsci.org/smyth

On Tue, 6 Dec 2011, Bogdan Tanasa wrote:

> Dear Mark, Gordon,
>
> probably a naive statistical question on edgeR : considering 3 samples of
> 3 library sizes : a) 10 mil reads, b) 30 mil reads, and c) 12 mil reads.
>
> after applying edgeR, I do obtain 1) > 2000 genes differentially expressed
> between a) and b) (FDR< 0.01, FC > 2), and 2) only ~ 200 genes
> differentially expressed between a) and c) (FDR < 0.01, FC >2).
>
> my question would be : given the fact that the number of differentially
> expressed genes is dependent on the library size, would it be valid to
> compare and contrast the set 1) of 2000 differentially expressed genes (FDR
> < 0.01, FC >2), with an expanded set 2) of 200+800 differentially expressed
> genes (FDR < 0.01, BUT FC > 1.2).
>
> thanks a lot,
>
> Bogdan
>

______________________________________________________________________
The information in this email is confidential and intend...{{dropped:4}}



More information about the Bioconductor mailing list