[BioC] DESeq2

Wolfgang Huber whuber at embl.de
Tue Mar 12 21:51:39 CET 2013


Dear DESeq users,

Mike Love, Simon Anders and I have been updating the DESeq package. This resulted in the package DESeq2, which is available from the development branch, and scheduled for the next release: http://www.bioconductor.org/packages/devel/bioc/html/DESeq2.html

For several release cycles, the original package (DESeq) will be maintained at its current functionality, in order to not disrupt the workflows of DESeq users. For new projects, we recommend using DESeq2. Major innovations are:

* Base class: SummarizedExperiment is used as the superclass for storing the data, rather than eSet.  This allows closer integration with upstream workflows involving GRanges and summarizeOverlaps, and facilitates downstream analyses of the genomic regions of interest.

* Simplified workflow: the wrapper function DESeq() performs all steps for a differential expression analysis. The individual steps are of course also accessible.

* More powerful statistics: incorporation of prior distributions into the estimation of dispersions and fold changes (empirical-Bayes shrinkage). The dispersion shrinkage improves power compared to the old DESeq. The fold change shrinkage helps moderate the otherwise large spread in log fold changes for genes with low counts, while it has a negligible effect on genes with high counts; it may be particularly useful for visualisation, clustering, classification and ordination (PCA, MDS), similar to the variance-stabilizing transformation in the old DESeq. A Wald test for significance is provided as the default inference method; the chi-squared test of the previous version is also available. A manuscript is in preparation.

* Normalization: it is possible to provide a matrix of sample- *and* gene-specific normalization factors, which allows the use of normalization factors from Bioconductor packages such as cqn and EDASeq.
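
To illustrate the normalization item above, here is a minimal, language-neutral sketch in plain Python (not DESeq2 code). It assumes the familiar median-of-ratios size factors as the per-sample starting point and shows how a full matrix of factors generalizes them: each count is divided by its own gene- and sample-specific factor. The function names `size_factors` and `normalize` and the toy counts are hypothetical, chosen only for this example.

```python
import math
from statistics import median


def size_factors(counts):
    """Median-of-ratios size factors, one per sample (column)."""
    n_samples = len(counts[0])
    # Genes with any zero count are skipped, since their log is undefined.
    usable = [row for row in counts if all(c > 0 for c in row)]
    # Log of the per-gene geometric mean across samples.
    log_geo_means = [sum(math.log(c) for c in row) / n_samples for row in usable]
    return [
        math.exp(median(math.log(row[j]) - lgm
                        for row, lgm in zip(usable, log_geo_means)))
        for j in range(n_samples)
    ]


def normalize(counts, factors):
    """Divide each count by its own gene- and sample-specific factor."""
    return [[c / f for c, f in zip(c_row, f_row)]
            for c_row, f_row in zip(counts, factors)]


# Toy data (hypothetical): 3 genes x 2 samples, sample 2 sequenced twice as deep.
counts = [[10, 20], [30, 60], [5, 10]]
sf = size_factors(counts)

# Repeating the per-sample factors for every gene gives the special case in
# which all genes share one factor per sample; a gene-specific matrix would
# simply have different values in each row.
factor_matrix = [list(sf) for _ in counts]
normalized = normalize(counts, factor_matrix)
```

With a matrix of this shape, gene-level corrections (for example for GC content or gene length) would simply replace the repeated rows of `factor_matrix`.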

Examples of usage are provided in the vignette, and more details are available in the manual pages (specifically, those for the DESeq and estimateDispersions functions).
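
For readers unfamiliar with the Wald test mentioned in the statistics item, the following is a generic textbook sketch in plain Python, not the package's implementation. It assumes the estimated log fold change is divided by its standard error and the result is compared to a standard normal distribution; the function name `wald_test` and the input values are hypothetical.

```python
import math


def wald_test(log_fold_change, standard_error):
    """Return the Wald statistic and its two-sided p-value (standard normal)."""
    z = log_fold_change / standard_error
    # erfc(|z| / sqrt(2)) equals 2 * (1 - Phi(|z|)), the two-sided tail area.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical estimate: log fold change 1.0 with standard error 0.5.
z, p = wald_test(1.0, 0.5)
```

Because shrinkage pulls in the noisy fold change estimates of low-count genes, the estimate that enters a statistic of this kind is less dominated by sampling noise.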

Enjoy - 

	Mike, Simon, Wolfgang.


