[BioC] Questions about GOseq after DESeq2

Bernd Klaus bernd.klaus at embl.de
Thu Feb 20 16:19:01 CET 2014


Dear Amadine,

concerning your second question: I would start with all DE genes,
If it makes sense in your assay then you could also test up- and down regulated 
genes separately, using the same methodology.

GO analysis is very exploratory anyway, so I guess you have a lot of
"liberties' there.

Best wishes,

Bernd

On Feb 20, 2014, at 3:03 PM, amandine.fournier at chu-lyon.fr wrote:

> 
> Dear Bioconductor users and developers,
> 
> I have 2 questions regarding a gene ontology analysis of RNA-Seq data.
> 
> I first used the DESeq2 package to perform the differential analysis of my data and it resulted in a list of about 500 DEG. Now I would like to perform a gene ontology analysis on this dataset with the GOseq package.
> 
> According to the GOseq user manual, this package requires 2 pieces of information : the differentially expressed genes (= my list of 500 genes) and the measured genes. These are defined as "all genes for which RNA-seq data was gathered for your experiment".
> 
> As you known, DESeq2 filters out low counting genes and outliers. So I wonder if I should consider :
> - all sequenced genes including low expressed genes and outliers, 
> - or the sequenced genes whithout outliers (but including low expressed genes)
> - or only remaining genes after low count filtering (final assayed genes) ?
> 
> I don't know which step filters outliers out. It seems that they are taking into account to estimate library sizes, but I am not sure if they are filtered before or after estimating dispersion values.
> Low expressed genes are filtered out at the end of the analysis, juste before calculating FDR, so they are taking into account for calculating library sizes, estimating dispersion values, fitting the GLM model and testing, but not in the final FDR.
> In brief, I wonder if outliers and low expressed genes can be considered as "assayed" in the DESeq2 analysis.
> 
> Do you have an opinion about this ? Is there a commonly accepted / advised methodology for that ?
> 
> The second question is : should I analyse over-expressed genes and under-expressed genes together or separately ?
> 
> Many thanks in advance.
> Best regards,
> Amandine
> 
> -----
> Amandine Fournier
> Lyon Neuroscience Research Center
> & Lyon Civil Hospital (France)
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor



More information about the Bioconductor mailing list