[BioC] RNAseq expression threshold using DESeq2 normalised counts

Michael Love michaelisaiahlove at gmail.com
Sun Jul 20 17:07:57 CEST 2014


hi Aditi,

I don't have any concrete suggestions for this -- how to determine if
a gene is expressed or not.

Mike

On Sat, Jul 19, 2014 at 11:51 AM, Aditi [guest] <guest at bioconductor.org> wrote:
> Hi Mike,
>
> This is a question similar to posted on biostars a few months ago (https://www.biostars.org/p/94680/) that you came across.
>
> I want to determine if a gene is expressed or not using RNAseq data. Though there is quite a discussion on it with papers defining range of FPKM values (generally generated using cufflinks ) as a cutoff to say that a gene is expressed.
>
> Can we rather use normalised counts from DESeq2- look at the distribution and determine a suitable cutoff. Better still if one has negative controls like spike ins in the RNA protocol use that a cutoff ? ( I unfortunately dont have spike in control data)
>
> Or do you think one should extract FPKM values and then use maybe a zFPKM transformation (http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000598) like most people are suggesting
>
> I look forward to your opinion and suggestion,
>
> Thanks !
> Aditi
>
>
>
>
>
>  -- output of sessionInfo():
>
>  -- output of sessionInfo():
>
> R version 3.1.0 (2014-04-10)
> Platform: x86_64-unknown-linux-gnu (64-bit)
>
> locale:
>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
>  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
>  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
>  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C
>  [9] LC_ADDRESS=C               LC_TELEPHONE=C
> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>
> attached base packages:
> [1] parallel  stats     graphics  grDevices utils     datasets  methods
> [8] base
>
> other attached packages:
>  [1] DESeq2_1.4.5            RcppArmadillo_0.4.300.0 Rcpp_0.11.1
>  [4] EDASeq_1.10.0           aroma.light_2.0.0       matrixStats_0.8.14
>  [7] ShortRead_1.22.0        GenomicAlignments_1.0.1 BSgenome_1.32.0
> [10] Rsamtools_1.16.0        GenomicRanges_1.16.3    GenomeInfoDb_1.0.2
> [13] Biostrings_2.32.0       XVector_0.4.0           IRanges_1.22.7
> [16] BiocParallel_0.6.1      Biobase_2.24.0          BiocGenerics_0.10.0
>
> loaded via a namespace (and not attached):
>  [1] annotate_1.42.0      AnnotationDbi_1.26.0 BatchJobs_1.2
>  [4] BBmisc_1.6           bitops_1.0-6         brew_1.0-6
>  [7] codetools_0.2-8      DBI_0.2-7            DESeq_1.16.0
> [10] digest_0.6.4         fail_1.2             foreach_1.4.2
> [13] genefilter_1.46.1    geneplotter_1.42.0   grid_3.1.0
> [16] hwriter_1.3          iterators_1.0.7      lattice_0.20-29
> [19] latticeExtra_0.6-26  locfit_1.5-9.1       plyr_1.8.1
> [22] RColorBrewer_1.0-5   R.methodsS3_1.6.1    R.oo_1.18.0
> [25] RSQLite_0.11.4       sendmailR_1.1-2      splines_3.1.0
> [28] stats4_3.1.0         stringr_0.6.2        survival_2.37-7
> [31] tools_3.1.0          XML_3.98-1.1         xtable_1.7-3
> [34] zlibbioc_1.10.0
>
> --
> Sent via the guest posting facility at bioconductor.org.



More information about the Bioconductor mailing list