[BioC] Annotation databases--unable to get annotations

James W. MacDonald jmacdon at uw.edu
Wed Oct 10 21:26:26 CEST 2012


Hi Manjula,

On 10/10/2012 2:31 PM, Manjula Kasoji [guest] wrote:
> Has anyone used the mogene10stprobeset.db annotation package?
>
> I have a list of significant probeset ids that I obtained from running an analysis in the limma package. When I try to use mget() to retrieve the corresponding gene names only a couple of probesets get annotated. I've tried with several different gene lists. In a gene list of about 300 genes, only 2 get annotated at the most. The rest of the probesets get a "NA" for the annotation.
>
> Here is my R code:

That's not all your code, just some. You might also note that you used 
the affy package to summarize, in which case you have summarized at the 
transcript level, not the probeset level. So you are using the wrong 
annotation package. Try the mogene10sttranscriptcluster.db package and 
see how that works.

Best,

Jim


>
> ## load library
>> library("mogene10stprobeset.db")
> ## map probe ids to gene names...
>
> #extract ids from signifcant gene list
> ids=sig.gene.list[["ID"]]
>
> #if not in character form, change to character
> ids = as.character(ids)
>
> #retrieve gene names from annotation package for ids in signficant gene list
> mget(ids, mogene10stprobesetGENENAME, ifnotfound=NA)
>
>
> I've tried ENTREZID, SYMBOL, and GO, all of them return annotations just for a few probesets.
>
> I'm using R version R 2.15.1 for Macs.
>
> Any help would be appreciated!!
>
> Thanks!
>
>   -- output of sessionInfo():
>
>> sessionInfo()
> R version 2.15.1 (2012-06-22)
> Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)
>
> locale:
> [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
>
> attached base packages:
> [1] stats     graphics  grDevices utils     datasets  methods   base
>
> other attached packages:
>   [1] hgu95av2.db_2.7.1           org.Hs.eg.db_2.8.0          limma_3.14.0                mogene10stv1cdf_2.11.0      affy_1.36.0                 mogene10stprobeset.db_8.0.1 org.Mm.eg.db_2.8.0
>   [8] RSQLite_0.11.2              DBI_0.2-5                   AnnotationDbi_1.20.0        Biobase_2.18.0              BiocGenerics_0.4.0          BiocInstaller_1.8.2
>
> loaded via a namespace (and not attached):
> [1] affyio_1.26.0         IRanges_1.16.2        parallel_2.15.1       preprocessCore_1.20.0 stats4_2.15.1         tools_2.15.1          zlibbioc_1.4.0
> --
> Sent via the guest posting facility at bioconductor.org.
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

-- 
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099



More information about the Bioconductor mailing list