[BioC] Problem with invalid GO term in HyperGResult object - NOT solved

Jenny Drnevich drnevich at uiuc.edu
Thu Sep 20 18:33:58 CEST 2007


Hi Herve,

Thanks for letting me know the new Windows binaries were available. 
However, I traced the problem to the new annotation package for 
ath1121501 - it is USELESS for GOstats testing!! Arabidopsis doesn't 
use EntrezIDs, but instead uses AGI locus identifier, given in the 
ath1121501ACCNUM environment. This environment used to give unique 
IDs for almost all probe sets, but now all of the listings are either 
"multiple" or NA:

 > probes <- ls(ath1121501ACCNUM)
 > probes[1:10]
  [1] "244901_at"   "244902_at"   "244903_at"   "244904_at"   "244905_at"
  [6] "244906_at"   "244907_at"   "244908_at"   "244909_at"   "244910_s_at"
 > length(probes)
[1] 22810
 > locusList <- unique(unlist(mget(probes, ath1121501ACCNUM)))
 > length(locusList)
[1] 2
 > locusList
[1] "multiple" NA


I realize this is probably came from the Arabidopsis database used to 
create the annotation package, but is there any way to "fix" this, or 
to put up the older annotation package?  The old one had a few 
hundred "multiple" entries, but I figured it wouldn't matter that 
much to throw them out. Any other ideas for ways around this problem?

Thanks,
Jenny

 > sessionInfo()
R version 2.6.0 Under development (unstable) (2007-08-28 r42679)
i386-pc-mingw32

locale:
LC_COLLATE=English_United States.1252;LC_CTYPE=English_United 
States.1252;LC_MONETARY=English_United 
States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252

attached base packages:
[1] splines   tools     stats     graphics  grDevices utils     datasets
[8] methods   base

other attached packages:
  [1] limma_2.11.13        ath1121501_1.99.10   GOstats_2.3.17
  [4] Category_2.3.39      genefilter_1.15.11   survival_2.32
  [7] RBGL_1.13.6          annotate_1.15.11     xtable_1.5-1
[10] GO.db_1.99.3         AnnotationDbi_0.99.1 RSQLite_0.6-2
[13] DBI_0.2-3            graph_1.15.20        Biobase_1.15.34

loaded via a namespace (and not attached):
[1] cluster_1.11.7     simpleaffy_2.13.01





Jenny Drnevich, Ph.D.

Functional Genomics Bioinformatics Specialist
W.M. Keck Center for Comparative and Functional Genomics
Roy J. Carver Biotechnology Center
University of Illinois, Urbana-Champaign

330 ERML
1201 W. Gregory Dr.
Urbana, IL 61801
USA

ph: 217-244-7355
fax: 217-265-5066
e-mail: drnevich at uiuc.edu



More information about the Bioconductor mailing list