[BioC] problem with GO annotation data
    Nianhua Li 
    nli at fhcrc.org
       
    Fri Jul 28 03:16:54 CEST 2006
    
    
  
Hi, Shiliang,
You were right that AnnBuilder did have a bug. Thanks for the report!
The bug is related to Weijun's question: 
http://thread.gmane.org/gmane.science.biology.informatics.conductor/9227/focus=9232
When baseType is "ll", the baseFile can't be parsed correctly because 
some parsers assume Entrez Gene is at the 3rd column of the base file. 
Thanks for John's patch, AnnBuilder just quietly converts the base file 
into three columns when baseType is "ll". So, weijun's problem got 
solved. Unfortunately, the parser used for "go2probe" mapping assumes a 
two-column setting for baseType "ll", therefore can't work correctly 
with the patch.
Now the problem has been solved. You can either get the up-to-date 
AnnBuilder (v 1.11.6) from svn or wait still Saturday noon to download 
it using biocLite or from website. You can also modify your local source 
AnnBuilder/R/ABPkgBuilder.R if you are in a hurry:
===================================================================
--- ABPkgBuilder.R      (v1.11.5 or before)
+++ ABPkgBuilder.R      (v1.11.6)
# Map GO ids to probe ids that are directly associated with the GO ids
 mapGO2Probe <- function(eg, baseMapType){
-     if(baseMapType == "ll"){
-            parser(eg) <- file.path(.path.package("AnnBuilder"),
-                                    "scripts", "GO2ProbeParser4LL")
-        }else{
-            parser(eg) <- file.path(.path.package("AnnBuilder"),
+        parser(eg) <- file.path(.path.package("AnnBuilder"),
                                     "scripts", "GO2ProbeParser")
-        }
         options(show.error.messages = FALSE)
         go2Probe <- try(parseData(eg, eg at go, ncol = 3))
         options(show.error.messages = TRUE)
====================================================================
Also the probe number mismatch is because your base file has duplicated 
probeset identifiers.
help it works this time :)
nianhua
    
    
More information about the Bioconductor
mailing list