[BioC] AnnBuilder package: problem with gbNRef

Craddock, Richard C. (CDC/NCID/VR) (CTR) cmi5 at cdc.gov
Wed Jul 26 00:22:45 CEST 2006


Sorry to send the original in html.  Hopefully this works better ....

-cameron

------------------------------------------------------------------------
----

Hello all,

I am having a similar problem to Weijun's but the posted fixes are not
working for me.  I am trying to build a "gbNRef" package but the
annotation information is missing.  I have upgraded to AnnBuilder 1.11.5
from svn, and I have tried both the two and three column base file.  I
have intermittent problems connecting to the NCBI ftp site, so I copied
the necessary files locally.  Here is the code I am using:

myBase <- "/home/cameron/microarray_data/mwg40ka_basname_named.tdf"
myBaseType <- "gbNRef"
 

mySrcUrls <- getSrcUrl("all", "Homo sapiens")
mySrcUrls[[7]]<-"file:///home/cameron/microarray_data/annotate"
mySrcUrls[[2]]<-"file:///home/cameron/microarray_data/annotate/UniGene"
mySrcUrls[[4]]<-"file:///home/cameron/microarray_data/annotate/KEGG/path
ways"
 

myDir <- "/home/cameron/microarray_data/mwgAnnotate"
 

ABPkgBuilder( baseName=myBase, srcUrls=mySrcUrls,
baseMapType=myBaseType, pkgName="mwg40kA", pkgPath=myDir, organism="Homo
sapiens", version="0.10", author=list(authors="R. Cameron Craddock",
maintainer="R. Cameron Craddock <cmi5 at cdc.gov>"), fromWeb=TRUE)



Here is the output from ABPkgBuilder:

Attaching package: 'GO'
 
 
        The following object(s) are masked from package:AnnBuilder :
 
         GO
 
Read 1 item
Read 1 item
Failed to get data from URL:
ftp://ftp.genome.ad.jp/pub/kegg/pathways/hsa/hsa00195.gene
Failed to get data from URL:
ftp://ftp.genome.ad.jp/pub/kegg/pathways/hsa/hsa00231.gene
Failed to get data from URL:
ftp://ftp.genome.ad.jp/pub/kegg/pathways/hsa/hsa00253.gene
   ...
   ...   ( removed a bunch of others )
   ...
Failed to get data from URL:
ftp://ftp.genome.ad.jp/pub/kegg/pathways/hsa/hsa07217.gene
[1] "4028 2 2"

The following data sets have been added to the database and will be
removed:
 [1]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kAACCNUM.rd
a"
 [2]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kACHRLENGTH
S.rda"
 [3]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kACHRLOC.rd
a"
 [4]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kAENZYME.rd
a"
 [5]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kALOCUSID.r
da"
 [6]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kAMAPCOUNTS
.rda"
 [7]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kAORGANISM.
rda"
 [8]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kAPATH.rda"
 [9]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kAPFAM.rda"
[10]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kAPROSITE.r
da"
[11]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kAQCDATA.rd
a"
[12]
"/home/cameron/microarray_data/mwgAnnotate/mwg40kA/data/mwg40kAQC.rda"

None of the files listed in the above warnings exist at the specified
location.  I verified this using ncftp.  After ABPkgBuilder finishes I
perform the following steps:

R CMD check mwg40kA/
(only warning is that data directory is empty)

R CMD build mwg40kA/
(no errors, no warnings)

R CMD INSTALL mwg30kA_0.10.tar.gz

* Installing *source* package 'mwg40kA' ...
** R
** data
**  moving datasets to lazyload DB
** help
 >>> Building/Updating help pages for package 'mwg40kA'
     Formats: text html latex example
  mwg40kA                           text    html    latex
  mwg40kAACCNUM                     text    html    latex   example
  mwg40kACHRLENGTHS                 text    html    latex   example
  mwg40kACHRLOC                     text    html    latex   example
  mwg40kAENZYME                     text    html    latex   example
  mwg40kALOCUSID                    text    html    latex   example
  mwg40kAORGANISM                   text    html    latex   example
  mwg40kAPATH                       text    html    latex   example
  mwg40kAPFAM                       text    html    latex   example
  mwg40kAPROSITE                    text    html    latex   example
  mwg40kAQC                         text    html    latex
  mwg40kAQCDATA                     text    html    latex
** building package indices ...
* DONE (mwg40kA)

This is what I get when I load the library:

> library(mwg40kA)
> mwg40kA()
 

 

Quality control information for  mwg40kA
Date built: Created: Tue Jul 25 16:48:06 2006
 

Number of probes: 20160
Probe number missmatch: None
Probe missmatch: None
Mappings found for probe based rda files:
         mwg40kAACCNUM found 19760 of 20160
         mwg40kACHRLOC found 0 of 20160
         mwg40kAENZYME found 0 of 20160
         mwg40kALOCUSID found 0 of 20160
         mwg40kAPATH found 0 of 20160
Mappings found for non-probe based rda files:
         mwg40kACHRLENGTHS found 25
         mwg40kAORGANISM found 1
         mwg40kAPFAM found 0
         mwg40kAPROSITE found 0
 

The mwg40kAACCNUM environment matches my basefile.  Can anyone suggest a
solution to my problem?

Thanks for your help,

Cameron



More information about the Bioconductor mailing list