[BioC] pdInfoBuilder fails on the new Arabidopsis Gene ST 1.0 & 1.1 arrays

Groot, Philip de philip.degroot at wur.nl
Wed Mar 14 12:11:53 CET 2012


Dear Benilton,



I tried to create the pdInfoBuilder packages for the Affymetrix  Arabidopsis ST 1.0 arrays and did run into a problem that I cannot solve myself. I guess that it also affects the Arabidopsis ST 1.1 cartridges, so you might want to take a look. I pasted the appropriate information below.



Regards,



Dr. Philip de Groot Ph.D.
Bioinformatics Researcher

Wageningen University / TIFN
Nutrigenomics Consortium
Nutrition, Metabolism & Genomics Group
Division of Human Nutrition
PO Box 8129, 6700 EV Wageningen
Visiting Address: Erfelijkheidsleer: De Valk, Building 304
Dreijenweg 2, 6703 HA Wageningen
Room: 0052a
T: +31-317-485786
F: +31-317-483342
E-mail: Philip.deGroot at wur.nl<mailto:Philip.deGroot at wur.nl>
Internet: http://www.nutrigenomicsconsortium.nl<http://www.nutrigenomicsconsortium.nl/>
http://humannutrition.wur.nl/
https://madmax.bioinformatics.nl/





Information regarding the chip on the Affymetrix website:
Design time annotation file:
http://www.affymetrix.com/Auth/analysis/downloads/na32/wtgene/AraGene-1_0-st-v1.annotations.design-time.20111019.zip

Library files:
http://www.affymetrix.com/Auth/analysis/downloads/lf/wt/AraGene-1_0-st-v1/AraGene-1_0-st_rev01.zip



The code that I utilized in R:

> library("pdInfoBuilder")
Loading required package: Biobase

Welcome to Bioconductor

  Vignettes contain introductory material. To view, type
  'browseVignettes()'. To cite Bioconductor, see
  'citation("Biobase")' and for packages 'citation("pkgname")'.

Loading required package: RSQLite
Loading required package: DBI
Loading required package: affxparser
Loading required package: oligo
Loading required package: oligoClasses
Welcome to oligoClasses version 1.16.0
================================================================================
Large dataset support for 'oligo/crlmm': Disabled
     - Load 'ff'
================================================================================
Parallel computing support for 'oligo/crlmm': Disabled
     - Load 'ff'
     - Load 'snow'
     - Use options(cluster=makeCluster(...))
================================================================================
Loading package bit 1.1-8
package:bit (c) 2008/2009 Jens Oehlschlaegel (GPL-2)
creators: bit bitwhich
coercion: as.logical as.integer as.bit as.bitwhich which
operator: ! & | xor != ==
querying: print length any all min max range sum summary
bit access: length<- [ [<- [[ [[<-
for more help type ?bit
Loading package ff2.2-5
- getOption("fftempdir")=="/tmp/RtmpBnumMg"

- getOption("ffextension")=="ff"

- getOption("ffdrop")==TRUE

- getOption("fffinonexit")==TRUE

- getOption("ffpagesize")==65536

- getOption("ffcaching")=="mmnoflush"  -- consider "ffeachflush" if your system stalls on large writes

- getOption("ffbatchbytes")==16777216 -- consider a different value for tuning your system

- getOption("ffmaxbytes")==536870912 -- consider a different value for tuning your system

================================================================================
Welcome to oligo version 1.18.1
================================================================================
>
> pgfFile<-"./AraGene-1_0-st.pgf"
> clfFile<-"./AraGene-1_0-st.clf"
> probeFile<-"./AraGene-1_0-st-v1.design-time.20111019.probeset.csv"
> transFile<-"./AraGene-1_0-st-v1.design-time.20111019.transcript.csv"
> mpsFile <- "./AraGene-1_0-st.mps"
>
> pkg<-new("AffyGenePDInfoPkgSeed", version="1.0.0", author="Philip de Groot", email="Philip.deGroot at wur.nl<mailto:Philip.deGroot at wur.nl>", chipName="aragene", genomebuild="Tair10", species="Arabidopsis thaliana" , clfFile=clfFile, pgfFile=pgfFile, coreMps=mpsFile, probeFile=probeFile, transFile=transFile)
>
> makePdInfoPackage(pkg, destDir=".")
================================================================================
Building annotation package for Affymetrix Gene ST Array
PGF.........: AraGene-1_0-st.pgf
CLF.........: AraGene-1_0-st.clf
Probeset....: AraGene-1_0-st-v1.design-time.20111019.probeset.csv
Transcript..: AraGene-1_0-st-v1.design-time.20111019.transcript.csv
Core MPS....: AraGene-1_0-st.mps
================================================================================
Parsing file: AraGene-1_0-st.pgf... OK
Parsing file: AraGene-1_0-st.clf... OK
Creating initial table for probes... OK
Creating dictionaries... OK
Parsing file: AraGene-1_0-st-v1.design-time.20111019.probeset.csv... OK
Parsing file: AraGene-1_0-st.mps... OK
Creating package in ./pd.aragene.1.0.st
Inserting 1200 rows into table chrom_dict... OK
Inserting 5 rows into table level_dict... OK
Inserting 8 rows into table type_dict... OK
Inserting 181946 rows into table core_mps... OK
Inserting 179330 rows into table featureSet... Error in sqliteExecStatement(con, statement, bind.data) :
  RS-DBI driver: (RS_SQLite_exec: could not execute: datatype mismatch)
In addition: Warning messages:
1: In is.na(x) : is.na() applied to non-(list or vector) of type 'NULL'
2: In is.na(x) : is.na() applied to non-(list or vector) of type 'NULL'

> sessionInfo()
R version 2.14.0 (2011-10-31)
Platform: x86_64-unknown-linux-gnu (64-bit)

locale:
 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
 [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
 [7] LC_PAPER=C                 LC_NAME=C
 [9] LC_ADDRESS=C               LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] pdInfoBuilder_1.18.0 oligo_1.18.1         oligoClasses_1.16.0
[4] affxparser_1.26.4    RSQLite_0.11.1       DBI_0.2-5
[7] Biobase_2.14.0

loaded via a namespace (and not attached):
[1] affyio_1.22.0         Biostrings_2.22.0     bit_1.1-8
[4] ff_2.2-5              IRanges_1.12.6        preprocessCore_1.16.0
[7] splines_2.14.0        zlibbioc_1.0.1



More information about the Bioconductor mailing list