[BioC] xps - number of probesets for HuGene10ST array

Michael Walter michael.walter at med.uni-tuebingen.de
Fri May 15 14:28:10 CEST 2009


Dear List,

I have another question with the xps package. I installed root and xps, created the root schemes for the HuGene1.0 array and imported the first CEL files following the example scripts. Everything works fine to that point. However, after RMA normalization I'm missing a couple of probesets. When I use the Affymetrix expression console you get 33297 probesets. 4201 of these are +ve and -ve controls leaving 29096 probesets. With the xps package I have 28926 probesets with only 57 controls. Thus I'm missing 4371 and 227 probeset, respectively. I already checked the archives and found a similar question. However, the number of probesets i get is with exonlevel="all". I attached the code and session info below. My question now is: Where are the missing probesets, since I use the very same plg and clf files for xps and expression console?

Thanks for your thoughts,

Michael


R code:

library(xps)

xpsdir = "C:/xps"
scmdir=paste(xpsdir, "schemes", sep="/")
libdir= paste(xpsdir, "library/HuGene10ST", sep="/")
anndir= paste(xpsdir, "annotation", sep="/")


scheme.hugene10stv1r4.na27 <- import.exon.scheme(
	"Scheme_HuGene10stv1r4_na27_2",
	filedir=scmdir,
	layoutfile=paste(libdir,"HuGene-1_0-st-v1.r4.clf",sep="/"),
	schemefile=paste(libdir,"HuGene-1_0-st-v1.r4.pgf",sep="/"),
	probeset=paste(anndir,"HuGene-1_0-st-v1.na27.2.hg18.probeset.csv",sep="/"),
      transcript=paste(anndir,"HuGene-1_0-st-v1.na27.hg18.transcript.csv",sep="/"),
	verbose=TRUE)

celdir <- getwd()

scheme.HuGene10 <- root.scheme(paste("C:/xps/schemes","Scheme_HuGene10stv1r4_na27_2.root",sep="/"))
data.test <- import.data(scheme.HuGene10, "DataTest", celdir=celdir)
str(data.test)

data.test2 <- attachMask(data.test)
data.test2 <- attachInten(data.test)

data.test2 <- removeInten(data.test2)
data.test2 <- removeMask(data.test2)

data.rma <- rma(data.test2, "tmpdt_Test2RMA", background="antigenomic", normalize=T, 
	exonlevel="all", verbose = FALSE)

expr.rma <- validData(data.rma)

call.dabg <- dabg.call(data.test2, "tmpdt_Test2DABG",exonlevel="all", verbose = FALSE)

Session Info:

R version 2.8.1 (2008-12-22) 
i386-pc-mingw32 

locale:
LC_COLLATE=German_Germany.1252;LC_CTYPE=German_Germany.1252;LC_MONETARY=German_Germany.1252;LC_NUMERIC=C;LC_TIME=German_Germany.1252

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
[1] xps_1.2.6

-- 
Dr. Michael Walter

The Microarray Facility
University of Tuebingen
Calwerstr. 7
72076  Tübingen/GERMANY

Tel.: +49 (0) 7071 29 83210
Fax. + 49 (0) 7071 29 5228

Confidentiality Note:\ This message is intended only for...{{dropped:9}}



More information about the Bioconductor mailing list