[BioC] pd.hugene.1.0.st.v1

Mark Robinson mrobinson at wehi.EDU.AU
Fri Jul 31 06:48:18 CEST 2009


Hi all.

I wonder if its makes more sense to have the *transcript* version of  
this package, instead of the *probeset* version available when you  
install via:

source("http://bioconductor.org/biocLite.R")
biocLite("pd.hugene.1.0.st.v1")

It seems like as a default, more people would want gene-level  
summaries for these arrays ... especially since ~200k (~80%) of the  
probesets have 3 probes or less.

Of course I (and everyone around the world) could build this package  
locally from scratch using the transcript CSV, but it seems like there  
would be enough demand for this to make available direct from BioC.   
Just a thought.  Does anyone agree?

Or, am I missing something that will allow me to do gene-level  
analysis from this package?

My session is below.

Thanks in advance.
Mark



----------------------
mac1618:Desktop mrobinson$ wc -l HuGene-1_0-st-v1.na29.*.csv
   257449 HuGene-1_0-st-v1.na29.hg18.probeset.csv
    33317 HuGene-1_0-st-v1.na29.hg18.transcript.csv
----------------------


----------------------
 > library(oligo)
Loading required package: oligoClasses
Loading required package: Biobase

Welcome to Bioconductor

   Vignettes contain introductory material. To view, type
   'openVignette()'. To cite Bioconductor, see
   'citation("Biobase")' and for packages 'citation(pkgname)'.

Loading required package: preprocessCore
Welcome to oligo version 1.8.1
 > cf <- dir(celPath,"CEL")
 > fs <- read.celfiles( file.path(celPath,cf) )
Loading required package: pd.hugene.1.0.st.v1
Loading required package: RSQLite
Loading required package: DBI
Platform design info loaded.
Reading in : rawData/cell_line/HuGene-1_0-st-v1//cancer1.CEL
Reading in : rawData/cell_line/HuGene-1_0-st-v1//cancer2.CEL
Reading in : rawData/cell_line/HuGene-1_0-st-v1//normal1.CEL
Reading in : rawData/cell_line/HuGene-1_0-st-v1//normal2.CEL
 > rmaOligo <- oligo::rma(fs)
Background correcting
Normalizing
Calculating Expression
dmOligo <- exprs(rmaOligo)
dim(rmaOligo)
 > dmOligo <- exprs(rmaOligo)
 > dim(rmaOligo)
Features  Samples
   253002        4
 > sessionInfo()
R version 2.9.0 (2009-04-17)
i386-apple-darwin8.11.1

locale:
en_AU.UTF-8/en_AU.UTF-8/C/C/en_AU.UTF-8/en_AU.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

other attached packages:
[1] pd.hugene.1.0.st.v1_2.4.1 RSQLite_0.7-1
[3] DBI_0.2-4                 oligo_1.8.1
[5] preprocessCore_1.6.0      oligoClasses_1.6.0
[7] Biobase_2.4.1

loaded via a namespace (and not attached):
[1] affxparser_1.15.6 affyio_1.12.0     Biostrings_2.12.1 IRanges_1.2.2
[5] splines_2.9.0
----------------------







------------------------------
Mark Robinson, PhD (Melb)
Epigenetics Laboratory, Garvan
Bioinformatics Division, WEHI
e: m.robinson at garvan.org.au
e: mrobinson at wehi.edu.au
p: +61 (0)3 9345 2628
f: +61 (0)3 9347 0852



More information about the Bioconductor mailing list