[BioC] HGU133Plus2 CDF vs hgu133plus2hsentrezgcdf CDF (30% difference in results)

Mahes Muniandy [guest] guest at bioconductor.org
Sat Sep 13 20:31:55 CEST 2014

My name is Mahes Muniandy and I am a doctoral student. I have been analysing Affymetrix HGU133Plus2 cel files to determine differential expressions in twin pairs (within pair differences). I have used affy, gcrma, nsfilter and limma to do my analysis. I have run my analysis using the HGU133plus2 CDF available in biocondutor and then tried the whole analysis again using the HGU133plus2 cdf from Brainarray. The limma results differ significantly (2351 differentially expressed genes for the former and 2700  genes for the latter analysis). 630 genes (about 30%) from the 2351 genes do not exist in the list of 2700 genes.

I have read "Evolving Gene/Transcript Definitions Significantly Alter the Interpretation of GeneChip Data  M. Dai  et al." and see some convincing arguments there. But, I am confused with which limma results to go with. Could you advise me on the guiding principles that I should follow in order to decide which cdf to use. I do realise that the onus is on me to decide but sadly, I am quite lost in this matter. I would appreciate any help available.

Many Thanks,
Mahes Muniandy,
Uni. Helsinki

 -- output of sessionInfo(): 

> sessionInfo()
R version 3.0.2 (2013-09-25)
Platform: x86_64-unknown-linux-gnu (64-bit)

 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
 [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
 [7] LC_PAPER=en_US.UTF-8       LC_NAME=C
 [9] LC_ADDRESS=C               LC_TELEPHONE=C

attached base packages:
[1] parallel  stats     graphics  grDevices utils     datasets  methods
[8] base

other attached packages:
[1] genefilter_1.44.0  limma_3.18.13      gcrma_2.34.0       affy_1.40.0
[5] Biobase_2.22.0     BiocGenerics_0.8.0

loaded via a namespace (and not attached):
 [1] affyio_1.30.0         annotate_1.40.1       AnnotationDbi_1.24.0
 [4] BiocInstaller_1.12.1  Biostrings_2.30.1     DBI_0.2-7
 [7] IRanges_1.20.7        preprocessCore_1.24.0 RSQLite_0.11.4
[10] splines_3.0.2         stats4_3.0.2          survival_2.37-7
[13] XML_3.98-1.1          xtable_1.7-3          XVector_0.2.0
[16] zlibbioc_1.8.0

Sent via the guest posting facility at bioconductor.org.

More information about the Bioconductor mailing list