[BioC] How remove controls probes - Affymetrix Gene

James W. MacDonald jmacdon at med.umich.edu
Fri Jan 13 19:04:37 CET 2012


Hi Sophie,

On 1/12/2012 11:01 AM, Sophie Lamarre [guest] wrote:
> Hello,
>
> I work on affymetrix mouse gene 1.0 ST.
>
> I want to know how remove controls probes on Affymetrix Gene.
> In my studie, I downloaded two files on Affymetrix website:
> - The Probeset Annotations, CSV Format, Release 32
>   file (NetAffx Analysis Center ->  Exon/Gene ->  NetAffy Query ->  "Mouse Gene 1.0 ST")
> - The Transcript Cluster Annotations, CSV, Release 32 file (NetAffx Analysis Center ->  Exon/Gene ->  NetAffy Query ->  "Mouse Gene 1.0 ST")
>
> With the first, I had 28 856 main probesets and with second 28 350 main probesets.
>
> I add that I run a RMA option "core" with the oligo package. I just want a probe for a probeset.

I'm not sure what you are asking here. You can't have a probe for a 
probeset, as there are multiple probes in each probeset. The default for 
rma() in oligo is to summarize the probesets at the transcript level, 
which is in general what most people want. You can summarize at the 
'probeset' level, which is something like summarizing at the exon level, 
but if I am not mistaken, there can be more than one probeset per exon, 
so that gets complicated. Summarizing at the transcript level is an 
easier path to take.

>
> Which file I have to use? (I did'nt use the command getNetAffy because I realized that I lost more probesets! (There were only 28 270 main probes)).

Rather than getting annotation data from netaffx and then struggling 
with the format, it is easier to install the 
mogene10sttranscriptcluster.db package, and use that for annotating your 
data. There are vignettes in the AnnotationDbi package that show how to 
use that package.

Best,

Jim



>
> Thank you for advance for your answer and your help.
>
> Sophie LAMARRE
>
>   -- output of sessionInfo():
>
> R version 2.13.0 (2011-04-13)
> Platform: i386-pc-mingw32/i386 (32-bit)
>
> locale:
> [1] LC_COLLATE=French_France.1252  LC_CTYPE=French_France.1252    LC_MONETARY=French_France.1252
> [4] LC_NUMERIC=C                   LC_TIME=French_France.1252
>
> attached base packages:
> [1] grid      stats     graphics  grDevices utils     datasets  methods   base
>
> other attached packages:
>   [1] pd.mogene.1.0.st.v1_3.2.0 oligo_1.16.2              preprocessCore_1.14.0
>   [4] GOstats_2.18.0            RSQLite_0.9-4             DBI_0.2-5
>   [7] graph_1.30.0              Category_2.18.0           genefilter_1.34.0
> [10] gplots_2.8.0              caTools_1.12              bitops_1.0-4.1
> [13] gdata_2.8.2               gtools_2.6.2              geneplotter_1.30.0
> [16] lattice_0.19-23           annotate_1.30.0           AnnotationDbi_1.14.1
> [19] RColorBrewer_1.0-5        ArrayTools_1.12.0         affy_1.30.0
> [22] limma_3.8.3               oligoClasses_1.14.0       Biobase_2.12.2
>
> loaded via a namespace (and not attached):
>   [1] affxparser_1.24.0 affyio_1.20.0     Biostrings_2.20.3 bit_1.1-7         ff_2.2-3
>   [6] GO.db_2.5.0       GSEABase_1.14.0   IRanges_1.10.6    RBGL_1.28.0       splines_2.13.0
> [11] survival_2.36-5   tools_2.13.0      XML_3.4-2.2       xtable_1.5-6
>
> --
> Sent via the guest posting facility at bioconductor.org.
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

-- 
James W. MacDonald, M.S.
Biostatistician
Douglas Lab
University of Michigan
Department of Human Genetics
5912 Buhl
1241 E. Catherine St.
Ann Arbor MI 48109-5618
734-615-7826

**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues 



More information about the Bioconductor mailing list