[BioC] justRMA?

James W. MacDonald jmacdon at uw.edu
Mon Dec 10 15:26:12 CET 2012


Hi Bhargavi,

On 12/9/2012 1:38 AM, Bhargavi Duvvuri wrote:
> Hello James,
>
> I have tested as you mentioned below:
>
> > nrow(exprs(justRMA(cdfname = "PrimeViewHsREFSEQcdf")))
> Loading required package: AnnotationDbi
>
> [1] 34901
>
> This number is exactly the same as in the CDF file and matches with 
> that of RMA.
>
> I get 34901 rows when I process single CEL file.  However, when I do 
> justRMA with 350 CEL files, number of rows in the output are 23068. 
> Why would this happen?

It shouldn't. I don't have 350 of the same type of celfile to test this 
with, so I would suggest trying a subset (like 100) and see how that 
goes. You might be getting an error that you didn't notice.

Best,

Jim


>
> Please advise.
>
> Thank you
>
> Bhargavi
>
> On Fri, Dec 7, 2012 at 4:42 PM, James W. MacDonald <jmacdon at uw.edu 
> <mailto:jmacdon at uw.edu>> wrote:
>
>     Hi Bargavi,
>
>
>     On 12/7/2012 4:08 PM, Bhargavi Duvvuri wrote:
>
>         Hello,
>
>         I am using justRMA for processing 350 CEL files. When I does
>         with RMA I get
>         34901 probe intensities  which match the CDF file. However,
>         when I run
>         justRMA, I get only 23068 probe intensities. I am not sure why
>         would this
>         happen?
>
>
>     It shouldn't, and we have done extensive testing to ensure that
>     the results from justRMA() are identical to rma(). As an example:
>
>     > nrow(exprs(rma(ReadAffy(cdfname = "mogene10stmmrefseqcdf"))))
>     Background correcting
>     Normalizing
>     Calculating Expression
>     [1] 28312
>     > nrow(exprs(justRMA(cdfname = "mogene10stmmrefseqcdf")))
>     [1] 28312
>
>     So I get the same number of rows using both methods. Now check
>     that I get the same exact values:
>
>     > all.equal(exprs(rma(ReadAffy(cdfname =
>     "mogene10stmmrefseqcdf"))),exprs(justRMA(cdfname =
>     "mogene10stmmrefseqcdf")))
>     Background correcting
>     Normalizing
>     Calculating Expression
>     [1] TRUE
>
>     So it looks OK to me.
>
>     Best,
>
>     Jim
>
>     > sessionInfo()
>     R version 2.15.1 (2012-06-22)
>     Platform: x86_64-unknown-linux-gnu (64-bit)
>
>     locale:
>      [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
>      [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
>      [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
>      [7] LC_PAPER=C                 LC_NAME=C
>      [9] LC_ADDRESS=C               LC_TELEPHONE=C
>     [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>
>     attached base packages:
>     [1] stats     graphics  grDevices utils     datasets  methods   base
>
>     other attached packages:
>     [1] mogene10stmmrefseqcdf_16.0.0 mogene11stmmrefseqcdf_16.0.0
>     [3] mogene10stv1cdf_2.11.0       AnnotationDbi_1.20.3
>     [5] affy_1.36.0                  Biobase_2.18.0
>     [7] BiocGenerics_0.4.0
>
>     loaded via a namespace (and not attached):
>      [1] affyio_1.26.0         BiocInstaller_1.8.3   DBI_0.2-5
>      [4] IRanges_1.16.4        parallel_2.15.1       preprocessCore_1.20.0
>      [7] RSQLite_0.11.2        stats4_2.15.1         tools_2.15.1
>     [10] zlibbioc_1.4.0
>     >
>
>
>
>         Below is the code I am using:
>
>         library(affy)
>             setwd('/home/run/testcel')
>              expr.vals<- justRMA(cdfname = "primeviewhsrefseqcdf")
>              write.exprs(expr.vals, file= 'Customoutput.csv', sep=",",
>         row.names=F,
>         col.names=T, quote=F)
>
>         Could you please advise me here on how to modify the script so
>         that I
>         get normalized probe intensities for all the  probe sets?
>
>         Thank you for your attention and time.
>
>         Bhargavi
>
>                 [[alternative HTML version deleted]]
>
>         _______________________________________________
>         Bioconductor mailing list
>         Bioconductor at r-project.org <mailto:Bioconductor at r-project.org>
>         https://stat.ethz.ch/mailman/listinfo/bioconductor
>         Search the archives:
>         http://news.gmane.org/gmane.science.biology.informatics.conductor
>
>
>     -- 
>     James W. MacDonald, M.S.
>     Biostatistician
>     University of Washington
>     Environmental and Occupational Health Sciences
>     4225 Roosevelt Way NE, # 100
>     Seattle WA 98105-6099
>
>
>
>
> -- 
> Bhargavi Duvvuri M.Sc, Ph.D
> TAS/CAN Postdoctoral Research Fellow
> The Hospital for Sick Children Research Institute
> Division of Cell Biology
> MARS Centre - Toronto Medical Discovery Tower
> 101 College street, Rm 12-401 Bay C
> Toronto, ON, Canada M5G 1L7
> Phone: 416-813-7780
> Fax: 416-813-8883
> email: bhargavi.duvvuri at sickkids.ca <mailto:bhargavi.duvvuri at sickkids.ca>
>

-- 
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099



More information about the Bioconductor mailing list