[BioC] Some problems in combining ExpressionSet objects with the combine() function
James W. MacDonald
jmacdon at med.umich.edu
Wed Jul 27 16:42:13 CEST 2011
You don't give the output from sessionInfo(), but I suspect you have an
outdated version of BioC.
ExpressionSet (storageMode: lockedEnvironment)
assayData: 500 features, 154 samples
element names: exprs
sampleNames: GSM101849 GSM101851 ... GSM327326 (154 total)
varLabels: position satelite
experimentData: use 'experimentData(object)'
1: In alleq(levels(x[[nm]]), levels(y[[nm]])) : 2 string mismatches
2: data frame column 'satelite' levels not all.equal
R version 2.13.0 alpha (2011-03-28 r55140)
Platform: x86_64-unknown-linux-gnu (64-bit)
 LC_CTYPE=en_US.iso885915 LC_NUMERIC=C
 LC_TIME=en_US.iso885915 LC_COLLATE=en_US.iso885915
 LC_MONETARY=C LC_MESSAGES=en_US.iso885915
 LC_PAPER=en_US.iso885915 LC_NAME=C
 LC_ADDRESS=C LC_TELEPHONE=C
 LC_MEASUREMENT=en_US.iso885915 LC_IDENTIFICATION=C
attached base packages:
 stats graphics grDevices utils datasets methods base
other attached packages:
 affy_1.29.3 Biobase_2.11.10
loaded via a namespace (and not attached):
 affyio_1.19.4 preprocessCore_1.13.6
On 7/27/2011 10:17 AM, Manuela Di Russo wrote:
> Hi all,
> I'm struggling with the meta-analysis of microarray data. In the user guide of the bioconductor package "MAMA" there is an exemple of meta-analysis in cancer microarray data concerning the comparison of expression profiles in MSI (microsatelite instable) and MSS (microsatelite stable) colon cancer. Data are gathered from three microarray experiments from public databases
> The original data sets have been preprocessed and subsampled (same set of 500 genes). All data sets have been normalized and are in log2-scale. The corresponding sample sizes for the three datasets, named denmark, australia and japan, are 77 (39 MSI and 38 MSS), 36 (5 MSI and 31 MSS) and 41 (16 MSI and 25 MSS), respectively. Each of the datasets is stored as an ExpressionSet object.
> I would like to perform a meta-analysis combining the three ExpressionSets into one and using the limma package with a convenient design matrix to model the study effect.
> I have two questions:
> 1) I tried using the function "combine" in order to combine the three datasets (two by two, too) but I obtain the following error message:
> Error in data.frame(<S4 object of class "ExpressionSet">,<S4 object of class "ExpressionSet">, : arguments imply differing number of rows: 36, 77, 41.
> I checked the function instructions:
> combine(eSet,eSet): Combine two eSet objects. To be combined, eSets must have identical numbers of featureNames, distinct sampleNames, and identical annotation
> and the three datasets meet these conditions.
> So, can someone help me? Is there a problem in the datasets I downloaded?
> 2) How can I access the code of the function "model.matrix" (package : stats)? Typing the name of the function as I usuallly do seems not to work in this case. Is it a "protected" function?
> Thank you for your help!
> Manuela Di Russo, Ph.D. Student
> Department of Experimental Pathology, MBIE
> University of Pisa
> Pisa, Italy
> e-mail: manuela.dirusso at for.unipi.it
> tel: +39050993538
> [[alternative HTML version deleted]]
> Bioconductor mailing list
> Bioconductor at r-project.org
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
James W. MacDonald, M.S.
University of Michigan
Department of Human Genetics
1241 E. Catherine St.
Ann Arbor MI 48109-5618
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues
More information about the Bioconductor