[BioC] Updated affy package? ReadAffy/ AffyBatch has changed and breaks my pipeline!

James W. MacDonald jmacdon at med.umich.edu
Mon Feb 6 16:35:50 CET 2012


Hi Benjamin,

You need to supply us with a small reproducible example of the problem 
you are experiencing, so we can either track down the error, or explain 
how you can fix things on your end.

Best,

Jim



On 2/6/2012 7:43 AM, Benjamin Høyer wrote:
> Hi
>
> I have made a pipeline for automating analysis of custom affymetrix
> chips for a colleague.  After an R-update, the pipeline no longer
> runs.  Working backwards from where the error occurs, I have found
> that the structure produced by ReadAffy() is no longer the same.  I am
> using exactly the same data set.  (The error itself a failed assign()
> due to a locked binding, but I don't think that's related.)
>
> Below are the lists of packages installed before and after the update,
> with version numbers.  The 'old' R version is 2.13.2, the 'new' one is
> 2.14.1.
>
> OldPackVers<- structure(c("1.30.0", "1.20.0", "1.28.5", "1.14.1", "1.30.0",
> "0.0.1", "0.0.1", "2.13.2", "2.12.2", "2.20.4", "1.0-4.1", "1.3-2",
> "1.12", "7.3-3", "1.14.1", "0.2-8", "2.13.2", "1.6.2", "2.13.2",
> "0.2-5", "1.30.0", "1.6", "1.2.8", "0.8-48", "2.24.1", "2.8.2",
> "1.2.5", "2.10.1", "2.13.2", "2.13.2", "2.13.2", "2.6.2", "1.10.6",
> "2.23-6", "0.19-33", "3.8.3", "1.1.7", "1.30.0", "1.30.0", "7.3-14",
> "1.0-3", "2.13.2", "2.10.0", "1.7-11", "3.1-103", "7.3-1", "1.14.0",
> "3.1-51", "0.11.1", "7.3-3", "2.13.2", "2.13.2", "2.13.2", "2.36-10",
> "2.13.2", "1.24.0", "1.30.0", "2.13.2", "2.13.2", "1.30.0"), .Names = c("affy",
> "affyio", "affyPLM", "AnnotationDbi", "asprgdtua520520fcdf",
> "asprgdtua520520fprobe", "bac01a520746fprobe", "base", "Biobase",
> "Biostrings", "bitops", "boot", "caTools", "class", "cluster",
> "codetools", "compiler", "corpcor", "datasets", "DBI", "DynDoc",
> "e1071", "fdrtool", "foreign", "gcrma", "gdata", "GeneNet", "gplots",
> "graphics", "grDevices", "grid", "gtools", "IRanges", "KernSmooth",
> "lattice", "limma", "longitudinal", "makecdfenv", "marray", "MASS",
> "Matrix", "methods", "Mfuzz", "mgcv", "nlme", "nnet", "preprocessCore",
> "rpart", "RSQLite", "spatial", "splines", "stats", "stats4",
> "survival", "tcltk", "timecourse", "tkWidgets", "tools", "utils",
> "widgetTools"))
>
> NewPackVers<- structure(c("1.32.1", "1.22.0", "1.30.0", "1.16.11", "1.32.0",
> "0.0.1", "2.14.1", "2.14.0", "1.2.1", "2.22.0", "1.0-4.1", "1.3-4",
> "1.12", "7.3-3", "1.14.1", "0.2-8", "2.14.1", "1.6.2", "2.14.1",
> "0.2-5", "1.32.0", "1.6", "1.2.8", "0.8-48", "2.26.0", "2.8.2",
> "1.2.5", "2.10.1", "2.14.1", "2.14.1", "2.14.1", "2.6.2", "1.12.5",
> "2.23-7", "0.20-0", "3.10.2", "1.1.7", "1.32.0", "1.32.0", "7.3-16",
> "1.0-3", "2.14.1", "2.12.0", "1.7-13", "3.1-103", "7.3-1", "2.14.1",
> "1.16.0", "3.1-51", "0.11.1", "7.3-3", "2.14.1", "2.14.1", "2.14.1",
> "2.36-10", "2.14.1", "1.26.0", "1.32.0", "2.14.1", "2.14.1",
> "1.32.0", "1.0.0"), .Names = c("affy", "affyio", "affyPLM", "AnnotationDbi",
> "asprgdtua520520fcdf", "asprgdtua520520fprobe", "base", "Biobase",
> "BiocInstaller", "Biostrings", "bitops", "boot", "caTools", "class",
> "cluster", "codetools", "compiler", "corpcor", "datasets", "DBI",
> "DynDoc", "e1071", "fdrtool", "foreign", "gcrma", "gdata", "GeneNet",
> "gplots", "graphics", "grDevices", "grid", "gtools", "IRanges",
> "KernSmooth", "lattice", "limma", "longitudinal", "makecdfenv",
> "marray", "MASS", "Matrix", "methods", "Mfuzz", "mgcv", "nlme",
> "nnet", "parallel", "preprocessCore", "rpart", "RSQLite", "spatial",
> "splines", "stats", "stats4", "survival", "tcltk", "timecourse",
> "tkWidgets", "tools", "utils", "widgetTools", "zlibbioc"))
>
> The differences in structure are thus:
>
> datOld<- ReadAffy(celfile.path=subdircel)
> datOld
>> AffyBatch object
>> size of arrays=716x716 features (24 kb)
>> cdf=AsprgDTUa520520F (11186 affyids)
>> number of samples=27
>> number of genes=11186
>> annotation=asprgdtua520520f
>> notes=
> datNew<- ReadAffy(celfile.path=subdircel)
> datNew
>> AffyBatch object
>> size of arrays=716x716 features (15 kb)
>> cdf=AsprgDTUa520520F (11186 affyids)
>> number of samples=27
>> number of genes=11186
>> annotation=asprgdtua520520f
>> notes=
>
> The particular slot which causes trouble is phenoData;
> datOld at phenoData
>> An object of class "AnnotatedDataFrame"
>>   sampleNames: 1a_N6h_ t1.CEL, 1b_N6h_t2.CEL, ..., 9c_N192h_t3.CEL  (27 total)
>>   varLabels and varMetadata description:
>>     sample: arbitrary numbering
> datNew at phenoData
>> An object of class "AnnotatedDataFrame"
>>   sampleNames: 1a_N6h_ t1.CEL 1b_N6h_t2.CEL ... 9c_N192h_t3.CEL (27 total)
>>   varLabels: sample
>>   varMetadata: labelDescription
>
> Is this a likely culprit?  Another option could be how 'get' works.
> Basically my function 'get's the probe package name (the script
> follows Gillespie's doi:10.1186/1756-0500-3-81).  In the first case, I
> get
>
> get(probepackagename)
>> Object of class probetable data.frame with 121507 rows and 6 columns.
> And in the second case, I get:
>> Object of class probetable data.frame with 468384 rows and 6 columns.
>> First 3 rows are:
>>                     sequence x y    Probe.Set.Name Probe.Interrogation.Position
>> 1<seq1>  5 1<name1>                            13
>> 2<seq2>  6 1<name2>                            13
>> 3<seq3>  7 1<name3>                            13
>>    Target.Strandedness
>> 1           Antisense
>> 2           Antisense
>> 3           Antisense
> Basically, I would like to know what has changed in the newer versions
> that I have!  Could also be that the 'probepackagename' (which is
> generated earlier in the script) has changed.  This would be the
> result of an update in makeProbePackage() in AnnotationDbi.
>
> I'll keep looking for ways to get round it, but thanks in advance!
>
> Regards,
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

-- 
James W. MacDonald, M.S.
Biostatistician
Douglas Lab
University of Michigan
Department of Human Genetics
5912 Buhl
1241 E. Catherine St.
Ann Arbor MI 48109-5618
734-615-7826

**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues 



More information about the Bioconductor mailing list