[BioC] chipAlongChrom problem in Ringo

Jianping Jin jjin at unc.edu
Thu Jan 22 19:22:16 CET 2009


X was created as the following:
MAs <- lapply(RGs, function(thisRG) 
preprocess(thisRG[thisRG$genes$Status=="Probe",],
method="nimblegen", returnMAList=TRUE))
MA <- do.call("rbind", MAs)
X <- asExprSet(MA)
sampleNames(X) <- paste(X$Cy5, X$Feed,sep=".")

> str(probeAnno["5.index"])
 chr [1:21152] "5313_0084_0040" "5313_0760_0290" "5313_0060_0330" ...
> str(probeAnno["5.start"])
 int [1:21152] 3000580 3000665 3000780 3000860 3000975 3001082 3001162 
3001282 3001362 3001462 ...
> str(probeAnno["5.end"])
 int [1:21152] 3000638 3000718 3000837 3000909 3001035 3001138 3001215 
3001339 3001422 3001523 ...
> str(probeAnno["1.index"])
 chr [1:20363] "5313_0105_0555" "5313_0211_0843" "5313_0492_0238" ...
> str(probeAnno["1.start"])
 int [1:20363] 3521489 3521564 3521689 3521764 3521874 3521969 3522064 
3522164 3660593 3660683 ...
> str(probeAnno["1.end"])
 int [1:20363] 3521542 3521613 3521738 3521813 3521923 3522018 3522113 
3522213 3660642 3660732 ...
> str(featureNames(X))
 chr [1:1495582] "MM5000P00314500" "MM5000P00286227" ...
> str(featureNames(X))
 chr [1:1121049] "CHR10FS100018407" "CHR10FS100018507" ...
> featureNames(X)[1:30]
 [1] "CHR10FS100018407" "CHR10FS100018507" "CHR10FS100018612" 
"CHR10FS100018717"
 [5] "CHR10FS100018817" "CHR10FS100018927" "CHR10FS100019007" 
"CHR10FS100019103"
 [9] "CHR10FS100019228" "CHR10FS100019328" "CHR10FS100019428" 
"CHR10FS100019503"
[13] "CHR10FS100019603" "CHR10FS100019703" "CHR10FS100019824" 
"CHR10FS100019927"
[17] "CHR10FS100020027" "CHR10FS100020117" "CHR10FS010119532" 
"CHR10FS010119652"
[21] "CHR10FS010119737" "CHR10FS010119840" "CHR10FS010119947" 
"CHR10FS010120052"
[25] "CHR10FS010120142" "CHR10FS010120232" "CHR10FS010120343" 
"CHR10FS010120453"
[29] "CHR10FS010120552" "CHR10FS010120652"
> str(X)
Formal class 'ExpressionSet' [package "Biobase"] with 6 slots
  ..@ assayData        :<environment: 0x1723efb8>
  ..@ phenoData        :Formal class 'AnnotatedDataFrame' [package 
"Biobase"] with 4 slots
  .. .. ..@ varMetadata      :'data.frame':     8 obs. of  2 variables:
  .. .. .. ..$ varLabel        : Factor w/ 8 levels 
"Cy3","Cy5","DESIGN_NAME",..: 8 5 6 3 7 1 2 4
  .. .. .. ..$ labelDescription: chr [1:8] NA NA NA NA ...
  .. .. ..@ data             :'data.frame':     2 obs. of  8 variables:
  .. .. .. ..$ SlideNumber   : int [1:2] 19657602 20613302
  .. .. .. ..$ FileNameCy3   : chr [1:2] "19657602_532.pair" 
"20613302_532.pair"
  .. .. .. ..$ FileNameCy5   : chr [1:2] "19657602_635.pair" 
"20613302_635.pair"
  .. .. .. ..$ DESIGN_NAME   : chr [1:2] "2007-02-27_MM8_CpG_Island_Pro" 
"2007-02-27_MM8_CpG_Island_Pro"
  .. .. .. ..$ SAMPLE_SPECIES: chr [1:2] "B6" "B6"
  .. .. .. ..$ Cy3           : chr [1:2] "SPF1-input" "GF2-input"
  .. .. .. ..$ Cy5           : chr [1:2] "SPF1-ChIP" "GF1-ChIP"
  .. .. .. ..$ Feed          : chr [1:2] "normFed" "germFree"
  .. .. ..@ dimLabels        : chr [1:2] "sampleNames" "sampleColumns"
  .. .. ..@ .__classVersion__:Formal class 'Versions' [package "Biobase"] 
with 1 slots
  .. .. .. .. ..@ .Data:List of 1
  .. .. .. .. .. ..$ : int [1:3] 1 1 0
  ..@ featureData      :Formal class 'AnnotatedDataFrame' [package 
"Biobase"] with 4 slots
  .. .. ..@ varMetadata      :'data.frame':     7 obs. of  1 variable:
  .. .. .. ..$ labelDescription: chr [1:7] NA NA NA NA ...
  .. .. ..@ data             :'data.frame':     1121049 obs. of  7 
variables:
  .. .. .. ..$ GENE_EXPR_OPTION: Factor w/ 2 levels "BLOCK1","RANDOM": 1 1 
1 1 1 1 1 1 1 1 ...
  .. .. .. ..$ PROBE_ID        :
 Factor w/ 389307 levels "CHR01FS003521489",..: 225730 225731 225732 225733 
225734 225735 225736 225737 225738 225739 ...
  .. .. .. ..$ POSITION        : int [1:1121049] 100018407 100018507 
100018612 100018717 100018817 100018927 100019007 100019103 100019228 
100019328 ...
  .. .. .. ..$ X               : int [1:1121049] 694 181 365 233 96 314 735 
330 640 379 ...
  .. .. .. ..$ Y               : int [1:1121049] 524 959 429 425 22 286 487 
138 636 11 ...
  .. .. .. ..$ Status          : chr [1:1121049] "Probe" "Probe" "Probe" 
"Probe" ...
  .. .. .. ..$ ID              : chr [1:1121049] "CHR10FS100018407" 
"CHR10FS100018507" "CHR10FS100018612" "CHR10FS100018717" ...
  .. .. ..@ dimLabels        : chr [1:2] "rowNames" "columnNames"
  .. .. ..@ .__classVersion__:Formal class 'Versions' [package "Biobase"] 
with 1 slots
  .. .. .. .. ..@ .Data:List of 1
  .. .. .. .. .. ..$ : int [1:3] 1 1 0
  ..@ experimentData   :Formal class 'MIAME' [package "Biobase"] with 13 
slots
  .. .. ..@ name             : chr ""
  .. .. ..@ lab              : chr ""
  .. .. ..@ contact          : chr ""
  .. .. ..@ title            : chr ""
  .. .. ..@ abstract         : chr ""
  .. .. ..@ url              : chr ""
  .. .. ..@ pubMedIds        : chr ""
  .. .. ..@ samples          : list()
  .. .. ..@ hybridizations   : list()
  .. .. ..@ normControls     : list()
  .. .. ..@ preprocessing    : list()
  .. .. ..@ other            : list()
  .. .. ..@ .__classVersion__:Formal class 'Versions' [package "Biobase"] 
with 1 slots
  .. .. .. .. ..@ .Data:List of 1
  .. .. .. .. .. ..$ : int [1:3] 1 0 0
  ..@ annotation       : chr(0)
  ..@ .__classVersion__:Formal class 'Versions' [package "Biobase"] with 1 
slots
  .. .. ..@ .Data:List of 4
  .. .. .. ..$ : int [1:3] 2 8 1
  .. .. .. ..$ : int [1:3] 2 2 1
  .. .. .. ..$ : int [1:3] 1 1 0
  .. .. .. ..$ : int [1:3] 1 0 0
>
> dim(X)
Features  Samples
 1121049        2

Jianping

--On Thursday, January 22, 2009 6:11 PM +0000 Joern Toedling 
<toedling at ebi.ac.uk> wrote:

>
>
> Jianping Jin wrote:
>> table(probeAnno["5.index"] %in% featureNames(X))
>>
>> FALSE
>> 21152
>
> well, none of the probes seems to be included in the names of the probes
> on your arrays.
> How did you create the object "X"? Can you show me the output of
> str(probeAnno["5.index"])
> str(probeAnno["5.start"])
> str(probeAnno["5.end"])
> ## and for comparison:
> str(probeAnno["1.index"])
> str(probeAnno["1.start"])
>
> # And what is
> str(featureNames(X))
> featureNames(X)[1:30]
> str(X)
> dim(X)
>
>
>>
>> What are feature names in probeAnno? Actually I used a new output file
>> from the script you edited to generate probeAnno with no complain:
>>
>> probeAnno <-
>> posToProbeAnno("C:/from_DriveD/Chip-chip/Bultman/allChromExonerateOut_ne
>> w.txt")
>>
>>    Creating probeAnno mapping for chromosome 1 10 11 12 13 14 15 16 17
>> 18 19 2 3 4 5 6 7 8 9 X Y Done.
>> allChrs <- chromosomeNames(probeAnno)
>> genome(probeAnno) <- "M. musculus (mm9)"
>> arrayName(probeAnno) <- "2007-02-27_MM8_CpG_Island_Pro_Tiling"
>
> This looks alright to me. I think the function is working, but somehow
> the probeAnno does not seem to match your data ExpressionSet.
>
> Maybe can you also provide me with the first few lines of this file,
> just to be sure?
> C:/from_DriveD/Chip-chip/Bultman/allChromExonerateOut_new.txt
>
> Joern



##################################
Jianping Jin Ph.D.
Bioinformatics scientist
Center for Bioinformatics
Room 3133 Bioinformatics building
CB# 7104, Campus
Phone: (919)843-6105
FAX:   (919)843-3103
E-Mail: jjin at unc.edu



More information about the Bioconductor mailing list