[BioC] Missing Autocorrelation in Ringo

Torsten Waldminghaus Torsten.Waldminghaus at rr-research.no
Wed Jul 1 14:26:11 CEST 2009


Hi Joern,

I did do some updating. This is how it looks like now:

> sessionInfo()
R version 2.9.1 (2009-06-26) 
i386-pc-mingw32 

locale:
LC_COLLATE=Norwegian (Bokmål)_Norway.1252;LC_CTYPE=Norwegian (Bokmål)_Norway.1252;LC_MONETARY=Norwegian (Bokmål)_Norway.1252;LC_NUMERIC=C;LC_TIME=Norwegian (Bokmål)_Norway.1252

attached base packages:
[1] grid      stats     graphics  grDevices datasets  utils     methods   base     

other attached packages:
[1] Ringo_1.9.5        Matrix_0.999375-29 lattice_0.17-25    limma_2.18.2       RColorBrewer_1.0-2
[6] Biobase_2.4.1     

loaded via a namespace (and not attached):
[1] annotate_1.22.0     AnnotationDbi_1.6.1 DBI_0.2-4           genefilter_1.24.2   RSQLite_0.7-1      
[6] splines_2.9.1       survival_2.35-4     xtable_1.5-5       


I coul now also find and use the extractProbeAnno function but get this:

> test<-extractProbeAnno(RG)
Creating probeAnno mapping for chromosome Error in split.default(probeindex, as.factor(hits[[chrNameColumn]])) : 
  Group length is 0 but data length > 0
In addition: Warning message:
In extractProbeAnno(RG) :
  Some reporters had no or an unrecognized genome position in RG$genes$SystematicName.


You said that the function works with an RGlist object if it" encodes the probe position in a colum called "SystematicName"". This is the case but it looks like this: 

> RG$genes$SystematicName[100]
[1] "CGH_U00096_943375_943434_a"

Is this what you ment?

I also tried the autocorraltion again as before but did get the same strange result. Could it be that it is a problem that the data in the RG file are not ordered? I had problems with that when I used some other peak detection because the program looked just at the next entry without considering the probe position. My DNA fragments where 1000bp and smaller with a peak at 500bp.

About the judging of quality of found peaks, could you explain what " resort to visualizations" means and how I would do this practically.

Many thanks,
Torsten

 




-----Original Message-----
From: Joern Toedling [mailto:Joern.Toedling at curie.fr] 
Sent: 30. juni 2009 16:40
To: Torsten Waldminghaus; bioconductor at stat.math.ethz.ch
Subject: Re: [BioC] Missing Autocorrelation in Ringo

Hi Torsten,

I suggest that you first update your R and Bioconductor installation as you
are using very out-dated versions of both (R is currently 2.9.0 and for Ringo
please use the current development version (1.9.5), which can be downloaded as
a binary from here:
http://www.bioconductor.org/packages/devel/bioc/html/Ringo.html
). In the current version of Ringo, you will also find the function
extractProbeAnno. And indeed such a low degree of autocorrelation is
unexpected. What was the length distribution of fragments after sonication?
Typically you would expect some auto-correlation at least up to that length.
This factor is also important for setting the sliding-window width for
smoothing the signal prior to peak detection. However, your data objects look
alright, so I am afraid that I do not immediately see the problem.

For judging the quality of the found "peaks", you can resort to visualizations
of a.) the highest-ranking peaks b.) the lowest-ranking peaks c.) positive
control regions where you would expect peaks d.) negative control-regions
where no enrichment is to be expected.
Usually the "null distribution" of smoothed probe levels under non-enrichment
can only roughly be estimated from the data based on certain simplifying
assumptions, so I would not lend too much confidence to p-values and FDRs from
such estimates.

Regards,
Joern


On Tue, 30 Jun 2009 14:04:31 +0200, Torsten Waldminghaus wrote
> Hi,
> 
> thanks for the answer and sorry for not making it more clear. I get 
> a plot but it suggests that there is no autocorrelation. This means 
> the column at point "0" is at 1.0 as usual but at "100", "200",... 
> there are only dots which vary a bit in their size. Now the results 
> you were interested in:
> 
> > str(exAc)
> Class 'autocor.result'  atomic [1:11]  1.00000 -0.00469 -0.00660 
>  0.00538  0.00652 ...  ..- attr(*, "chromosome")= chr "1"
> 
> > ls(annoObject)
> [1] "1.end"    "1.index"  "1.start"  "1.unique"
> 
> > head(annoObject["1.start"])
> [1]  68 154 189 294 365 440
> 
> I tried the function extractProbeAnno but it does actually not seem 
> to be there. R did not find it and I did also not find a help entry 
> for it:
> 
> > help(extractProbeAnno)
> No documentation for 'extractProbeAnno' in specified packages and libraries:
> you could try 'help.search("extractProbeAnno")'
> 
> However, I could use for example posToProbeAnno and find the 
> corresponding help page?!
> 
> Here is the session info:
> 
> > sessionInfo()
> R version 2.7.2 (2008-08-25) 
> i386-pc-mingw32
> 
> locale:
> LC_COLLATE=Norwegian (Bokmål)_Norway.1252;LC_CTYPE=Norwegian (Bokmål)
> _Norway.1252;LC_MONETARY=Norwegian (Bokmål)
> _Norway.1252;LC_NUMERIC=C;LC_TIME=Norwegian (Bokmål)_Norway.1252
> 
> attached base packages:
> [1] splines   tools     stats     graphics  grDevices utils    
>  datasets  methods   base
> 
> other attached packages:
>  [1] Ringo_1.4.0          SparseM_0.78         RColorBrewer_1.0-2  
>  vsn_3.6..0            affy_1.18.2          [6] preprocessCore_1.2.1 
> affyio_1.8.1         geneplotter_1.18.0   annotate_1.18.0      
> xtable_1.5-4        [11] AnnotationDbi_1.2.2  RSQLite_0.7-1        
> DBI_0.2-4            lattice_0.17-13      genefilter_1.20.1   [16] 
> survival_2.34-1      Biobase_2.0.1        limma_2.14.7
> 
> loaded via a namespace (and not attached):
> [1] grid_2.7.2         KernSmooth_2.22-22
> 
> Beyond the problems with getting the autocorrelation I was wondering 
> if there is a good way to judge the quality of Ringo peak detection 
> or the probability of getting wrong hits or loosing some?
> 
> Thanks for any help,
> Torsten



More information about the Bioconductor mailing list