[BioC] DiffBind error loading dba

Rory Stark Rory.Stark at cruk.cam.ac.uk
Wed Feb 12 12:50:04 CET 2014



On 12/02/2014 11:34, "Gordon Brown" <Gordon.Brown at cruk.cam.ac.uk> wrote:

>Hi, Sharona,
>
>Part of the problem is that some of your peaks have width zero (or 1,
>depending on the exact definition of "narrow" format).  For example, line
>12826 of synch_PolII_rep1_narrowPeakn.bed is
>
>---------------
>chr1	84765176	84765176	.	9.263	.	9.263	1.709270e+00	2.493753e-02	0
>---------------
>
>When DiffBind tries to normalise the score based on the width of the peak,
>it comes out as N/0, which R shows as infinity.  Things go downhill from
>there.  This is a bug in DiffBind (if the width is zero, we should just
>make it 1, for example).  But in the meantime, to work around it, just
>remove those peaks (or add 1 to the right coordinate).  (In bed format,
>intervals are represented as half-open, i.e. the right end is one past the
>end of the interval; I'll have to check the exact definition of "narrow"
>format to see if we should be interpreting the coordinates differently.)
>
>Regarding memory usage, the latest version of DiffBind is *much* more
>memory efficient than previous ones, so upgrading should take care of
>that.
>
>Cheers,
>
> - Gord
>
>
>On 2014-02-11 18:10, "Sharona Elgavish" <SharonaE at ekmd.huji.ac.il> wrote:
>
>>Dear all,
>>
>>I'm trying to upload my Chip-Seq data to DiffBind and receive the
>>following error:
>>
>>>  pol = dba(sampleSheet="pol_narrow_bam_allpeaks.csv",
>>>peakFormat="narrow")
>>
>>polsyn1 hella pol synchronized  1 narrow
>>polsyn2 hella pol synchronized  2 narrow
>>polunsyn1 hella pol unsynchronized  1 narrow
>>polunsyn2 hella pol unsynchronized  2 narrow
>>Error in if (res >= minval) { : missing value where TRUE/FALSE needed
>>
>>> sessionInfo()
>>R version 3.0.1 (2013-05-16)
>>Platform: x86_64-pc-linux-gnu (64-bit)
>>
>>locale:
>> [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
>> [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
>> [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
>> [7] LC_PAPER=C                 LC_NAME=C
>> [9] LC_ADDRESS=C               LC_TELEPHONE=C
>>[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>>
>>attached base packages:
>>[1] parallel  stats     graphics  grDevices utils     datasets  methods
>>[8] base
>>
>>other attached packages:
>>[1] DiffBind_1.6.2       Biobase_2.20.1       GenomicRanges_1.12.5
>>[4] IRanges_1.18.4       BiocGenerics_0.6.0   BiocInstaller_1.10.4
>>
>>loaded via a namespace (and not attached):
>> [1] amap_0.8-7         bitops_1.0-6       caTools_1.16       edgeR_3.2.4
>> [5] gdata_2.13.2       gplots_2.12.1      gtools_3.2.1
>>KernSmooth_2.23-10
>> [9] limma_3.16.8       RColorBrewer_1.0-5 stats4_3.0.1       tools_3.0.1
>>
>>[13] zlibbioc_1.6.0
>>
>>
>>
>>I get that error message whether I install DiffBind through the
>>Bioconductor that is, biocLite("DiffBind"), or by installing
>>DiffBind_1.8.3 by
>>install.packages(DiffBind_1.8.3.tar.gz")
>>
>>When I took only the first 4000 peaks in each of the peak files (I have
>>4) I easily manged to upload the dba. However, running dba.count was
>>successful only upon using the bUseSummarizeOverlaps=TRUE,
>>bParallel=FALSE options
>>
>>The peak files are in narrowPeaks format and were generated using the
>>program PeakSeq.
>>
>>The peak files contain 10,000-32,000 peaks
>>
>>Am I out of memory? In case I am, how can I increase the memory during
>>the upload (and not at dba.count)?
>>
>>I attached the sampleSheet and the 4 peaks files.
>>
>>
>>I would much appreciate any help
>>Thanks!
>>
>>Sharona
>>
>



More information about the Bioconductor mailing list