[BioC] [devteam-bioc] GenomicRanges::findOverlap unexpected behaviour with 0 width ranges

Valerie Obenchain vobencha at fhcrc.org
Mon Sep 23 19:09:10 CEST 2013


Hello,

Thanks for reporting the bug. The behavior is fixed in IRanges 1.19.38 
and 1.18.4.

A zero-width range in 'query' no longer registers as a hit, regardless 
of 'maxgap' and 'minoverlap' values.

 > findOverlaps(rangeZero, rangeContains, maxgap=1) # match
Hits of length 0
queryLength: 1
subjectLength: 1

 > findOverlaps(rangeZero, rangeStartMatch, maxgap=1) # match
Hits of length 0
queryLength: 1
subjectLength: 1


Valerie

On 09/19/2013 10:54 PM, Maintainer wrote:
>
> The hits returned when using findOverlap seem strange when a zero width range is involved. By default, findOverlap does not return any hits with a zero width range, even if other range contains the zero width range. Presumably this is due the requirement of a positive, non-zero minoverlap. If a maxgap is set, a hit is returned only if the zero width range is the query.
>
> According to the documentation, "all matches must additionally satisfy the minoverlap constraint". This constraint is violated when a zero-width range is passed as the query parameter with a non-zero maxgap. I presume this is a bug, is it not?
>
> rangeZero <- IRanges(5, 4)
> rangeContains <- IRanges(3, 6)
> rangeStartMatch <- IRanges(5, 5)
> findOverlaps(rangeZero, rangeContains) # no match
> findOverlaps(rangeZero, rangeStartMatch) # no match
> findOverlaps(rangeContains, rangeZero) # no match
> findOverlaps(rangeStartMatch, rangeZero) # no match
> findOverlaps(rangeZero, rangeContains, maxgap=1) # match
> findOverlaps(rangeZero, rangeStartMatch, maxgap=1) # match
> findOverlaps(rangeContains, rangeZero, maxgap=1) # no match
> findOverlaps(rangeStartMatch, rangeZero, maxgap=1) # no match
>
>
>   -- output of sessionInfo():
>
>
>
> R version 3.0.1 (2013-05-16)
> Platform: x86_64-w64-mingw32/x64 (64-bit)
>
> locale:
> [1] LC_COLLATE=English_Australia.1252  LC_CTYPE=English_Australia.1252    LC_MONETARY=English_Australia.1252 LC_NUMERIC=C
> [5] LC_TIME=English_Australia.1252
>
> attached base packages:
> [1] parallel  stats     graphics  grDevices utils     datasets  methods   base
>
> other attached packages:
> [1] GenomicFeatures_1.12.3 AnnotationDbi_1.22.6   Biobase_2.20.1         GenomicRanges_1.12.5   IRanges_1.18.3         BiocGenerics_0.6.0
> [7] BiocInstaller_1.10.3
>
> loaded via a namespace (and not attached):
>   [1] biomaRt_2.16.0     Biostrings_2.28.0  bitops_1.0-6       BSgenome_1.28.0    DBI_0.2-7          RCurl_1.95-4.1     Rsamtools_1.12.4   RSQLite_0.11.4
>   [9] rtracklayer_1.20.4 stats4_3.0.1       tools_3.0.1        XML_3.98-1.1       zlibbioc_1.6.0
>
> --
> Sent via the guest posting facility at bioconductor.org.
>
> ________________________________________________________________________
> devteam-bioc mailing list
> To unsubscribe from this mailing list send a blank email to
> devteam-bioc-leave at lists.fhcrc.org
> You can also unsubscribe or change your personal options at
> https://lists.fhcrc.org/mailman/listinfo/devteam-bioc
>



More information about the Bioconductor mailing list