[R] Faster union of polygons?

Martin Morgan mtmorgan at fhcrc.org
Fri Jun 4 04:17:34 CEST 2010


On 06/03/2010 04:54 PM, Remko Duursma wrote:
> Thanks for the tip - this cleans up the code a lot!
> Unfortunately, there is no gain in speed.

Playing a little bit dirty,

punion <-
    function(...)
{
    n <- nargs()
    if (0L == n) new("gpc.poly")
    else if (1L == n && is(..1, "gpc.poly")) ..1
    else {
        polygons <- list(...)
        if (!all(sapply(polygons, is, "gpc.poly")))
            stop("'...' must all be 'gpc.poly'")
        ## avoid method look-up
        to_numeric <- selectMethod(coerce, c("gpc.poly", "numeric"))
        vec <- to_numeric(polygons[[1]])
        for (p in polygons[-1]) {
            clip <- to_numeric(p)
            vec <- .Call("Rgpc_polygon_clip", vec, clip, 3,
                         PACKAGE="gpclib")
        }
        if (identical(vec, 0))
            new("gpc.poly")
        else
            as(vec, "gpc.poly")
    }
}

is about 4x faster on your example

> replicate(5, system.time(Reduce(union, leaves)))
            [,1]  [,2]  [,3]  [,4]  [,5]
user.self  1.272 1.272 1.272 1.268 1.268
sys.self   0.000 0.000 0.000 0.000 0.000
elapsed    1.271 1.272 1.272 1.273 1.281
user.child 0.000 0.000 0.000 0.000 0.000
sys.child  0.000 0.000 0.000 0.000 0.000
> replicate(5, system.time(do.call(punion, leaves)))
            [,1]  [,2]  [,3]  [,4]  [,5]
user.self  0.308 0.312 0.304 0.308 0.312
sys.self   0.004 0.000 0.004 0.004 0.000
elapsed    0.311 0.311 0.309 0.314 0.317
user.child 0.000 0.000 0.000 0.000 0.000
sys.child  0.000 0.000 0.000 0.000 0.000

Rprof suggests that most of the time is now in the C code

> Rprof("/tmp/leaves.Rprof")
> x <- replicate(5, system.time(do.call(punion, leaves)))
> Rprof(NULL)
> summaryRprof("/tmp/leaves.Rprof")
$by.self
                     self.time self.pct total.time total.pct
".Call"                   1.24     69.7       1.24      69.7
"gc"                      0.24     13.5       0.24      13.5
"FUN"                     0.08      4.5       1.78     100.0
[...SNIP...]

Martin

> 
> remko
> 
> 
> 
> On Thu, Jun 3, 2010 at 10:46 PM, nikhil kaza <nikhil.list at gmail.com> wrote:
>> Reduce might work. Not sure about the speed advantages though. It does
>> simplify code.
>>
>>  Unionall <- function(x) Reduce('union', x)
>> leaveout <- Unionall(leaves)
>>
>>
>> On Tue, Jun 1, 2010 at 9:53 PM, Remko Duursma <remkoduursma at gmail.com>
>> wrote:
>>>
>>> Dear R-helpers,
>>>
>>> thanks for yesterday's speeding-up tip. Here is my next query:
>>>
>>> I have lots of polygons (not necessarily convex ones, and they never
>>> have holes) given by x,y coordinates.
>>>
>>> I want to get the polygon that is the union of these polygons. This is
>>> my current method, but I am hoping there is a faster method (up to
>>> thousands of polygons, each with ca. 40 xy points).
>>>
>>> Example:
>>>
>>> library(gpclib)
>>>
>>> # A polygon
>>> leaf <- structure(c(0, 1, 12.9, 16.5, 18.8, 17, 16.8, 15.5, 12.1, 8.2,
>>> 6.3, 5, 2, 0, -1.5, -4.3, -6.6, -10.3, -14.8, -19.4, -22.2, -23.5,
>>> -22.2, -17.6, -7.8, 0, 0, -2.4, 2.8, 8.9, 19.9, 33.9, 34.8, 40.4,
>>> 49.7, 69.2, 77.4, 83.4, 91.4, 99, 92.8, 87.3, 81.2, 71.1, 57.6,
>>> 45.4, 39.2, 26, 15.6, 5.3, 0.6, 0), .Dim = c(26L, 2L), .Dimnames = list(
>>>    NULL, c("X", "Y")))
>>>
>>> # Lots of polygons:
>>> releaf <-
>>> function(leaf)cbind(leaf[,1]+rnorm(1,0,50),leaf[,2]+rnorm(1,0,50))
>>> leaves <- replicate(500, releaf(leaf), simplify=FALSE)
>>>
>>> # Make into gpc.poly class:
>>> leaves <- lapply(leaves, as, "gpc.poly")
>>>
>>> # Make union .....
>>> system.time({
>>> leavesoutline <- union(leaves[[1]], leaves[[2]])
>>> for(i in 3:length(leaves))leavesoutline <- union(leavesoutline,
>>> leaves[[i]])
>>> })
>>> # about 1sec here.
>>>
>>> # Check it:
>>> plot(leavesoutline)
>>>
>>>
>>>
>>> thanks!
>>>
>>> Remko
>>>
>>>
>>> -------------------------------------------------
>>> Remko Duursma
>>> Research Lecturer
>>>
>>> Centre for Plants and the Environment
>>> University of Western Sydney
>>> Hawkesbury Campus
>>> Richmond NSW 2753
>>>
>>> Dept of Biological Science
>>> Macquarie University
>>> North Ryde NSW 2109
>>> Australia
>>>
>>> Mobile: +61 (0)422 096908
>>> www.remkoduursma.com
>>>
>>> ______________________________________________
>>> R-help at r-project.org mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide
>>> http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.
>>
>>
> 
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.


-- 
Martin Morgan
Computational Biology / Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N.
PO Box 19024 Seattle, WA 98109

Location: Arnold Building M1 B861
Phone: (206) 667-2793



More information about the R-help mailing list