[R] Logic with regexps

Gabor Grothendieck ggrothendieck at gmail.com
Sat Jun 12 16:10:13 CEST 2010


On Sat, Jun 12, 2010 at 5:38 AM, Ted Harding
<Ted.Harding at manchester.ac.uk> wrote:
> Greetings,
> The following question has come up in an off-list discussion.
> Is it possible to construct a regular expression 'rex' out of
> two given regular expressions 'rex1' and 'rex2', such that a
> character string X matches 'rex' if and only if X matches 'rex1'
> AND X does not match 'rex2'?
>
> The desired end result can be achieved by logically combining
> the results of a grep using 'rex1' with the results of a grep
> on 'rex2', illustrated by the following example:
>
> ## Given character vector X (below), and two regular exdpressions
> ## rex1="abc", rex2="ijk", to return the elements of X which match
> ## rex1 AND do not match rex1:
> X <- c(
>  "abcdefg",       # Yes
>  "abchijk",       # No
>  "mnopqrs",       # No
>  "ijkpqrs",       # No
>  "abcpqrs" )      # Yes
> rex1 <- "abc"
> rex2 <- "ijk"
> ix1<- grep(rex1,X)
> ix2<- grep(rex2,X)
> X[ix1[!(ix1 %in% ix2)]]
> ## [1] "abcdefg" "abcpqrs"
>
> Question: is there a way to construct 'rex' from 'rex1' and 'rex2'
> such that
>
>  X[grep(rex,X)]
>
> would given the same result?

Try this:

   rex <- "^(?!(.*ijk)).*abc"
   grep(rex, X, perl = TRUE)

Also note that X[grep(rex, X, perl = TRUE)] can be written:

   grep(rex, X, perl = TRUE, value = TRUE)

See ?regex for more info.  Further regular expression links can be
found in the External Links box on the gsubfn home page at
http://gsubfn.googlecode.com



More information about the R-help mailing list