[R] strapply and characters adjacent to the matched pattern

mdvaan mathijsdevaan at gmail.com
Tue Jul 24 23:06:42 CEST 2012


Hi,

In the example below, one of the searched patterns "SE" is matched in the
word "second". I would like to ignore all matches in which the character
following the match is one of [:alpha:]. How do I do this without removing
the "ignore.case = T" argument of the strapply function? Thank you very
much!

# load library
require(gsubfn)
# read in data 
data <- c("Santa Fe Gold Corp|Starpharma Holdings|SE")
# define the object to be searched 
text <- c("the first is Santa Fe Gold Corp", "the second is Starpharma
Holdings") 
# match 
strapply(text, data, ignore.case = T)

The preferred outcome would be:

[[1]]
[1] "Santa Fe Gold Corp"

[[2]]
[1] "Starpharma Holdings"

instead of:

[[1]]
[1] "Santa Fe Gold Corp"

[[2]]
[1] "se"                  "Starpharma Holdings"






--
View this message in context: http://r.789695.n4.nabble.com/strapply-and-characters-adjacent-to-the-matched-pattern-tp4637673.html
Sent from the R help mailing list archive at Nabble.com.



More information about the R-help mailing list