[R] help with regexp

Jannis bt_jannis at yahoo.de
Wed Oct 5 13:56:52 CEST 2011


Dear list members,


I am stuck with using regular expressions.


Imagine I have a vector of character strings like:

test <- c('filename_1_def.pdf', 'filename_2_abc.pdf')

How could I use regular expressions to extract only the 'def'/'abc' parts of these strings?


My own attempt yielded no results:

testresults <- grep('(?<=filename_[[:digit:]]_).{1,3}(?=.pdf)', perl = TRUE, value = TRUE)

I seem to be missing some important concept here. Until now I have always used nested sub() calls like:

testresults <- sub('\\.pdf$', '', sub('^filename_[[:digit:]]_', '', test))


but this tends to become cumbersome, and I was wondering whether there is a more elegant way to do this.
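
For illustration, here is a minimal sketch of two possible single-call alternatives. The key point is that grep() only reports which elements match (or, with value = TRUE, returns those whole elements); it never returns just the matched substring. The grep() call above also omits the vector to search. The names parts_lookaround and parts_backref are made up for this example, and the sketch assumes a version of R that provides regmatches() (I believe R 2.14.0 or later) and that every file name follows the pattern shown above.

```r
test <- c('filename_1_def.pdf', 'filename_2_abc.pdf')

# Option 1: keep the lookaround pattern, but let regexpr() locate the
# match and regmatches() pull out only the matched text.
# The dot before 'pdf' is escaped so that it matches a literal '.'.
m <- regexpr('(?<=filename_[[:digit:]]_).{1,3}(?=\\.pdf$)', test, perl = TRUE)
parts_lookaround <- regmatches(test, m)

# Option 2: a single sub() call with a capture group; '\\1' refers to
# whatever the parenthesised part of the pattern matched.
parts_backref <- sub('^filename_[[:digit:]]_(.*)\\.pdf$', '\\1', test)

print(parts_lookaround)
print(parts_backref)
```

Note that regmatches() silently drops elements without a match, whereas sub() returns non-matching strings unchanged, so the two options behave differently on unexpected input.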



Thanks for any help

Jannis




