[R] Searching for Enumerated Items using str_count() from the stringr package

Tóth Dénes toth.denes at kogentum.hu
Fri Sep 29 00:02:45 CEST 2017



On 09/28/2017 10:25 PM, Dan Abner wrote:
> Hi all,
> 
> I have a large number of text strings to search for enumerated items.
> However, I am receiving this error message even though I thought that I
> properly escaped the special character closed parenthesis:
> 
> 
>> Count<-str_count(text3,keywords)
> Error in stri_count_regex(string, pattern, opts_regex = opts(pattern)) :
>    Syntax error in regexp pattern. (U_REGEX_RULE_SYNTAX)
> 
> 
> ===
> 
> Here is example code:
> 
> 
> text1<-"This is a list:
> 1) Number 1
> 2) Etc
> 3) Etc"
> 
> text2<-"This is NOT a list:
> Blah, blah, blah
> Blah, blah, blah"
> 
> text3<-c(text1,text2)
> text3
> 
> {keywords<-c(paste(0:9,"\\)"),paste(0:9,"\\)",sep=""),
> paste(0:9,"."),paste(0:9,".",sep=""),"-","*")}
> 

You should carefully read the docs, see ?regexp.
You really do not want to pass a multi-element vector as 'keywords' in 
this case, but instead:

stri_count_regex(text3, "[0-9]+\\) ")

or:

stri_count_regex(text3, "[[:digit:]]+\\) ")

BTW, I do not understand why to use the stringr package if it is just a 
wrapper around the stringi package.

Regards,
Denes




> keywords
> 
> Count<-str_count(text3,keywords)
> 
> ===
> 
> I am looking for Count<-c(3,0)
> 
> Any suggestions?
> 
> Thanks!
> 
> Dan
> 
> 	[[alternative HTML version deleted]]
> 
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
> 

-- 
Dr. Tóth Dénes ügyvezető
Kogentum Kft.
Tel.: 06-30-2583723
Web: www.kogentum.hu



More information about the R-help mailing list