[R] Omitting repeated occurrence in a string

Eik Vettorazzi E.Vettorazzi at uke.de
Wed Feb 6 20:48:12 CET 2013


Hi Christofer,
what is the rule for leaving "ah" untouched, given that it is also repeated in Text?
The following might be a start:

Text <- "ahsgdvasgAbcabcsdahj"
#finds the first repetition of a substring of length 2 or more, here "ah"
gsub("(?i)([a-z]{2,})(.*)\\1","\\1\\2",Text,perl=TRUE)
#gives "ahsgdvasgAbcabcsdj"
#finds repetitions of substrings of length 3 or more, here "Abc"
gsub("(?i)([a-z]{3,})(.*)\\1","\\1\\2",Text,perl=TRUE)
#gives "ahsgdvasgAbcsdahj"
#finds only directly adjacent repetitions of substrings of length 2 or more
gsub("(?i)([a-z]{2,})\\1","\\1",Text,perl=TRUE)
#gives "ahsgdvasgAbcsdahj"
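
One caveat with the patterns that use (.*): each match drops only one later copy of the captured substring, so a string with three or more copies can still contain a repeat after a single pass. A minimal sketch of one way around that, assuming it is acceptable to reapply the pattern until nothing changes (the helper name drop_repeats and its min_len argument are made up for illustration, not part of base R):

```r
# Hypothetical helper: reapply the backreference pattern until the
# string stops changing, so that every later copy is removed.
drop_repeats <- function(x, min_len = 3) {
  # same pattern as above, with the minimum length filled in
  pattern <- sprintf("(?i)([a-z]{%d,})(.*)\\1", min_len)
  repeat {
    y <- sub(pattern, "\\1\\2", x, perl = TRUE)
    if (identical(y, x)) return(x)  # no repeat left
    x <- y
  }
}

Text <- "ahsgdvasgAbcabcsdahj"
drop_repeats(Text)            # "ahsgdvasgAbcsdahj"

# three copies of "abc": one gsub pass leaves a repeat behind
gsub("(?i)([a-z]{3,})(.*)\\1", "\\1\\2", "abcXabcYabc", perl = TRUE)  # "abcXabcY"
drop_repeats("abcXabcYabc")   # "abcXY"
```

With min_len = 2 the helper would also strip the second "ah" and other short repeats, so the minimum length still has to encode whatever rule is intended.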

hth.

On 06.02.2013 17:46, Christofer Bogaso wrote:
> Hello again,
> 
> I am looking for a way to delete repeated occurrences of a substring in
> a string. Let's say I have the following string:
> 
> Text <- "ahsgdvasgAbcabcsdahj"
> 
> Here you see "Abc" appears twice. But I want to keep only 1
> occurrence. Therefore I need that:
> 
> Text_result <- "ahsgdvasgAbcsdahj" (i.e. the first one).
> 
> Can somebody tell me whether this is possible with some R function?
> 
> Thanks and regards,
> 


-- 
Eik Vettorazzi

Department of Medical Biometry and Epidemiology
University Medical Center Hamburg-Eppendorf

Martinistr. 52
20246 Hamburg

T ++49/40/7410-58243
F ++49/40/7410-57790
