[R] string edit distance

Tobias Verbeke tobias.verbeke at gmail.com
Sat Apr 7 22:14:15 CEST 2007


Thomas Hills wrote:
> I have a column of words, for example
>
> "DOG"
> "DOOG"
> "GOD"
> "GOOD"
> "DOOR"
> ...
>
> and I am interested in creating a matrix that contains the string  
> edit distances between each pair of words.  I am this close  -> '  '   
> <-   to writing the algorithm myself (which will allow for different  
> variations on the string edit rules, indels, plus or minus  
> transpositions, and possibly some variations on that), but I figured  
> I'd see if anyone on the list has any experience with this and might  
> already have some shoulders for me to stand on.
>   
See
    
http://wiki.r-project.org/rwiki/doku.php?id=tips:data-strings:levenshtein
for some R code which might be useful.

HTH,
Tobias

-- 

Tobias Verbeke - Consultant
Business & Decision Benelux
Rue de la révolution 8
1000 Brussels - BELGIUM

+32 499 36 33 15
tobias.verbeke at businessdecision.com



More information about the R-help mailing list