refinr: Cluster and Merge Similar Values Within a Character Vector

These functions take a character vector as input, identify and cluster similar values, and then merge the values within each cluster so that they become identical. The functions implement the key collision and n-gram fingerprint algorithms from the open-source tool OpenRefine <https://openrefine.org/>. For details on both algorithms, see <https://openrefine.org/docs/technical-reference/clustering-in-depth>.
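
To make the two clustering ideas concrete, the following is a minimal base-R sketch of key collision and n-gram fingerprinting as described in the OpenRefine documentation. It is not the package's implementation, which is compiled and likely handles more cases. The helper names `fingerprint()`, `ngram_fingerprint()`, and `merge_by_key()` are hypothetical and exist only for this illustration, and the sketch assumes the input contains no `NA` values.

```r
# Hypothetical helper (not part of refinr): key-collision fingerprint.
# Lowercase, strip punctuation, split on whitespace, then sort and
# de-duplicate the tokens so that word order and repeats do not matter.
fingerprint <- function(x) {
  x <- gsub("[[:punct:]]", "", tolower(trimws(x)))
  tokens <- strsplit(x, "[[:space:]]+")
  vapply(tokens, function(tok) {
    paste(sort(unique(tok[nzchar(tok)])), collapse = " ")
  }, character(1))
}

# Hypothetical helper (not part of refinr): n-gram fingerprint.
# Lowercase, strip punctuation and whitespace, then sort and
# de-duplicate the character n-grams and paste them back together.
ngram_fingerprint <- function(x, n = 2) {
  x <- gsub("[[:punct:][:space:]]", "", tolower(x))
  vapply(x, function(s) {
    if (nchar(s) < n) return(s)
    starts <- seq_len(nchar(s) - n + 1)
    grams <- substring(s, starts, starts + n - 1)
    paste(sort(unique(grams)), collapse = "")
  }, character(1), USE.NAMES = FALSE)
}

# Hypothetical helper (not part of refinr): merge values that share a key.
# Each value is replaced by the most frequent original value in its cluster.
merge_by_key <- function(x, key_fun = fingerprint) {
  keys <- key_fun(x)
  groups <- split(x, keys)
  canonical <- vapply(groups, function(v) names(which.max(table(v))),
                      character(1))
  unname(canonical[match(keys, names(canonical))])
}

x <- c("Acme Pizza, Inc.", "ACME PIZZA INC", "pizza acme inc.",
       "Acme Pizza, Inc.", "Bob's Burgers")
merged <- merge_by_key(x)
print(merged)
```

In this sketch the first four values share one fingerprint and collapse to the most common spelling, while the last value is left alone. The package exposes the real functionality through `key_collision_merge()` and `n_gram_merge()`; see the reference manual for their arguments and behavior.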

Version: 0.3.3
Depends: R (≥ 3.0.2)
Imports: Rcpp, stringdist (≥ 0.9.5.1), stringi
LinkingTo: Rcpp, stringdist (≥ 0.9.5.1)
Suggests: testthat, knitr, rmarkdown, dplyr
Published: 2023-11-12
DOI: 10.32614/CRAN.package.refinr
Author: Chris Muir [aut, cre]
Maintainer: Chris Muir <chrismuirRVA at gmail.com>
BugReports: https://github.com/ChrisMuir/refinr/issues
License: GPL-3
URL: https://github.com/ChrisMuir/refinr
NeedsCompilation: yes
Materials: README NEWS
CRAN checks: refinr results

Documentation:

Reference manual: refinr.pdf
Vignettes: Intro to refinr

Downloads:

Package source: refinr_0.3.3.tar.gz
Windows binaries: r-devel: refinr_0.3.3.zip, r-release: refinr_0.3.3.zip, r-oldrel: refinr_0.3.3.zip
macOS binaries: r-release (arm64): refinr_0.3.3.tgz, r-oldrel (arm64): refinr_0.3.3.tgz, r-release (x86_64): refinr_0.3.3.tgz, r-oldrel (x86_64): refinr_0.3.3.tgz
Old sources: refinr archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=refinr to link to this page.