stringdist: Approximate String Matching and String Distance Functions

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (damerau-levenshtein, hamming, levenshtein, optimal sting alignment), qgrams (q-gram, cosine, jaccard distance) or heuristic metrics (jaro, jaro-winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences.

Version: 0.9.3
Depends: R (≥ 2.15.3)
Imports: parallel
Suggests: testthat
Published: 2015-08-21
Author: Mark van der Loo [aut, cre], Jan van der Laan [ctb], R Core Team [ctb], Nick Logan [ctb]
Maintainer: Mark van der Loo <mark.vanderloo at>
License: GPL-3
NeedsCompilation: yes
Citation: stringdist citation info
Materials: NEWS
In views: OfficialStatistics
CRAN checks: stringdist results


Reference manual: stringdist.pdf
Package source: stringdist_0.9.3.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X Snow Leopard binaries: r-release: stringdist_0.9.3.tgz, r-oldrel: stringdist_0.9.0.tgz
OS X Mavericks binaries: r-release: stringdist_0.9.3.tgz
Old sources: stringdist archive

Reverse dependencies:

Reverse depends: brewdata, vwr
Reverse imports: lintr, PGRdup, qdap, tcR
Reverse suggests: rlist, sjmisc, sprint, statar