fastLink: Fast Probabilistic Record Linkage with Missing Data

Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionalities to conduct a merge of two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. In addition, tools for preparing, adjusting, and summarizing data merges are included. The package implements methods described in Enamorado, Fifield, and Imai (2017) ”Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records”, available at <>.

Version: 0.2.0
Depends: R (≥ 2.14.0)
Imports: Matrix, parallel, foreach, doParallel, gtools, data.table, stringdist, stringr, stringi, Rcpp (≥ 0.12.7), FactoClass, adagio, dplyr, plotrix, grDevices, graphics
LinkingTo: RcppArmadillo, Rcpp, RcppEigen
Published: 2017-09-01
Author: Ted Enamorado [aut, cre], Ben Fifield [aut], Kosuke Imai [aut]
Maintainer: Ted Enamorado <fastlinkr at>
License: GPL (≥ 3)
NeedsCompilation: yes
CRAN checks: fastLink results


Reference manual: fastLink.pdf
Package source: fastLink_0.2.0.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X El Capitan binaries: r-release: fastLink_0.2.0.tgz
OS X Mavericks binaries: r-oldrel: fastLink_0.2.0.tgz
Old sources: fastLink archive


Please use the canonical form to link to this page.