fastLink: Fast Probabilistic Record Linkage with Missing Data

Implements a Fellegi-Sunter probabilistic record linkage model that allows for missing data and the inclusion of auxiliary information. This includes functionalities to conduct a merge of two datasets under the Fellegi-Sunter model using the Expectation-Maximization algorithm. In addition, tools for preparing, adjusting, and summarizing data merges are included. The package implements methods described in Enamorado, Fifield, and Imai (2017) ”Using a Probabilistic Model to Assist Merging of Large-scale Administrative Records”, available at <>.

Version: 0.1.1
Depends: R (≥ 3.1.0)
Imports: Matrix, parallel, foreach, doParallel, gtools, data.table, stringdist, stringr, Rcpp (≥ 0.12.9), FactoClass, adagio, dplyr
LinkingTo: RcppArmadillo, Rcpp, RcppEigen
Published: 2017-07-11
Author: Ted Enamorado [aut, cre], Ben Fifield [aut], Kosuke Imai [aut]
Maintainer: Ted Enamorado <fastlinkr at>
License: GPL (≥ 3)
NeedsCompilation: yes
CRAN checks: fastLink results


Reference manual: fastLink.pdf
Package source: fastLink_0.1.1.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X El Capitan binaries: r-release: fastLink_0.1.1.tgz
OS X Mavericks binaries: r-oldrel: fastLink_0.1.1.tgz
Old sources: fastLink archive


Please use the canonical form to link to this page.