Functions to assist in performing probabilistic record linkage and deduplication: generating pairs, comparing records, em-algorithm for estimating m- and u-probabilities (I. Fellegi & A. Sunter (1969) <doi:10.1080/01621459.1969.10501049>, T.N. Herzog, F.J. Scheuren, & W.E. Winkler (2007), "Data Quality and Record Linkage Techniques", ISBN:978-0-387-69502-0), forcing one-to-one matching. Can also be used for pre- and post-processing for machine learning methods for record linkage. Focus is on memory, CPU performance and flexibility.
Version: | 0.1.1 |
Depends: | data.table, R (≥ 3.6.0) |
Imports: | stringdist, stats, utils, lpSolve, Rcpp, parallel |
LinkingTo: | Rcpp |
Suggests: | simplermarkdown |
Published: | 2022-01-07 |
Author: | Jan van der Laan [aut, cre] |
Maintainer: | Jan van der Laan <r at eoos.dds.nl> |
BugReports: | https://github.com/djvanderlaan/reclin2/issues |
License: | GPL-3 |
URL: | https://github.com/djvanderlaan/reclin2 |
NeedsCompilation: | yes |
SystemRequirements: | C++11 |
Materials: | README |
CRAN checks: | reclin2 results |
Reference manual: | reclin2.pdf |
Vignettes: |
Deduplication using reclin2 Introduction to reclin2 Record linkage using machine learning Using a cluster for record linkage |
Package source: | reclin2_0.1.1.tar.gz |
Windows binaries: | r-devel: reclin2_0.1.1.zip, r-release: reclin2_0.1.1.zip, r-oldrel: reclin2_0.1.1.zip |
macOS binaries: | r-release (arm64): reclin2_0.1.1.tgz, r-oldrel (arm64): reclin2_0.1.1.tgz, r-release (x86_64): reclin2_0.1.1.tgz, r-oldrel (x86_64): reclin2_0.1.1.tgz |
Please use the canonical form https://CRAN.R-project.org/package=reclin2 to link to this page.