PGRdup: Discover Probable Duplicates in Plant Genetic Resources Collections

Provides functions to aid the identification of probable/possible duplicates in Plant Genetic Resources (PGR) collections using 'passport databases' comprising information records of each constituent sample. These include methods for cleaning the data, creating a searchable Key Word in Context (KWIC) index of keywords associated with sample records, and identifying nearly identical records with similar information by fuzzy, phonetic and semantic matching of keywords.
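
The workflow described above (clean the text, index the keywords, then match near-identical keywords) can be illustrated with a small language-neutral sketch. The Python below is not PGRdup's R API and is not how the package is implemented. The function names (`clean`, `build_index`, `fuzzy_pairs`), the sample records and the distance threshold are all hypothetical, chosen only for illustration. It builds a simple keyword-to-record index (a full KWIC index would also keep the surrounding context of each keyword) and covers fuzzy matching only, using Levenshtein edit distance.

```python
from collections import defaultdict


def clean(text):
    """Upper-case the text and split on anything that is not a letter or digit.

    A rough stand-in for a data-cleaning step (hypothetical, for illustration).
    """
    return "".join(ch if ch.isalnum() else " " for ch in text.upper()).split()


def build_index(records):
    """Map each cleaned keyword to the set of record IDs that contain it."""
    index = defaultdict(set)
    for rec_id, text in records.items():
        for keyword in clean(text):
            index[keyword].add(rec_id)
    return index


def levenshtein(a, b):
    """Edit distance between two strings (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                # deletion
                            curr[j - 1] + 1,            # insertion
                            prev[j - 1] + (ca != cb)))  # substitution
        prev = curr
    return prev[-1]


def fuzzy_pairs(index, max_dist=1):
    """Return sorted pairs of record IDs whose keywords are within max_dist edits."""
    keywords = sorted(index)
    pairs = set()
    for i, k1 in enumerate(keywords):
        for k2 in keywords[i:]:
            if levenshtein(k1, k2) <= max_dist:
                for r1 in index[k1]:
                    for r2 in index[k2]:
                        if r1 != r2:
                            pairs.add(tuple(sorted((r1, r2))))
    return sorted(pairs)


# Hypothetical passport records: record ID -> free-text name field.
records = {
    "A1": "Shulamith; NRCG-14555",
    "A2": "Shulamit",
    "A3": "ICG 221",
}

print(fuzzy_pairs(build_index(records)))
```

In this sketch the first two records are flagged as a probable duplicate pair because their name keywords differ by a single character, while the third record shares no near-identical keyword with either. Comparing every keyword with every other is quadratic in the number of keywords, so a real collection would need a more efficient strategy, and very short keywords would need a stricter threshold to avoid spurious matches.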

Version: 0.2.3.2
Depends: R (≥ 3.0.2)
Imports: data.table (≥ 1.9.3), igraph, stringdist (≥ 0.9.4), stringi, ggplot2, grid, gridExtra, methods, utils, stats
Suggests: diagram, wordcloud, microbenchmark, XML, knitr, rmarkdown
Published: 2017-08-04
Author: J. Aravind [aut, cre], J. Radhamani [aut], Kalyani Srinivasan [aut], B. Ananda Subhash [aut], R. K. Tyagi [aut], ICAR-NBPGR [cph], Maurice Aubrey [ctb], Kevin Atkinson [ctb], Lawrence Philips [ctb]
Maintainer: J. Aravind <j.aravind at icar.gov.in>
BugReports: https://github.com/aravind-j/PGRdup/issues
License: GPL-2 | GPL-3
Copyright: 2014-2017, ICAR-NBPGR
URL: https://github.com/aravind-j/PGRdup
NeedsCompilation: yes
Citation: PGRdup citation info
Materials: README NEWS
CRAN checks: PGRdup results

Downloads:

Reference manual: PGRdup.pdf
Vignettes: Introduction
Package source: PGRdup_0.2.3.2.tar.gz
Windows binaries: r-devel: PGRdup_0.2.3.2.zip, r-release: PGRdup_0.2.3.2.zip, r-oldrel: PGRdup_0.2.3.2.zip
OS X El Capitan binaries: r-release: PGRdup_0.2.3.2.tgz
OS X Mavericks binaries: r-oldrel: PGRdup_0.2.3.2.tgz
Old sources: PGRdup archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=PGRdup to link to this page.