Published June 7, 2023 | Version CRAN_0.2.3.8
Software Open

PGRdup: Discover Probable Duplicates in Plant Genetic Resources Collections

Description

Provides functions to aid the identification of probable/possible duplicates in Plant Genetic Resources (PGR) collections using 'passport databases' comprising of information records of each constituent sample. These include methods for cleaning the data, creation of a searchable Key Word in Context (KWIC) index of keywords associated with sample records and the identification of nearly identical records with similar information by fuzzy, phonetic and semantic matching of keywords.

Notes

To cite package "PGRdup" in publications use:

Files

aravind-j/PGRdup-CRAN_0.2.3.8.zip

Files (1.7 MB)

Name Size Download all
md5:9a4129c63e749f6372965bc674b99e70
1.7 MB Preview Download

Additional details

Related works