Data pre-processing

DataClean

Clean PGR passport data.

MergeKW MergePrefix MergeSuffix

Merge keyword strings

Generation of KWIC Index

KWIC

Create a KWIC Index

Probable duplicate set retrieval

ProbDup

Identify Probable Duplicates of Accessions

Set review, modification & validation

DisProbDup

Get disjoint probable duplicate sets

ReviewProbDup

Retrieve probable duplicate set information from PGR passport database for review

ReconstructProbDup

Reconstruct an object of class ProbDup

Adjuncts/Helper functions

ValidatePrimKey

Validate if a data frame column confirms to primary key/ID constraints

DoubleMetaphone

Double Metaphone Phonetic Algorithm

read.genesys

Convert 'Darwin Core - Germplasm' zip archive to a flat file

KWCounts

Generate keyword counts

ParseProbDup

Parse an object of class ProbDup to a data frame.

SplitProbDup

Split an object of class ProbDup.

MergeProbDup

Merge two objects of class ProbDup.

AddProbDup

Add probable duplicate sets fields to the PGR passport database

ViewProbDup

Visualize the probable duplicate sets retrieved in a ProbDup object

print

Prints summary of KWIC object.

print

Prints summary of ProbDup object.

Dataset

data

Sample groundnut PGR passport data