Published March 25, 2021 | Version 2.0
Software Open

Extracting Duplicate Georeferences from Herbarium Specimen Data

  • 1. Cal Poly State University, San Luis Obispo

Description

New version! The March 2021 (2.0) version includes a more informative georeferenceRemarks field in the output table, as well as other improvements. The code documentation has been updated accordingly.

This code was developed to rapidly georeference herbarium specimens by importing georeference data that already exist for duplicate specimens. The code examines each record in a provided dataset and determines whether that record appears in the omoccurduplicatelink table (i.e., a duplicate has been linked to that record in your Symbiota portal). If a georeferenced duplicate is found, the code adds these data to a new output file (newMycoll) in which the unique identifier (occid), catalog number, other catalog number, collector, and collector number correspond to those fields in the input dataset, and the georeference data are copied from the duplicate record. The user must then clean the output file so that it contains one row per specimen record.
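The matching logic described above can be illustrated with a minimal Python sketch. This is not the published code: the function name, the in-memory list-of-dicts inputs, and the field names (catalogNumber, otherCatalogNumbers, recordedBy, recordNumber, decimalLatitude, decimalLongitude, georeferenceRemarks, and a duplicateid column that groups linked records) are assumptions made for illustration, and each record is assumed to belong to at most one duplicate cluster.

```python
from collections import defaultdict

# Assumed field names; adjust to match the columns in your own export.
IDENTITY_FIELDS = ["occid", "catalogNumber", "otherCatalogNumbers",
                   "recordedBy", "recordNumber"]
GEOREF_FIELDS = ["decimalLatitude", "decimalLongitude", "georeferenceRemarks"]


def is_georeferenced(record):
    """Treat a record as georeferenced when both coordinates are present."""
    return all(record.get(f) not in (None, "")
               for f in ("decimalLatitude", "decimalLongitude"))


def extract_duplicate_georeferences(my_records, duplicate_links, all_records):
    """Return one output row per (input record, georeferenced duplicate) pair.

    Identity fields come from the input record; georeference fields are
    copied from the duplicate.
    """
    # Map each occid to its duplicate cluster, and each cluster to its members.
    cluster_of = {}
    members = defaultdict(list)
    for link in duplicate_links:
        cluster_of[link["occid"]] = link["duplicateid"]
        members[link["duplicateid"]].append(link["occid"])

    by_occid = {r["occid"]: r for r in all_records}

    output = []
    for rec in my_records:
        cluster = cluster_of.get(rec["occid"])
        if cluster is None:
            continue  # no duplicate has been linked to this record
        for other_id in members[cluster]:
            if other_id == rec["occid"]:
                continue  # skip the record itself
            dup = by_occid.get(other_id)
            if dup is None or not is_georeferenced(dup):
                continue
            row = {f: rec.get(f) for f in IDENTITY_FIELDS}
            row.update({f: dup.get(f) for f in GEOREF_FIELDS})
            output.append(row)
    return output


# Hypothetical example data: record 1 is ours and lacks coordinates;
# records 2 and 3 are georeferenced duplicates held by other collections.
portal_records = [
    {"occid": 1, "catalogNumber": "A001", "recordedBy": "J. Doe",
     "recordNumber": "55", "decimalLatitude": "", "decimalLongitude": ""},
    {"occid": 2, "catalogNumber": "B777", "recordedBy": "J. Doe",
     "recordNumber": "55", "decimalLatitude": 35.30,
     "decimalLongitude": -120.66, "georeferenceRemarks": "from label"},
    {"occid": 3, "catalogNumber": "C123", "recordedBy": "J. Doe",
     "recordNumber": "55", "decimalLatitude": 35.31,
     "decimalLongitude": -120.65, "georeferenceRemarks": "from map"},
    {"occid": 4, "catalogNumber": "A002", "recordedBy": "A. Roe",
     "recordNumber": "9", "decimalLatitude": "", "decimalLongitude": ""},
]
duplicate_links = [
    {"occid": 1, "duplicateid": 10},
    {"occid": 2, "duplicateid": 10},
    {"occid": 3, "duplicateid": 10},
]
my_collection = [portal_records[0], portal_records[3]]

new_mycoll = extract_duplicate_georeferences(
    my_collection, duplicate_links, portal_records)
for row in new_mycoll:
    print(row)
```

In this hypothetical example, record 1 has two georeferenced duplicates and therefore yields two output rows, which shows one way the output can end up with more than one row per specimen before the manual cleaning step.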

Notes

These materials were made possible by National Science Foundation Award 1802312. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Files

ExtractingDuplicateGeoreferencesDocumentation_v2.pdf

Files (441.0 kB)

Name Size
md5:39a27fafb3da07e7dab3a27b82189c2e 6.3 kB
md5:2a2676ddb596571a16518e7e631d8c03 434.6 kB