There is a newer version of this record available.

Software Open Access

Extracting Duplicate Georeferences from Herbarium Specimen Data

Pearson, Katelin D.

This code was developed to rapidly georeference herbarium specimens by importing georeference data that already exist for duplicate specimens. The code steps through each record in a provided dataset and determines whether that record is found in the omoccurduplicatelink table (i.e., whether a duplicate has been linked to that record in your Symbiota portal). If a duplicate is found, the code determines whether the duplicate is georeferenced. If none of the duplicates are georeferenced, the code moves to the next specimen in the provided dataset. If a georeferenced duplicate is found, the code adds these data to a new output file (newMycoll) such that the unique identifier (occid) corresponds to the occid in the input dataset and the georeference data are copied from the duplicate record. Because a specimen may have more than one georeferenced duplicate, the user must then clean the output file so that it contains one row per occid (i.e., one row per specimen record).
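The lookup described above can be illustrated with a minimal Python sketch. This is not the published code, only an illustration of the same logic under stated assumptions: the duplicate links are supplied as rows with `occid` and `duplicateid` columns (an assumed layout for the omoccurduplicatelink table), a record counts as georeferenced when both `decimalLatitude` and `decimalLongitude` are filled in, and the function name and the `source_occid` output column are hypothetical.

```python
from collections import defaultdict

# Assumption: these two fields define a georeference.
GEO_FIELDS = ("decimalLatitude", "decimalLongitude")


def is_georeferenced(record):
    """Return True if every georeference field holds a non-empty value."""
    return all(record.get(field) not in (None, "") for field in GEO_FIELDS)


def extract_duplicate_georeferences(dataset, duplicate_links, occurrences):
    """Copy georeferences from linked duplicates onto records in `dataset`.

    dataset         -- list of dicts, each with an "occid" (the input records)
    duplicate_links -- list of dicts with "occid" and "duplicateid"
                       (assumed layout of omoccurduplicatelink)
    occurrences     -- dict mapping occid -> occurrence record (dict)

    Returns one output row per georeferenced duplicate, so an occid can
    appear more than once and the result still needs cleaning.
    """
    # Group linked occids by duplicate cluster, and index clusters by occid.
    members = defaultdict(set)
    clusters_of = defaultdict(set)
    for link in duplicate_links:
        members[link["duplicateid"]].add(link["occid"])
        clusters_of[link["occid"]].add(link["duplicateid"])

    output = []
    for record in dataset:
        occid = record["occid"]
        # Collect every other occid sharing a cluster with this record.
        candidates = set()
        for dup_id in clusters_of.get(occid, ()):
            candidates |= members[dup_id]
        candidates.discard(occid)

        # No linked or georeferenced duplicate: nothing is written.
        for other in sorted(candidates):
            duplicate = occurrences.get(other)
            if duplicate is None or not is_georeferenced(duplicate):
                continue
            row = {"occid": occid, "source_occid": other}
            row.update({field: duplicate[field] for field in GEO_FIELDS})
            output.append(row)
    return output
```

Records with no linked duplicate, or whose duplicates all lack coordinates, produce no output rows. A record with several georeferenced duplicates produces several rows, which is why the output still has to be reduced to one row per occid afterwards.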

These materials were made possible by National Science Foundation Award 1802312. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.
Files (770.8 kB)
Two files: 5.3 kB and 765.6 kB

