Published January 15, 2024 | Version v1
Software Open

Protocol and R script for assembling a quality-controlled occurrence dataset

Description

This occurrence data assembly and cleaning protocol was put together for the Legume Occurrences Working Group (https://www.legumedata.org/working-groups/occurrences/).

The two most important files are 'Protocol for assembling occurrence datasets.docx', which explains the rationale behind the data download and cleaning processes, and 'GBIF quality control protocol.R', which provides step-by-step R functions to assemble a cleaned occurrence dataset. The R script is primarily written for data downloaded from GBIF, but with some minor tweaks will also work on data from other sources.

If you use this protocol or script for a publication, it would be appreciated if you could cite the paper that provided the first version of this script, Ringelberg et al. 2020 (https://doi.org/10.1111/geb.13089).

Files

Centroids.csv

Files (108.2 kB)

Name Size Download all
md5:2fd20178dc0ca5e6445b9b841720e7ec
5.1 kB Preview Download
md5:e4d2ab4447e78f9fa58f1a7f6592c1b3
24.4 kB Preview Download
md5:f6066942693aac3fd6de3e59b5f9637a
2.2 kB Download
md5:0ccda767cf7aa6cff260b1366ba6ea75
39.3 kB Download
md5:420f9cf2eb6432e7b03b7444cade50e5
37.2 kB Download