Protocol and R script for assembling a quality-controlled occurrence dataset

Legume Occurrences Working Group; Ringelberg, Jens; Gagnon, Edeline; Miller, Joe

doi:10.5281/zenodo.10513140

Published January 15, 2024 | Version v1

Software Open

Protocol and R script for assembling a quality-controlled occurrence dataset

This occurrence data assembly and cleaning protocol was put together for the Legume Occurrences Working Group (https://www.legumedata.org/working-groups/occurrences/).

The two most important files are 'Protocol for assembling occurrence datasets.docx', which explains the rationale behind the data download and cleaning processes, and 'GBIF quality control protocol.R', which provides step-by-step R functions to assemble a cleaned occurrence dataset. The R script is primarily written for data downloaded from GBIF, but with some minor tweaks will also work on data from other sources.

If you use this protocol or script for a publication, it would be appreciated if you could cite the paper that provided the first version of this script, Ringelberg et al. 2020 (https://doi.org/10.1111/geb.13089).

Files

Centroids.csv

Files (108.2 kB)

Name	Size	Download all
Centroids.csv md5:2fd20178dc0ca5e6445b9b841720e7ec	5.1 kB	Preview Download
Example species list.csv md5:e4d2ab4447e78f9fa58f1a7f6592c1b3	24.4 kB	Preview Download
GBIF quality control auxiliary functions.R md5:f6066942693aac3fd6de3e59b5f9637a	2.2 kB	Download
GBIF quality control protocol.R md5:0ccda767cf7aa6cff260b1366ba6ea75	39.3 kB	Download
Protocol for assembling occurrence datasets.docx md5:420f9cf2eb6432e7b03b7444cade50e5	37.2 kB	Download

	All versions	This version
Views	91	91
Downloads	175	175
Data volume	3.7 MB	3.7 MB

Protocol and R script for assembling a quality-controlled occurrence dataset

Creators

Description

Files

Centroids.csv

Files (108.2 kB)