Published May 30, 2023 | Version v1
Software Open

Data and code for: Building use-inspired species distribution models: using multiple data types to examine and improve model performance

  • 1. Woods Hole Oceanographic Institution
  • 2. San Diego State University
  • 3. National Oceanic and Atmospheric Administration
  • 4. University of the Azores
  • 5. Gulf of Maine Research Institute
  • 6. University of California, Santa Cruz
  • 7. University of Porto
  • 8. Massachusetts Division of Marine Fisheries*
  • 9. Marine Biological Association of the United Kingdom

Description

Species distribution models (SDMs) are becoming an important tool for marine conservation and management. Yet while there is an increasing diversity and volume of marine biodiversity data for training SDMs, little practical guidance is available on how to leverage distinct data types to build robust models. We explored the effect of different data types on the fit, performance and predictive ability of SDMs by comparing models trained with four data types for a heavily exploited pelagic fish, the blue shark (Prionace glauca), in the Northwest Atlantic: two fishery-dependent (conventional mark-recapture tags, fisheries observer records) and two fishery-independent (satellite-linked electronic tags, pop-up archival tags). We found that all four data types can result in robust models, but differences among spatial predictions highlighted the need to consider ecological realism in model selection and interpretation regardless of data type. Differences among models were primarily attributed to biases in how each data type, and the associated representation of absences, sampled the environment and summarized the resulting species distributions. Outputs from model ensembles and a model trained on all pooled data both proved effective for combining inferences across data types and provided more ecologically realistic predictions than individual models. Our results provide valuable guidance for practitioners developing SDMs. With increasing access to diverse data sources, future work should further develop truly integrative modeling approaches that can explicitly leverage strengths of individual data types while statistically accounting for limitations, such as sampling biases. 

Notes

Funding provided by: NASA Headquarters
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100017437
Award Number: 80NSSC19K0187

Files

pipelines.zip

Files (200.4 kB)

Name Size Download all
md5:a5569a1e33eec008f3e29a86894c23fa
51.3 kB Download
md5:fe04f51aefee82a957e30293fa45dae5
114.0 kB Preview Download
md5:050b475dca20da211c488fe2696bf41f
35.0 kB Preview Download

Additional details

Related works

Is source of
10.5061/dryad.h44j0zpr2 (DOI)