Published September 22, 2021 | Version v1
Software Open

Data from: Multispecies site occupancy modeling and study design for spatially replicated environmental DNA metabarcoding

  • 1. National Institute for Environmental Studies

Description

Although environmental DNA (eDNA) metabarcoding is now widely used to assess ecosystems in a noninvasive and cost-efficient manner, false negatives can occur owing to various factors in its inherently multistage workflow. Accounting for these species detection errors is therefore essential for accurate assessment of species distribution and diversity. To address this issue, we proposed a variant of the multispecies site occupancy model for eDNA metabarcoding studies and applied it to an eDNA metabarcoding dataset of freshwater fish communities collected in the Kasumigaura watershed in Japan.

Notes

This archive contains the dataset used for the analysis and the script files needed to reproduce the results. It comprises the following five files:

  • 1_model_fitting.R: R script for fitting the multispecies site occupancy model proposed in the paper to an eDNA metabarcoding dataset of fish communities. The second half of the file also contains scripts for assessing and plotting the model-fit results.
  • 2_decision_analysis.R: R script for estimating the effectiveness of species detection under various study designs based on the model-fit results. The second half of the file contains scripts for drawing profiles of the estimated effectiveness of the species detection.
  • data.Rdata: This file contains sequence read count data for 50 fish species groups obtained via environmental DNA metabarcoding at 50 sites in the Kasumigaura watershed, Japan. It is a binary file written with the save() function in R and can be restored with load(). In addition to a three-dimensional array of sequence reads with species, site, and replicate dimensions, it holds two vectors of covariates (riverbank and mismatch) used to explain the variation in the sequence read counts.
  • functions.R: This file defines auxiliary functions for the analyses performed using the two R scripts (1_model_fitting.R and 2_decision_analysis.R).
  • model.jags: JAGS model file that defines the multispecies site occupancy model to be fitted in the R script 1_model_fitting.R.
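
To make the layout of the read-count array concrete, the following is a minimal Python sketch built on a tiny synthetic array. It is not the archived data and not the authors' method: the archive's scripts are written in R, the object names inside data.Rdata are not stated here, and the names `counts`, `detected`, and `naive_occupancy` are hypothetical. The sketch only shows how a species × site × replicate array of read counts collapses into site-level detections, and why the resulting naive proportion is a lower bound that ignores false negatives.

```python
# Synthetic stand-in for a read-count array indexed as
# counts[species][site][replicate]. Values are made up for illustration.
counts = [
    [[120, 0, 35], [0, 0, 0]],  # hypothetical species A: 2 sites x 3 replicates
    [[0, 0, 0], [4, 0, 0]],     # hypothetical species B: 2 sites x 3 replicates
]


def detected(counts):
    """Return flags d[i][j]: True if species i has any reads at site j."""
    return [
        [any(reads > 0 for reads in site) for site in species]
        for species in counts
    ]


def naive_occupancy(counts):
    """Fraction of sites with at least one read, per species.

    This treats every all-zero site as unoccupied, so it does not
    account for false negatives.
    """
    return [sum(flags) / len(flags) for flags in detected(counts)]


print(detected(counts))
print(naive_occupancy(counts))
```

In this toy array, species B is seen in only one of three replicates at its second site, which illustrates how easily a present species could be missed with fewer replicates. A site occupancy model such as the one defined in model.jags is intended to separate that detection process from true occupancy.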

Funding provided by: Environmental Restoration and Conservation Agency
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100014423
Award Number: ERTDF, Nos. 4–1705 and 2–2001

Funding provided by: Japan Society for the Promotion of Science
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100001691
Award Number: KAKENHI, Nos. 20H03010 and 20K06102

Files

Files (40.6 kB)

Checksum                               Size
md5:6aa803ba4247e7e7c7c12456f9434809   4.4 kB
md5:75b5b8ea230086dc09d67973d4b5c171   7.8 kB
md5:c8536d4ae7afb66d590b0e7ce4a485f6   26.4 kB
md5:9f52a1ccb020916220a4da5f5ba7a270   1.9 kB
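
The MD5 checksums in the file listing can be used to confirm that a download is intact. The following is a minimal Python sketch using only the standard library. The helper names `md5_of_file` and `matches_listing` are hypothetical, and because the listing here does not show which checksum belongs to which file, the pairing should be confirmed on the record page.

```python
import hashlib


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read in chunks so large files need not fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a listed checksum.

    `listed` may be a bare hex digest or carry an 'md5:' prefix,
    as in the file listing.
    """
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listing("model.jags", "md5:...")` would return True only if the local copy of that file hashes to the digest supplied, where the digest is whichever listed value corresponds to that file.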

Additional details

Related works

Is source of
10.5061/dryad.3bk3j9kkm (DOI)