Published May 31, 2018 | Version v1
Dataset Open

Data from: Counting with DNA in metabarcoding studies: how should we convert sequence reads to dietary data?

  • 1. Australian Antarctic Division
  • 2. Science Department Smith‐Root Inc. Vancouver Washington*
  • 3. University of Tasmania
  • 4. University of Helsinki
  • 5. Queen Mary University of London
  • 6. Brown University
  • 7. Commonwealth Scientific and Industrial Research Organisation

Description

Advances in DNA sequencing technology have revolutionised the field of molecular analysis of trophic interactions and it is now possible to recover counts of food DNA sequences from a wide range of dietary samples. But what do these counts mean? To obtain an accurate estimate of a consumer's diet should we work strictly with datasets summarising frequency of occurrence of different food taxa, or is it possible to use relative number of sequences? Both approaches are applied to obtain semi-quantitative diet summaries, but occurrence data is often promoted as a more conservative and reliable option due to taxa-specific biases in recovery of sequences. We explore representative dietary metabarcoding datasets and point out that diet summaries based on occurrence data often overestimate the importance of food consumed in small quantities (potentially including low-level contaminants) and are sensitive to the count threshold used to define an occurrence. Our simulations indicate that using relative read abundance (RRA) information often provide a more accurate view of population-level diet even with moderate recovery biases incorporated; however, RRA summaries are sensitive to recovery biases impacting common diet taxa. Both approaches are more accurate when the mean number of food taxa in samples is small. The ideas presented here highlight the need to consider all sources of bias and to justify the methods used to interpret count data in dietary metabarcoding studies. We encourage researchers to continue addressing methodological challenges, and acknowledge unanswered questions to help spur future investigations in this rapidly developing area of research.

Notes

Files

R code for figures and data.zip

Files (945.3 kB)

Name Size Download all
md5:ee4671c20868ebd279f0018138118f97
87.8 kB Preview Download
md5:daf281d2ebaeda86da535100f388ce75
423.1 kB Preview Download
md5:daf281d2ebaeda86da535100f388ce75
423.1 kB Preview Download
md5:39243187f55f5d9a9bc63eded9d54f7f
11.4 kB Download

Additional details

Related works

Is cited by
10.1111/mec.14734 (DOI)