Published February 23, 2023 | Version v1
Journal article Open

Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository

  • 1. Laboratory of Genomics, Maurice Lamontagne Institute, Fisheries and Oceans Canada, Mont-Joli, Canada
  • 2. Maurice Lamontagne Institute, Fisheries and Oceans Canada, Mont-Joli, Canada

Description

Biodiversity assessments relying on DNA have increased rapidly over the last decade. However, the reliability of taxonomic assignments in metabarcoding studies is variable and is affected by the reference databases and the assignment methods used. Species-level assignments are usually considered reliable when made with regional libraries but unreliable when made with public repositories. In this study, we aimed to test this assumption for metazoan species detected in the Gulf of St. Lawrence in the Northwest Atlantic. We first created a regional library (GSL-rl) by data mining COI barcode sequences from BOLD, and included a reliability ranking system for species assignments. We then 1) estimated the accuracy and precision of the public repository NCBI-nt for species assignments using sequences from the regional library and 2) compared the detection and reliability of species assignments in a metabarcoding dataset using either NCBI-nt or the regional library with popular assignment methods. With NCBI-nt and sequences from the regional library, the BLAST-LCA (least common ancestor) method was the most precise method for species assignments, but the accuracy was higher with the BLAST-TopHit method (>80% over all taxa, between 70% and 90% amongst taxonomic groups). With the metabarcoding dataset, the reliability of species assignments was greater with GSL-rl than with NCBI-nt. However, we also observed that the total number of reliable species assignments could be maximized by using both GSL-rl and NCBI-nt with different optimized assignment methods. A two-step approach for species assignments, i.e., using a regional library and a public repository, could improve both the reliability of species assignments and the number of species detected in metabarcoding studies.
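The difference between the two BLAST-based assignment methods named above, and the idea of a two-step lookup, can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The `Hit` structure, the 97% identity threshold, the rank list, the example hits, and the pairing of TopHit with the regional library and LCA with the public repository are all assumptions made for illustration; the abstract does not state which optimized method was used with which database.

```python
from typing import NamedTuple, Optional, Sequence, Tuple

# Ranks used for the illustration, from broadest to narrowest (assumed).
RANKS = ("phylum", "class", "order", "family", "genus", "species")


class Hit(NamedTuple):
    """One BLAST hit (hypothetical structure for this sketch)."""
    identity: float            # percent identity of the alignment
    lineage: Tuple[str, ...]   # taxon names, ordered as in RANKS


def top_hit(hits: Sequence[Hit], min_identity: float = 97.0) -> Optional[str]:
    """TopHit: return the species of the single best hit above the threshold."""
    kept = [h for h in hits if h.identity >= min_identity]
    if not kept:
        return None
    best = max(kept, key=lambda h: h.identity)
    return best.lineage[-1]


def lca(hits: Sequence[Hit], min_identity: float = 97.0) -> Optional[Tuple[str, str]]:
    """LCA: return (rank, name) of the deepest taxon shared by all retained hits."""
    kept = [h for h in hits if h.identity >= min_identity]
    if not kept:
        return None
    result = None
    for i, rank in enumerate(RANKS):
        names = {h.lineage[i] for h in kept}
        if len(names) != 1:
            break  # hits disagree at this rank; stop at the previous one
        result = (rank, names.pop())
    return result


def two_step(regional_hits: Sequence[Hit], public_hits: Sequence[Hit]):
    """Try the regional library first, then fall back to the public repository.

    The method chosen for each database here is an assumption, not the
    configuration reported in the article.
    """
    species = top_hit(regional_hits)
    if species is not None:
        return species, "regional"
    shared = lca(public_hits)
    if shared is not None and shared[0] == "species":
        return shared[1], "public"
    return None, None


# Made-up hits for one query sequence (illustrative values only).
GADIDAE = ("Chordata", "Actinopteri", "Gadiformes", "Gadidae", "Gadus")
example_hits = [
    Hit(99.5, GADIDAE + ("Gadus morhua",)),
    Hit(99.0, GADIDAE + ("Gadus chalcogrammus",)),
]
```

With these made-up hits, TopHit commits to the species of the best match, whereas LCA stops at the genus because the two retained hits disagree at species level. This is one way to see why an LCA-style method can be more conservative than TopHit while yielding fewer species-level assignments.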

Files

MBMG_article_98539.pdf (765.4 kB)
md5:bd498ff4b5b9fbe3d57054d02724d98c

System files (176.0 kB)

md5:31d7d6dad33eb7fd0643f1d317f37eef
176.0 kB

Additional details

Related works

Has part
Other: 10.3897/mbmg.7.98539.suppl1 (DOI)