Script and data from: The best of two worlds: toward large-scale monitoring of biodiversity combining metabarcoding and optimised parataxonomic validation.
Creators
Description
Description
Zenodo linked to : Penel, B., Meynard, C.N., Benoit, L., Bourdonné, A., Clamens, A., Soldati, L., Migeon, A., Chapuis, M.-P., Piry, S., Kergoat, G. and Haran, J. (2025), The best of two worlds: toward large-scale monitoring of biodiversity combining COI metabarcoding and optimized parataxonomic validation. Ecography, 2025: e07699. https://doi.org/10.1111/ecog.07699
Publication abstract
In a context of unprecedented insect decline, it is critical to have reliable monitoring tools to measure species diversity and their dynamic at large-scales. High-throughput DNA-based identification methods, and particularly metabarcoding, were proposed as an effective way to reach this aim. However, these identification methods are subject to multiple technical limitations, resulting in unavoidable false-positive and false-negative species detection. Moreover, metabarcoding does not allow a reliable estimation of species abundance in a given sample, which is key to document and detect population declines or range shifts at large scales. To overcome these obstacles, we propose here a Human-Assisted Molecular Identification (HAMI) approach, a framework based on a combination of metabarcoding and image-based parataxonomic validation of outputs and recording of abundance. We assessed the advantages of using HAMI over the exclusive use of a metabarcoding approach by examining 492 mixed beetle samples from a biodiversity monitoring initiative conducted throughout France. On average, 23% of the species are missed when relying exclusively on metabarcoding, this percent being consistently higher in species-rich samples. Importantly, on average, 20% of the species identified by molecular-only approaches correspond to false positives linked to cross-sample contaminations or mis-identified barcode sequences in databases. The combination of molecular methodologies and parataxonomic validation in HAMI significantly reduces the intrinsic biases of metabarcoding and recovers reliable abundance data. This approach also enables users to engage in a virtuous circle of database improvement through the identification of specimens associated with missing or incorrectly assigned barcodes. As such, HAMI fills an important gap in the toolbox available for fast and reliable biodiversity monitoring at large scales.
File description:
MiSeq raw sequences of the COI barcode from 492 Coleoptera field samples :
Files
Raw_sequencage_data.zip
Files
(4.8 GB)
Name | Size | Download all |
---|---|---|
md5:4a9c848f30bd0458d54701ac0cac34e5
|
2.1 MB | Preview Download |
md5:f3fe71cfb553a9b8c65dd63fe4e8ee41
|
32.2 MB | Preview Download |
md5:c4ae1ce4d4cf871bdf110ee05cec1c19
|
251.2 kB | Preview Download |
md5:3ee5752fc75ae58fdce88f61503af1cb
|
4.8 GB | Preview Download |
md5:0bf855449797ec7292cab63fe8680f25
|
2.7 MB | Preview Download |
Additional details
Funding
Software
- Repository URL
- https://github.com/BenoitPenel/PEWO-1
- Programming language
- Python, R, Snakemake
- Development Status
- Active