Published February 18, 2021 | Version v2.0
Dataset Open

High-quality carnivoran genomes from roadkill samples enable comparative species delineation in aardwolf and bat-eared fox

  • 1. Institut des Sciences de l'Evolution de Montpellier (ISEM), CNRS, IRD, EPHE, Université de Montpellier, France
  • 2. National Museum and Centre for Environmental Management, University of the Free State, Bloemfontein, South Africa
  • 3. Department of Natural Sciences, National Museums Scotland, Edinburgh, UK
  • 4. CNRS, Sorbonne Université, FR2424, ABiMS, Station Biologique de Roscoff, 29680 Roscoff, France

Description

High-quality carnivore genomes from roadkill samples enable comparative species delineation in aardwolf and bat-eared fox

Rémi Allio1*, Marie-Ka Tilak1, Céline Scornavacca1, Nico L. Avenant2, Andrew C. Kitchener3, Erwan Corre4, Benoit Nabholz1&5, and Frédéric Delsuc1*

 

Affiliations

1Institut des Sciences de l’Evolution de Montpellier (ISEM), CNRS, IRD, EPHE, Université de Montpellier, France remi.allio@umontpellier.fr marie-ka.tilak@umontpellier.fr celine.scornavacca@umontpellier.fr benoit.nabholz@umontpellier.fr frederic.delsuc@umontpellier.fr

2National Museum and Centre for Environmental Management, University of the Free State, Bloemfontein, South Africa navenant@nasmus.co.za

3Department of Natural Sciences, National Museums Scotland, Edinburgh, UK a.kitchener@nms.ac.uk

4CNRS, Sorbonne Université, FR2424, ABiMS, Station Biologique de Roscoff, 29680 Roscoff, France corre@sb-roscoff.fr

5Institut Universitaire de France (IUF)

*Correspondence: remi.allio@umontpellier.fr, frederic.delsuc@umontpellier.fr

 

Running head

Genomics from roadkill samples

 

Abstract

In a context of continuing erosion of biodiversity, obtaining genomic resources from wildlife is becoming essential for conservation. The many thousands of mammals killed on roads annually could potentially provide a useful source of material for genomic surveys. To illustrate the potential of this underexploited resource, we used roadkill samples to sequence reference genomes and study the genomic diversity of the bat-eared fox (Otocyon megalotis) and the aardwolf (Proteles cristatus), for which subspecies have been defined based on similar disjunct distributions in Eastern and Southern Africa. By developing an optimized DNA extraction protocol, we successfully obtained long reads using the Oxford Nanopore Technologies (ONT) MinION device. For the first time in mammals, we obtained two reference genomes with high contiguity and gene completeness by combining ONT long reads with Illumina short reads using a hybrid assembly approach. Based on re-sequencing data from a few other roadkill samples, the comparison of the genetic differentiation between our two pairs of subspecies to that of pairs of well-defined species across the Carnivora showed that the two subspecies of aardwolf might warrant species status (P. cristatus and P. septentrionalis), whereas the two subspecies of bat-eared fox might not. Moreover, using these data, we conducted demographic analyses that revealed similar trajectories between Eastern and Southern populations of both species, suggesting that their population sizes have been shaped by similar environmental fluctuations. Finally, we obtained a well resolved genome-scale phylogeny for the Carnivora, with evidence for incomplete lineage sorting among the three main arctoid lineages. Overall, our cost-effective strategy opens the way for large-scale population genomic studies and phylogenomics of mammalian wildlife using roadkill.

 

Figures & Tables

Figure 1. Disjunct distributions of the aardwolf (Proteles cristatus) and the bat-eared fox (Otocyon megalotis) in Eastern and Southern Africa. Within each species, two subspecies have been recognized based on their distributions and morphological differences (Clark, 2005; Koehler and Richardson, 1990). Picture credits: Southern aardwolf (P. cristatus cristatus) copyright Dominik Käuferle; Southern bat-eared fox (O. megalotis megalotis) copyright Derek Keats.

Figure 2. Representation of the mitochondrial genetic diversity within the Carnivora with a) the mitogenomic phylogeny inferred from 142 complete Carnivora mitogenomes, including those of the two populations of aardwolf (Proteles cristatus) and bat-eared fox (Otocyon megalotis) and b) intraspecific (orange) and the interspecific (red) genetic diversities observed for the two mitochondrial markers COX1 and CYTB. Silhouettes from http://phylopic.org/.

Figure 3. Genetic differentiation indices obtained from a comparison of intraspecific (orange) and interspecific (red) polymorphisms in four pairs of well-defined Carnivora species and for the subspecies of aardwolf (Proteles cristatus) and bat-eared fox (Otocyon megalotis) (grey). Silhouettes from http://phylopic.org/.

Figure 4. PSMC estimates of changes in effective population size over time for the Eastern (orange) and Southern (blue and purple) populations of a) bat-eared fox and ) aardwolf. mu = mutation rate of 10-8 mutations per site per generation and g = generation time of 2 years. Vertical red lines indicate 20 kyrs and 40 kyrs. Silhouettes from http://phylopic.org/.

Figure 5. Phylogenomic tree reconstructed from the nucleotide supermatrix composed of 14,307 single-copy orthologous genes for 52 species of Carnivora plus one outgroup (Manis javanica). The family names in the legend are ordered as in the phylogeny. Silhouettes from http://phylopic.org/.

Figure 6. Phenotypic comparisons, highlighting the differences in fur coloration and stripe pattern, between captive individuals of Eastern (P. septentrionalis) and Southern (P. cristatus) aardwolves held at Hamerton Zoo Park (UK). All pictures copyright and used with permission from Rob Cadd.

Table 1. Summary of sequencing and assembly statistics of the genomes generated in this study.

 

Additional files

Figure S1: Graphical representation of the convergence of the PhyloBayes chains run for dating analyses using a) Clock model, b) LN model, and c) UGAM model.

Figure S2: Plot of the quality of Nanopore long reads base-called with either the fast or the high accuracy option of Guppy v3.1.5. The quality of the base-calling step has a large impact on the final quality of the assemblies by reducing the number of contigs and increasing the N50 value.

Figure S3: Comparison of 503 mammalian genome assemblies from 12 taxonomic groups using bean plots of the a) number of scaffolds, and b) scaffold N50 values ranked by median values. Thick black lines show the medians, dashed black lines represent individual data points, and polygons represent the estimated density of the data. Note the log scale on the Y axes. The bat-eared fox (Otocyon megalotis megalotis) and aardwolf (Proteles cristatus cristatus) assemblies produced in this study using SOAPdenovo and MaSuRCA are indicated by asterisks. Bean plots were computed using BoxPlotR (Spitzer et al., 2014).

Figure S4: BUSCO completeness assessment of 67 Carnivora genome assemblies visualized as bar charts representing percentages of complete single-copy (light blue), complete duplicated (dark blue), fragmented (yellow), and missing (red) genes ordered by increasing percentage of total complete genes. The bat-eared fox (Otocyon megalotis megalotis) and aardwolf (Proteles cristatus cristatus) assemblies produced in this study using MaSuRCA and SOAPdenovo are indicated by asterisks.

Figure S5. Genetic differentiation indices obtained from a comparison of intraspecific and interspecific polymorphisms after having homogenized the coverage of all species (at about 15x). The estimates were calculated for four pairs of well-defined Carnivora species and for the subspecies of aardwolf (Proteles cristatus) and bat-eared fox (Otocyon megalotis). Silhouettes from http://phylopic.org/.

Figure S6: Genetic differentiation indices obtained from the comparison of intraspecific and interspecific polymorphisms for the pair Ursus arctos/Ursus maritimus (~10 replicates per species). GDI is estimated for each pair of individuals. This result demonstrates that randomly picking only three individuals (out of 10) is sufficient to accurately estimate the level of genetic differentiation between the two species. 

Figure S7: Definition of the genetic differentiation index (GDI) based on the F-statistic (FST). The main difference between these two indexes is the use of heterozygous allele states for GDI rather than real polymorphism for the FST. Green = πwithin, Orange = πbetween, Blue = Population A, Red = Population A+B.

Figure S8: Graphical representation (BlobPlot) of the results of contamination analyses performed with BlobTools for a) the aardwolf (Proteles cristatus cristatus) and b) the bat-eared fox (Otocyon megalotis megalotis) genome assemblies.

Table S1: Pairwise patristic distances estimated for the 142 species based on branch lengths of the phylogenetic tree inferred with the 15 mitochondrial loci (2 rRNAs and 13 protein-coding genes).

Table S2: Results of Bayesian dating for the two nodes leading to the Proteles cristatus sspp. and the Otocyon megalotis sspp.. Divergence time estimates based on UGAM and LN models are reported with associated 95% credibility intervals for each MCMC chain.

Table S3: Sample details and assembly statistics (Number of contigs/scaffolds and associated N50 values) for the 503 mammalian assemblies retrieved from NCBI (https://www.ncbi.nlm.nih.gov/assembly) on August 13th, 2019 with filters: “Exclude derived from surveillance project”, “Exclude anomalous”, “Exclude partial”, and using only the RefSeq assembly for Homo sapiens.

Table S4: Genome completeness assessment of MaSuRCA and SOAPdenovo assemblies obtained for Proteles cristatus cristatusand Otocyon megalotis megalotis together with the 63 carnivoran assemblies available at NCBI on August 13th, 2019 using Benchmarking Universal Single-Copy Orthologs (BUSCO) v3 with the Mammalia OrthoDB 9 BUSCO gene set.

Table S5: Annotation summary and supermatrix composition statistics of the 53 species used to infer the genome-scale Carnivora phylogeny.

Table S6: Sample details and assembly statistics of the 13 newly assembled carnivoran mitochondrial genomes.

Table S7: Node calibrations used for the Bayesian dating inferences based on mitogenomic data.

Table S8: Results of contamination analyses performed with BlobTools for the aardwolf (Proteles cristatus cristatus).

Table S9: Results of contamination analyses performed with BlobTools for the bat-eared fox (Otocyon megalotis megalotis).

Table S10: Summary information for the Carnivora genomes available either on GenBank, DNA Zoo and the OrthoMaM database as of February 11th, 2020. The “OMM” column indicates if the genome was available on OMM (yes) or not (no). The “Annotation” column indicates whether the genome was already annotated (yes) or not (no).

Supplementary File 1: Analyses of morphological differences between the two proposed species of aardwolf.

 

Zenodo supplementary files

MS_Genomes_Roadkill.sh contains the main command lines used for the study

1- Mitogenomics

Barcoding gap analyses: Barcoding gap analysis.zip contains the COX1 and CYTB matrices used for the barcoding gap analyses + the final trees and their associated IQ-TREE log files

Mitochondrial phylogeny: Mitochondrial phylogeny.zip contains the mitochondrial supermatrix used to generate de phylogeny + the mitochondrial phylogeny and the associated IQ-TREE log file. 

Dating: Mitochondrial dating.zip contains the results of the dating analyses.

2- Genomics

Assemblies.tar.gz contains the hybrid assemblies of Proteles cristata cristata and Otocyon megalotis megalotis.

Busco analyses:  Busco_Carnivora.xlsx contains genomes information like Accession numbers, Species names, Busco scores etc...

Genetic differentiation: Genetic differentiation analyses.zip contains the coordinates of the regions used for the estimations for each genomes + a summary of the results for each species pair. 

PSMC analyses: PSMC.zip contains the results of the PSMC analyses.

3- Phylogenomics 

Phylogenomic analyses: Phylogenomic analysis.zip contains the nucleotides supermatrix composed of 14,307 single-copy orthologous genes for 54 species of carnivores plus one outgroup (Manis javanica) + the phylogenomic tree and the associated IQ-TREE log file.

Coalescent analyses: ASTRAL-III.zip contains the results obtained from Astral analyses.

Gene trees: Gene trees.zip contains the gene trees.

 

Files

Barcoding gap analysis.zip

Files (2.0 GB)

Name Size Download all
md5:abcc0de6377b5f8cc4b97714650f5854
48.2 kB Download
md5:d54be66217cb3d7b7b8cc4043a8c5817
1.4 GB Download
md5:e470df33d6cf1b6b02260ea1660bb54a
1.9 MB Preview Download
md5:29a2d9b30b7bd7a94f684c60facd3e1b
19.6 MB Preview Download
md5:168ad5e81d701a248cb47aa44e7775cb
844.4 kB Preview Download
md5:cf9f4ac20ea2a8ad1b37deb8d9bb61a5
20.9 MB Preview Download
md5:c8738385136a8481e5988ac092303d2e
449.2 kB Preview Download
md5:dbe09c4fba9928139e60f0aa59d80f5a
704.7 kB Preview Download
md5:1303a70a9086273f893f2415d616ebc7
3.3 MB Preview Download
md5:6a4247e3cae7a3cf38ef5559ead5c7a9
1.1 MB Preview Download
md5:74323979000161156bc384b89e4fd186
187.0 kB Preview Download
md5:3cd3e0732453f4ef6ca0b66010032f11
423.5 kB Preview Download
md5:1f19ef59478ad8f263ace7d6f2e79896
311.0 kB Preview Download
md5:3bd27701ba6fc2d3d0725f188bfb5cdb
20.0 MB Preview Download
md5:0fc741e55ca06b911e87a64846e17236
384.7 kB Preview Download
md5:a7e932e642074ff769d34c94838490ea
72.1 kB Preview Download
md5:e873fb4108b43810e45e02da79c704d9
6.2 MB Preview Download
md5:e5ca549356f49067a6197bd24c6756dd
25.0 MB Preview Download
md5:7fe6dc4434131516d401c006478ed95e
58.0 kB Preview Download
md5:4efb99c481248c669152c7d4f8b26c65
47.7 kB Preview Download
md5:f9073f9cb28129ec855523dd930aa71a
579.6 kB Preview Download
md5:58177db437a3ffc891e1cf97536d1795
123.0 kB Download
md5:dab964c52a4e55388c99195ea78fe2bb
434.6 MB Preview Download
md5:e681fb0f602661c9efdabb102e97433b
19.5 MB Preview Download
md5:416cdef39d50f3dfc79ca1aa267785ce
2.3 MB Preview Download
md5:83e879f790aa86fbe25885c60bad04cb
62.6 kB Preview Download
md5:d3ee4f6ca8de996bf4040c9f4c4a94a6
286.5 kB Download
md5:fac6c927b70637a42479687f7561316a
59.6 kB Download
md5:d1cb86a99f413989d4eb53cc3eb7e7e7
9.5 kB Download
md5:72c4e60c001921d2c6ae6ce639f4f1c0
113.3 kB Download
md5:2349d2e681497cd0b70d36758d34ed97
76.8 kB Download
md5:aea9a93236b74fcea7b5f307af789ac7
11.4 kB Download
md5:dcbaf560a0bfb574758e0a7650312d1d
99.1 kB Download
md5:e0c49e0287478187fc0da2812d3320d4
9.3 kB Download
md5:4ce9c7e760524620bfa4533f26bc0f1d
960.8 kB Download
md5:2c5d426ba57111d11f1cb15a6da5bc0c
1.8 MB Download

Additional details

Related works

Is supplemented by
Journal article: 10.7554/eLife.63167 (DOI)

Funding

ConvergeAnt – An Integrative Approach to Understanding Convergent Evolution in Ant-eating Mammals 683257
European Commission