There is a newer version of the record available.

Published April 21, 2023 | Version v1
Dataset Open

Metagenomics uncovers dietary adaptations for chitin digestion in the gut microbiota of convergent myrmecophagous mammals

  • 1. Sophie
  • 2. Marie-Ka
  • 3. Amandine
  • 4. Roxane
  • 5. Nora. M
  • 6. Wendy
  • 7. Andrea
  • 8. W. J.
  • 9. Nico L
  • 10. Benoit
  • 11. Guillaume
  • 12. Frédéric

Description

Metagenomics uncovers dietary adaptations for chitin digestion in the gut microbiota of convergent myrmecophagous mammals

Sophie Teulleta,#, Marie-Ka Tilaka, Amandine Magdeleinea, Roxane Schaubb,c, Nora M. Weyerd, Wendy Panainod,e, Andrea Fullerd, W. J. Loughryf, Nico L. Avenantg, Benoit de Thoisyh,i, Guillaume Borrelj and Frédéric Delsuca,#

aInstitut des Sciences de l’Evolution de Montpellier (ISEM), Univ Montpellier, CNRS, IRD, Montpellier, France

bCIC AG/Inserm 1424, Centre Hospitalier de Cayenne Andrée Rosemon, Cayenne, French Guiana

cTropical Biome and immunopathology, Université de Guyane, Labex CEBA, DFR Santé, Cayenne, French Guiana

dBrain Function Research Group, School of Physiology, University of the Witwatersrand, Johannesburg, South Africa

eCentre for African Ecology, School of Animals, Plant, and Environmental Sciences, University of the Witwatersrand, Johannesburg, South Africa

fDepartment of Biology, Valdosta State University, Valdosta, GA, USA

gNational Museum and Centre for Environmental Management, University of the Free State, Bloemfontein, South Africa

hInstitut Pasteur de la Guyane, Cayenne, French Guiana, France

iKwata NGO, Cayenne, French Guiana, France

jInstitut Pasteur, Université Paris Cité, UMR CNRS 6047, Evolutionary Biology of the Microbial Cell, Paris, France

#Corresponding authors: sophie.teullet@umontpellier.fr; frederic.delsuc@umontpellier.fr

 

Abstract

In mammals, myrmecophagy (ant and termite consumption) represents a striking example of dietary convergence. This trait evolved independently at least five times in placentals with myrmecophagous species comprising aardvarks, anteaters, some armadillos, pangolins, and aardwolves. The gut microbiome plays an important role in dietary adaptation, and previous analyses of 16S rRNA metabarcoding data have revealed convergence in the composition of the gut microbiota among some myrmecophagous species. However, the functions performed by these gut bacterial symbionts and their potential role in the digestion of prey chitinous exoskeletons remain open questions. Using long- and short-read sequencing of fecal samples, we generated 29 gut metagenomes from nine myrmecophagous and closely related insectivorous species sampled in French Guiana, South Africa, and the USA. From these, we reconstructed 314 high-quality bacterial genome bins of which 132 carried chitinase genes, highlighting their potential role in insect prey digestion. These chitinolytic bacteria belonged mainly to the family Lachnospiraceae, and some were likely convergently recruited in the different myrmecophagous species as they were detected in several host orders (i.e., Enterococcus faecalis, Blautia sp), suggesting that they could be directly involved in the adaptation to myrmecophagy. Others were found to be more host-specific, possibly reflecting phylogenetic constraints and environmental influences. Overall, our results highlight the potential role of the gut microbiome in chitin digestion in myrmecophagous mammals and provide the basis for future comparative studies performed at the mammalian scale to further unravel the mechanisms underlying the convergent adaptation to myrmecophagy.

 

Figures and Tables

Main_figures.zip

  • Figure. 1. Phylogenetic position of the 314 high-quality selected bins reconstructed from 29 gut metagenomes of the nine focal myrmecophagous species within a reference prokaryotic phylogeny. A: Phylogeny of the 314 selected bins (red branches) with 2496 prokaryote reference genomes. Circles respectively indicate (from inner to outer circles): the bacterial phyla and kingdom to which these genome bins were assigned based on the Genome Taxonomy Database (Chaumeil et al, 2020). Clades, where a subtree was defined, are highlighted in blue for the Firmicutes (Fig 1B), green for the Bacteroidetes, and pink for the Proteobacteria (Figs S1 A and B respectively). B: Subtree within Firmicutes showing myrmecophagous-specific clades (blue highlights; dark blue corresponds to the three clades mentioned in the results, light blue to the other clades). The outer circle indicates the bacterial family to which these genome bins were assigned based on the Genome Taxonomy Database.

  •  

    Figure. 2. Phylogeny of the 394 GH18 sequences identified in 132 high-quality selected bins reconstructed from 29 gut metagenomes of the nine focal myrmecophagous species and relatives. Red branches indicate the 237 sequences having an active chitinolytic site (DXXDXDXE). Circles respectively indicate (from inner to outer circles): the bacterial family and phyla of the bin the sequence was retrieved from. Colored sequence names indicate the host species. Colored circles at certain nodes indicate enzymes to which sequences are similar when blasting them against the NCBI non-redundant protein database.

  • Figure 3. Detection of the 314 high-quality selected bins (lines) in the 29 gut metagenomes (columns) of the nine focal species. Each square indicates the detection of a bin in a sample as estimated by anvi’o v7 (Eren et al, 2021). Names of bins are indicated on the left with red indicating bins detected in at least one soil sample (detection > 0.25) (Fig S4 and detection table available via Zenodo). Phylogenetic relationships of host species, distinguished by different color strips, are represented at the bottom of the graph. Columns on the right indicate (from left to right): the number of GH18 sequences identified in each bin (from 0 to 17), the bin’s taxonomic phylum, class, order, and family. The phylogeny of the 314 selected bins inferred with PhyloPhlAn v3.0.58 (Asnicar et al, 2020) is also represented on the right of the graph (see Fig S2). Silhouettes were downloaded from phylopic.org.

  • Figure 4. Distribution of chitinolytic selected bins (red links) among the nine focal myrmecophagous species and relatives. Phylogenies of the 314 high-quality selected bins (Fig S2) and of the nine host species (downloaded from timetree.org) are represented respectively on the left and the right of the graph. Links illustrate, for each bin, in which host species the bin was detected (detection threshold > 0.25). Red links indicate bins in which at least one GH18 sequence with an active chitinolytic site (DXXDXDXE) was found (chitinolytic bins). The size of the circles at the tips of the host phylogeny is proportional to the number of samples (n = 1 for D. kap; n = 2 for D. nov, C. uni and M. tri; n = 3 for T. tet and O. afe; n = 4 for D. sp. nov FG; n = 6 for P. cri and S. tem). Bins’ names are indicated at the tip of the bins’ phylogeny and main bacterial phyla are indicated by colored vertical bars. This graph was done with the cophylo R package within the phytools suite (Revell, 2012). Silhouettes downloaded from phylopic.org.

 

  • Table 1. Detailed sample information for the 33 fecal samples collected.

 

Supplementary Materials

Supplementary_material_Teullet_etal_2023.pdf contains supplementary figures (S1-4) and tables (S2-3).

Supplementary_results_Teullet_etal_2023.pdf includes a comparison of genome statistics of the selected bins reconstructed from the long-read vs the short-read datasets, a phylogeny of the set of selected bins before dereplication (n = 407) and a comparison of the distribution of shared and specific genome bins carrying GH18 among host orders.

Supplementary_material_files.zip contains a file for each of the supplementary material figures (S1-4) and tables (S2-3).

Supplementary_results_files.zip contains a file for each of the three figures presented in the supplementary results.

Table S1. Raw results of the different analyses conducted on each gut metagenome to reconstruct high-quality genome bins from raw metagenomic data for each dataset (long- and short-reads).

 

Zenodo supplementary files

  • Assemblies

Long-read_metagenomic_assemblies_polished.zip contains 31 long-read metagenomes assembled with metaFlye strain v2.9 and polished with short reads using Pilon v1.4, which were used for binning.

Long-read_metagenomic_assemblies_not_polished.zip contains 33 long-read metagenomes assembled with metaFlye strain v2.9 before polishing.

Short-read_metagenomic_assemblies.zip contains 31 short-read metagenomes assembled with metaSPAdes and MEGAHIT.

N.B:

  1. Two samples were not sequenced using Illumina short reads (DASY_M1746 and DASY_VLD168), only long reads were generated and assembled for these two samples and are made available here. As these assemblies could not be polished, these samples were not included in downstream analyses.
  2. Two samples were highly contaminated by host reads (CAB_M3141 and MYR_M5295) and not used for downstream analyses. As they were still assembled with the other samples, the corresponding metagenomes are made available here.

 

  • Binning: genome bins and dereplication results

High-quality_selected_bins_dereplicated.zip contains the 314 high-quality selected bins (>90% completion, <5% redundancy) reconstructed from long- and short-read metagenomes with metaBAT2 and dereplicated with dRep at 98% ANI.

metaBAT2_short-read_assemblies_bins.zip contains all bins reconstructed from the short-read assemblies with metaBAT2 (i.e, output of metaBAT2).

metaBAT2_long-read_assemblies_bins.zip contains all bins reconstructed from the long-read polished assemblies with metaBAT2 (i.e, output of metaBAT2).

Output_dRep_98ANI_407_bins_long-short-reads.zip contains the output of the dereplication analysis done on the set of 407 high-quality selected genome bins reconstructed from long- (n = 201) and short-read (n = 206; labeled "spad") metagenomes. It was done with dRep using default parameters. After this step, the final dataset included 314 high-quality non-redundant genome bins. This folder includes:

  • The primary clustering of selected genome bins using the Mash algorithm with an ANI threshold of 90%.
  • The secondary clustering of selected genome bins using the fastANI algorithm with an ANI threshold of 98%.
  • The clustering score attributed to each genome bin during dereplication. * symbol indicates genomes chosen to be the representative genomes of their cluster.

 

  • Phylogenetic datasets

Fig1_supplementary_files_alignment_tree.zip contains the alignment of the concatenated markers and the final tree reconstructed by PhyloPhlAn v3.0.58 (Asnicar et al, 2020) for the 314 high-quality selected genome bins and the 2496 prokaryote reference genomes.

Fig2_supplementary_files_alignment_tree.zip contains the alignment of the 394 GH18 sequences identified in 132 high-quality selected bins done with MAFFT v7.450 and the corresponding tree inferred with RAxML v8.2.11 within Geneious Prime 2022.0.2.

Phylogeny_314_selected_bins_dRep.zip contains the alignment of the concatenated markers and the final tree reconstructed by PhyloPhlAn v3.0.58 (Asnicar et al, 2020) for the 314 high-quality selected and dereplicated genome bins.

Phylogeny_407_selected_bins_nodRep.zip contains the alignment of the concatenated markers and the final tree reconstructed by PhyloPhlAn v3.0.58 (Asnicar et al, 2020) for the 407 high-quality selected genome bins before dereplication.

 

  • Bins across samples

detection_bins_across_gut_metagenomes.txt is a tab-delimited file containing the detection values inferred by anvi'o v7 (Eren et al, 2021) for the 314 high-quality selected bins across the 29 gut metagenomes from the nine focal myrmecophagous species. 

abundance_bins_gut_metagenomes.txt is a tab-delimited file containing the absolute abundance values inferred by anvi'o v7 (Eren et al, 2021) for the 314 high-quality selected bins across the 29 gut metagenomes from the nine focal myrmecophagous species. 

detection_bins_across_soil_samples.txt is a tab-delimited file containing the detection values inferred by anvi'o v7 (Eren et al, 2021) for the 140 high-quality selected bins reconstructed from the aardvark, ground pangolin and southern aardwolf gut metagenomes across the eight soil samples collected on sample sites in South Africa.

Files

abundance_bins_across_gut_metagenomes.txt

Files (18.6 GB)

Name Size Download all
md5:52f0c146464e3931b23a2e887bf64d2d
195.0 kB Preview Download
md5:b6e59daf5265526e3762fd89c4487536
193.4 kB Preview Download
md5:ec45a9515688669dfe441834c7d3eec8
25.7 kB Preview Download
md5:da0c77f27245befc3734765f51e67f14
3.9 MB Preview Download
md5:8106eb1571b43ba023f744aa99fe5771
75.0 kB Preview Download
md5:ecb6da8a5ad32eace550235b374c0ddc
241.8 MB Preview Download
md5:a1ee3ee55cccd5d997e2826aba67f637
2.5 GB Preview Download
md5:42da88678ace3bb775295cd147994c7c
2.5 GB Preview Download
md5:5db7ccc6e14156cf8b729ed35f855d96
3.4 MB Preview Download
md5:c1ca59fac90e3c67a5a0479c10eaea6b
2.2 GB Preview Download
md5:8b51a157c722034cf17e046f4ffe5094
2.1 GB Preview Download
md5:ec26a54786ad50bb287d519ad14d3c6e
512.5 kB Preview Download
md5:daa3c8b2ff944d3aeaa45531e0bb94fb
434.7 kB Preview Download
md5:0c97e2c19ecbc004a0da30e84e1d603a
555.8 kB Preview Download
md5:7fb96145b69fd08612c19ac050061eb8
9.0 GB Preview Download
md5:efb1c502deb647949d219ce25fb5afb5
1.2 MB Preview Download
md5:945441e38b61a34d8a0e09425f52eaa5
2.7 MB Preview Download
md5:58d82ecb909b728ed765a14b80f09715
213.2 kB Preview Download
md5:f2524e4633fec840782ff0e7be6a8fbc
1.8 MB Preview Download
md5:bed0df4a335fe4228319830c340b71af
839.7 kB Download

Additional details

Funding

European Commission
ConvergeAnt - An Integrative Approach to Understanding Convergent Evolution in Ant-eating Mammals 683257
Agence Nationale de la Recherche
CEBA - CEnter of the study of Biodiversity in Amazonia ANR-10-LABX-0025
Agence Nationale de la Recherche
CeMEB - Mediterranean Center for Environment and Biodiversity ANR-10-LABX-0004