Published March 27, 2025 | Version v1
Dataset Open

Ecological interactions drive evolutionary outcomes: The first example of major host plant shifts mediated by parasitic plants in insects

Description

Ecological interactions drive evolutionary outcomes: The first example of major host plant shifts mediated by parasitic plants in insects

Authors were anonymised during the reviewing process

Abstract

  • Phytophagous insects have specialized on virtually every plant lineage. Parasitic plants, however, are uncommon hosts. Among insects, only a single lineage of weevils, the Smicronychini, has successfully radiated on both parasitic and non-parasitic plants in a large panel of distantly related Asterid families. This unusual pattern suggests that major host plant shifts have occurred over the course of their diversification. Through the analysis of a phylogenomic dataset, we reconstruct for the first time their evolutionary history and ancestral host plant associations. Our results show that independent host plant shifts occurred both from parasitic to non-parasitic hosts and between distinct parasitic lineages. These results suggest that host shift mechanisms can be driven by ecological opportunities provided by plant-plant interactions. This first evidence of extreme insect host plant shifts mediated by parasitic plant-plant interactions emphasizes the core importance of ecological interactions as driving forces behind insect host plant shifts.

 

Figures & Tables

Figure 1: Host repertoire of Smicronychini weevils. From top to bottom: diagrams represent parasitic interactions between Smicronychini weevils and their host plants. Host plant families that make up the host repertoire of Smicronychini are shown below each corresponding diagram, with examples of associated weevils and known galls. Credits: BZ & JH, except for Orobanchaceae (Clémence Massard) and Asteraceae (@sinaloasilvestre, iNaturalist) pictures; diagrams with BioRender ©.

Figure 2: Phylogenomic tree based on the AHE dataset. A. Maximum likelihood tree of the AHE dataset. Nodes poorly supported (SH-aLRT < 80% or UFBS < 95%) are respectively highlighted with gray triangles and circles. For graphical purposes, some branches were shortened to half of their length (//). Pictures of weevils are linked to corresponding species. B. Partitioning scheme of a theoretical AHE loci. Flanking and inserted partitions (f1-3) are highlighted in grey and coding partitions (c1-3) are highlighted in green. The region targeted by the probes (P) is shown in red but does not appear in the final supermatrix. Credits: JH.

Figure 3: Ancestral host plant estimations of Smicronychini weevils and their current ecologies. The phylogenetic tree topology corresponds to one of the ML trees obtained from the extended dataset pruned to one branch per species. Species grafted with COI data are less robust than the AHE backbone and are represented with dashed branches. Gall-inducing species are marked with a “x” between branch tips and species names. Host plants of each species of weevil are colored according to their botanical family (on the right) and parasitic type (in top-right corner) as illustrated in Figure 1, while unknown states are left blank. Pie charts represented on each node correspond to the estimated likelihood of each character state computed by the ace function. Credits: BZ & JH

 

Appendices

Figure S1: Flowchart of the phylogenomic pipeline developed to process AHE data. Each step, numbered from 1 to 10 as referenced in the main text, is represented, from raw reads to the complete supermatrix. External tools and custom scripts (marked with an *) are given, along with a diagram representing each step of the pipeline.

Figure S2: Complete grafted tree of the extended dataset. ML tree generated with MFP+MERGE model on IQTREE v2.3.2 using the AHE topology as backbone. To increase computation time, -allnni option was not used. Bootstraps show SH-aLRT and UFBS values respectively.

Figure S3: Grafted tree of the extended dataset with one specimen per species. ML tree generated with MFP+MERGE model on IQTREE v2.2 using the AHE topology as backbone. Bootstraps show SH-aLRT and UFBS values respectively.

Table S1: Detailed interactions of species represented in the extended dataset. Informations marked with an asterisk (*) are considered unsure. Parasitic mode (PM): np = non-parasitic, hemip = hemiparasitic, p = holoparasitic.

Table S2: Complete specimen data. Sample IDs highlighted in red were removed from the final dataset, those highlighted in yellow were merged to an AHE sample as mentionned in "COI" column.

 

Zenodo supplementary files

01_PREPROCESS: Commands and files used to gather AHE probes targeted sequences from Anthonomus grandis genome

see README.sh for a detailed description of files and commands used to obtain them.

02_SCRIPTS: Scripts used during phylogenomic analysis.

Most scripts are used by the shell pipeline "04-FcC_HMMC_PB_elongated_probes.sh". Other softwares used by the pipeline that are available online are not listed but can be found inside the pipeline script or in figure S1.

bed and gapped_refs folders contain necessary files for the pipeline to run, see 01_PREPROCESS material to see how they were generated.
bed - genomic data from Anthonomus grandis reference used by the pipeline steps 7-8
gapped_refs - probe consensus sequences targeted on Anthonomus grandis reference genome, split fasta from consensus_probes_gap.fasta

04-FcC_HMMC_PB_elongated_probes.sh - Semi-automatic pipeline routinely used during analyses
check_het.R - estimates genetic distance
clean_duplicated_seq.sh - removes or makes a consensus of sequences from specimens with multiple sequences depending on genetic distance
count_gap_per_site.py - computes number of gaps per site
drop_short_seq.py - removes sequences shorter than a given fraction of non-gaps
HMMcleanNuc.pl - Nucleotide derived HMMCleaner script: Di Franco, A., Poujol, R., Baurain, D. & Philippe, H. (2019). Evaluating the usefulness of alignment filtering methods to reduce the impact of errors on evolutionary inferences. BMC Evol. Biol., 19, 21.
improve_parts.R - updates partition files based on Anthonomus grandis reference genome.
make_codon_partitions_forProbe_from_charset.py - makes codon partitions from a nexus "charset" file
make_codon_partitions_forProbe.py - makes codon partitions from partition files ".part"
make_gapped_parts.py - makes partition files based on the probe reference considering gaps in probe sequence
make_good_fasta.py - converts block fasta to one line format
PaulBlock.pl - removes columns with more than a given threshold of gaps (only for non-coding parts)
remove_seq_given_ID.py - removes sequences with a strict given ID
remove_seq_given_relaxed_ID.py - removes sequences with a given pattern
seqCat.pl - cf file header
seqConverter.pl - cf file header
trim_partition_file.R - trims sequences overlapping with each other based on bed formatted reference genome

03_TREES: Intermediate and finales files generated by the phylogenomic pipeline

    AHE: Phylogenomic analyses on AHE dataset

AHE_2024_v8.3_70.phylip - Final sequence file obtained with the phylogenomic pipeline, only keeping loci with >70% species
AHE_2024_v8.3_70.nex - Partition file with 1 partition per flanking/coding regions (see Fig. 3B in main text)
AHE_2024_v8.3.70.MFPMERGE* - Tree generated with MFP+MERGE model

    COI: Phylogenetic analyses on COI dataset

COI_2025.v1.fasta - fasta alignment of all cytochrome oxydase I samples cleaned from bad sequences
COI_2025.v1.noemptyseq* - COI tree of all samples, cleaned from empty sequences (corresponding to AHE specimen without COI)

    grafting: Phylogenomic analyses on extended dataset

# Phylogenetic analyses
constraintfile.txt - Backbone generated by AHE analyses
AHE_COI_2025.v1* - Grafted tree with all samples, launched without --allnni iqtree option to reduce computation time
AHE_COI_2025.v2.monosp.nex | .phylip - Nexus and phylip files resulting from manual removal of all but one representative of each species based on AHE_COI_2025.v1.grafting.noallnni and COI tree (for dentirostris and rubricatus), see tips highlighted in red in AHE_COI_2025.v1.grafting.noallnni.treefile.keepsp.pdf for samples kept in v2
AHE_COI_2025.v2* - Grafted tree with one sample per species

# Topology tests
topo1.newick - Same topology as AHE_COI_2025.v2bis.grafting.monosp.treefile
topo2* - Alternative topology assuming the monophyly of North American Smicronyx species
topo3* - Alternative topology assuming the monophyly of North American Smicronyx and Promecotarsus species
topotest.iqtree - results of the topology test showing no significative difference between the three tested topologies

04_ASE: Ancestral character state estimations are run with ASE_v3.R

topo1 = paraphyletic Asteraceae-feeding species
topo2 = monophyletic Asteraceae-feeding species
Both topologies are equally likely (cf ../03_TREES/grafting/topotest.iqtree)

ACE_HPPM_ER.topo1.pdf - ACE with ace() function on topo1
ACE_HPPM_ER.topo2.pdf - ACE with ace() function on topo2
ACE_HPPM_Likelihoods.csv - Ancestral state likelihoods at each node of ACE_HPPM_ER.topo2.pdf
ASE_pruned_topo1_collapsed_HPPM_ER.pdf - ACE with Phytools make.simmap etc. on topo1. NA's were removed and bootstraps below thresholds were left polytomic
ASE_pruned_topo2_collapsed_HPPM_ER.pdf - same on topo2
AHE_COI_2025_70coll.v2.grafting.monosp.contree - topo1 collapsed if bootstraps <70
AHE_COI_2025.v2.grafting.monosp.contree - topo1
host_plants.txt - table with host repertoire of each species (see Table S1)
topo2_70coll.rooted.contree - topo2 collapsed if bootstraps <70
topo2.rooted.contree - topo2

Raw data & sequences

Raw target capture data and assembled AHE have been deposited in GenBank (NCBI) under the accession PRJNA1244829.

COI sequences generated for the present study have been deposited on BOLD systems (pending).

Files

01_PREPROCESS.zip

Files (356.8 MB)

Name Size Download all
md5:12d1ee8b7cec153075b2458d76b1ebdc
299.2 MB Preview Download
md5:225baa31ec429b36b49e803c26aa2d61
442.4 kB Preview Download
md5:0897b02b42984fefb30948b4550c29cd
12.2 MB Preview Download
md5:dd4abbfb5bb3ee4b3b60a81e000aad91
260.4 kB Preview Download
md5:b000dd3c312c1e03d9fcf473eab9676e
29.7 MB Download
md5:b2304ac72db889d9d2e17f04a8d36128
3.1 MB Download
md5:db5b4d5bb05f2487bfb4f79a5976f997
11.1 MB Download
md5:50c66c558c8c484aeeff5505ebd48b69
716.1 kB Download
md5:ea087b0ef3d6322596b9d6f802544e66
58.2 kB Preview Download
md5:a50cc2155e19a6eedd2481d636109125
50.8 kB Preview Download
md5:beeb4804c8eab0c2bd5f63e795a53277
30.6 kB Download
md5:f75e3cb7b1403441d689132fc1b6f64b
38.8 kB Download

Additional details

Related works

Is part of
Preprint: 10.1101/2024.04.03.587887 (DOI)