Published May 26, 2023 | Version v1
Other Open

Inferring historical introgression with deep learning

  • 1. Peking University

Description

Resolving the phylogenetic relationships among taxa remains a challenge in the era of big data due to the presence of genetic admixture in a wide range of organisms. Rapidly developing sequencing technologies and statistical tests enable evolutionary relationships to be disentangled at a genome-wide level, yet many of these tests are computationally intensive and rely on phased genotypes, large sample sizes, restricted phylogenetic topologies, or hypothesis testing. To overcome these difficulties, we developed a deep learning-based approach, named ERICA, for inferring genome-wide evolutionary relationships and local introgressed regions from sequence data. ERICA accepts sequence alignments of both population genomic data and multiple genome assemblies, and efficiently identifies discordant genealogy patterns and exchanged regions across genomes when compared with other methods. We further tested ERICA using real population genomic data from Heliconius butterflies that have undergone adaptive radiation and frequent hybridization. Finally, we applied ERICA to characterize hybridization and introgression in wild and cultivated rice, revealing the important role of introgression in rice domestication and adaptation. Taken together, our findings demonstrate that ERICA provides an effective method for teasing apart evolutionary relationships using whole genome data, which can ultimately facilitate evolutionary studies on hybridization and introgression.

Notes

Funding provided by: National Natural Science Foundation of China
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100001809
Award Number: 32170420

Funding provided by: National Natural Science Foundation of China
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100001809
Award Number: 31871271

Files

Zhang_et_al_supp_info_4Mar2023.pdf

Files (30.6 MB)

Name Size Download all
md5:c15dc58ceb15f7f523d0c994969e7205
30.6 MB Preview Download

Additional details

Related works

Is derived from
10.5061/dryad.m905qfv6d (DOI)