Published November 12, 2024 | Version v1
Software Open

Population structure and demographic analyses of Acanthocybium solandri from the Indo-Pacific and Atlantic oceans

Authors/Creators

  • 1. University of Melbourne

Description

This repository contains scripts, data and results for a populaton genomics study of genetic structure and demography of wahoo, Acanthocybium solandri, published in Journal of Biogeography:

Haro-Bilbao et al. (2021) Global connections with some genomic differentiation occur between Indo-Pacific and Atlantic Ocean wahoo, a large circumtropical pelagic fish. Journal of Biogeography. doi.org/10.1111/jbi.14135

In this work, we generated population allele frequencies for wahoo sampled at 11 locations around the globe using a pooled ezRAD approach. Using thousands of genome-wide SNPs, we demonstrated a significant (but subtle) genetic divide between wahoo from the Indo-Pacific and those from the Atlantic. This genetic differentiation likely occurs against a background of high gene glow throughout the evolutionary history of wahoo, as we inferred from demographic analysis of select population pairs within and between oceanic regions. Analyses contained in this repository are for: (1) Filtering pooled ezRAD allele counts (assembled with dDocent and imputed using poolne_estim); (2) Estimation of genetic differentiation among globally sampled wahoo populations; (3) Estimation of site frequency spectra from joint allele frequencies among select population pairs; (4) Inference of demographic parameters (using δaδi); and (5) Generations of demographic simulation summary statistics. Most of the analyses are performed in R and can be run directly from within the repository directory, this includes: allele filtering, estimation of genetic differentiation, estimaiton of site frequency spectra, and generation of demographic summary statistics. Demographic inference using δaδi requires setup of a Unix environment: input data files and execution scripts are provided, but their implementation needs to be customised.

Notes

All R code can be run from within the respository directory using the R project file, Wahoo_PROJ.Rproj.

Demographic analyses using δaδi must be run in a Unix environment. The scripts Wahoo_DADI_Demog_Models.py and Wahoo_DADI_Generic_Execute.py can be used to set up a pipeline for executing demographic simulations in a local system or on an HPC cluster.

Methods

Allele frequency data was obtained through a pooled ezRAD approach. De novo assembly of RAD contigs and variant calling was performed using the dDocent pipeline. Population allele frequencies were imputed using poolne_estim. Additional quality filtering was performed in R. Analysis of genetic differentiation was performed in R, which include: estimates of FST and AMOVA (analysis of molecular variance). Generation of site frequency spectra and summary of demographic analyses was performed in R. Demographic inference was performed using δaδi, originally on an HPC. 

Files

Files (38.0 kB)

Name Size Download all
md5:d76091a07881d90d1f9a5fe73228f1c5
9.8 kB Download
md5:67195f121c795be2aa6953c65185b58f
6.4 kB Download
md5:aeac7fbd3707ea0abea4e5870f551f0a
4.1 kB Download
md5:327c18f977b0ae05108c7c66c94532f2
5.0 kB Download
md5:877f04a7d53c29878024ef2bcdc2e501
4.9 kB Download
md5:0a4952ae8de80c624191ff07209560c5
2.4 kB Download
md5:62947463a90442419410ca939f9e174b
5.3 kB Download

Additional details

Related works

Is source of
10.5061/dryad.dncjsxkz4 (DOI)