Supporting Data for: Ocean-wide conservation genomics of blue whales suggest new Northern Hemisphere subspecies
Authors/Creators
- 1. University of Münster
- 2. Senckenberg Society for Nature Research
- 3. LOEWE Centre for Translational Biodiversity Genomics
Description
The blue whale is an endangered and globally distributed species of baleen whale with multiple described subspecies assignments, including the morphologically and molecularly distinct pygmy blue whale, among others. North Atlantic and North Pacific populations, however, are currently regarded as a single subspecies despite being separated by continental land masses and differences in their acoustic communication. To determine the degree of isolation among the Northern Hemisphere populations, fourteen North Pacific and six Western Australian blue whale nuclear and mitochondrial genomes were sequenced and analyzed combinedly with eleven publicly available North Atlantic blue whale genomes. This allowed to contrast the genetic differentiation and genetic exchange among Northern Hemisphere populations to the Western Australian pygmy blue whale subspecies. Population genomic analyses revealed distinctly differentiated clusters and limited exchange among all three populations, indicating a high degree of isolation. Nevertheless, the genomic and mitogenomic distances between all blue whale populations, including the Western Australian pygmy blue whale, are low when compared to other inter-subspecies distances in cetaceans. Given that the pygmy blue whale is an already recognized subspecies and further supported by previously reported acoustic differences, a proposal is made to treat the two Northern Hemisphere populations equally as subspecies, namely Balaenoptera musculus musculus (North Atlantic blue whale) and Balaenoptera musculus sulfureus (North Pacific blue whale). Furthermore, a first molecular vitality assessment of all three populations found a generally high genomic diversity among blue whales but also a lack of rare alleles, non-neutral evolution and substantially increased effects of inbreeding. This suggests a substantial anthropogenic impact on the genotypes of blue whales and calls for careful monitoring in future conservation plans.
Notes
Methods
We sequenced the genomes from 20 blue whale specimens gathered from the North Pacific blue whale and West-Australian pygmy blue whale populations and analyze the data together with 12 publicly available genomes of other blue whales, including those of the North-Atlantic blue whales (Jossey et al., 2024). This sampling is further complemented by three genomes of the closely related sei whale (Balaenoptera borealis), of which one is sequenced in this study.
All Illumina paired-end libraries were prepared by Novogene, Cambridge, United Kingdom using the NEBNEXT DNA LIBRARY PREP kit with a read length of 150 base pairs (bp) and an insert size of 350 bp. Illumina sequencing was performed on a NovaSeq 6000 platform targeting ~20x coverage per individual.
A comprehensive pipeline used to process the data and perform many of the here presented downstream analyses can be found on GitHub: mag-wolf/RESEQ-to-Popanalyses/.
Short read data were trimmed for quality and adapter sequences using FASTP V0.23.2 (Chen, Zhou, Chen, & Gu, 2018) with the options "-g -3 -l 40 -y -c -p". Trimmed reads were mapped to a repeat-masked, high-quality blue whale reference genome, constructed by the authors of the Vertebrate Genome Project (Bukhman et al., 2024). Mapping was performed using BWA MEM v0.7.17-r1188 (http://bio-bwa.sourceforge.net) and SAMtools v1.9 sort (Danecek et al., 2021) using default settings. Potential duplicates were removed and read-groups were added using the PICARD v2.21.2-0 toolkit (https://broadinstitute.github.io/picard/). Genotype-calling of autosomes, sex-chromosomes and mitochondrial genomes was done on all individuals combined and on each individual independently using BCFTOOLS v1.12 MPILEUP and BCFTOOLS v1.12 CALL (Danecek et al., 2021) with the respective "-m" or "-c" flag and minimum mapping- and base-quality cutoffs of 20 and 13, respectively. For the sex-chromosome and mitochondrial genome data, BCFTOOLS CALL was run with the "--ploidy 1" flag to account for the haploidy. All inferred sites were further filtered by excluding sites with divergent read coverage (>3-fold and <0.3-fold of the expected individual mean coverage) and sites with more than 25% missing data using the BCFtools filter function. The combined genotype set was further processed by removing multivariate and monomorphic sites to retrieve single nucleotide polymorphisms (SNPs) with VCFTOOLS v0.1.16 (Danecek et al., 2011) and by thinning SNPs to account for the effects of linkage disequilibrium using BCFTOOLS THINN function with a window size of 1000 bp. To receive variances of the mitochondrial control region (mtMarker), we extracted respective 403 bp long area from the mitogenomic vcf file using BCFTOOLS VIEW "-r" (See Rosel, et al., 1994).
References:
Jossey S, Haddrath O, Loureiro L, et al. (10 co-authors). 2024. Population structure and history of North Atlantic Blue whales (Balaenoptera musculus musculus) inferred from whole genome sequence analysis. Conserv Genet. 25(2):357–371.
Chen, S., Zhou, Y., Chen, Y., & Gu, J. (2018). fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics (Oxford, England), 34(17), i884-i890. https://doi.org/10.1093/bioinformatics/bty560
Bukhman YV, Morin PA, Meyer S, et al. (33 co-authors). 2024. A High-Quality Blue Whale Genome, Segmental Duplications, and Historical Demography. Molecular biology and evolution. 41(3).
Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., . . . Durbin, R. (2011). The variant call format and VCFtools. Bioinformatics (Oxford, England), 27(15), 2156–2158. https://doi.org/10.1093/bioinformatics/btr330
Danecek, P., Bonfield, J. K., Liddle, J., Marshall, J., Ohan, V., Pollard, M. O., . . . Li, H. (2021). Twelve years of SAMtools and BCFtools. GigaScience, 10(2). https://doi.org/10.1093/gigascience/giab008
Rosel, P. E., Dizon, A. E., & Heyning, J. E. (1994). Genetic analysis of sympatric morphotypes of common dolphins (genus Delphinus). Marine Biology, 119(2), 159–167. https://doi.org/10.1007/BF00349552
Files
Files
(1.2 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:ef9101043c5595e3887e8604dadb222b
|
1.2 MB | Download |
Additional details
Related works
- Is source of
- 10.5061/dryad.47d7wm3jz (DOI)