Published October 26, 2020 | Version v1
Dataset Open

Dsuite - fast D-statistics and related admixture evidence from VCF files

  • 1. University of Basel
  • 2. University of Zurich
  • 3. University of Antwerp

Description

Patterson's D, also known as the ABBA-BABA statistic, and related statistics such as the f4-ratio are commonly used to assess evidence of gene flow between populations or closely related species. Currently available implementations often require custom file formats, implement only small subsets of the available statistics, and are too computationally inefficient to evaluate all gene flow hypotheses across datasets with many populations or species. Here we present Dsuite, a new software package that efficiently computes the D and f4-ratio statistics at genome scale across all combinations of tens or hundreds of populations or species, directly from a variant call format (VCF) file. The program also implements statistics suited to genomic windows, which provide evidence of whether introgression is confined to specific loci, and it aids interpretation of a system of f4-ratio results through the 'f-branch' method. Dsuite is available at https://github.com/millanek/Dsuite. It is straightforward to use, substantially more computationally efficient than comparable programs, and provides a convenient suite of tools and statistics, including some not previously available in any software package. Dsuite thus facilitates the assessment of evidence for gene flow, especially across larger genomic datasets.
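As a rough illustration of the statistic described above, the following Python sketch computes Patterson's D for one trio of populations (P1, P2, P3) plus an outgroup from per-site derived-allele frequencies, using the standard frequency-based ABBA and BABA counts. It is not Dsuite's implementation: the function name `patterson_d` and the input layout are assumptions made for this example, and it omits VCF parsing, the handling of sites where the outgroup is polymorphic, and the significance testing that a full analysis needs.

```python
from typing import Iterable, Tuple

# One site: derived-allele frequencies in P1, P2, P3 and the outgroup.
Site = Tuple[float, float, float, float]


def patterson_d(sites: Iterable[Site]) -> float:
    """Return Patterson's D for the trio (P1, P2, P3) with an outgroup.

    Hypothetical helper for illustration only, not part of Dsuite.
    """
    abba = 0.0  # support for P2 and P3 sharing the derived allele
    baba = 0.0  # support for P1 and P3 sharing the derived allele
    for p1, p2, p3, p_out in sites:
        abba += (1.0 - p1) * p2 * p3 * (1.0 - p_out)
        baba += p1 * (1.0 - p2) * p3 * (1.0 - p_out)
    total = abba + baba
    if total == 0.0:
        raise ValueError("no informative sites: ABBA + BABA is zero")
    return (abba - baba) / total


if __name__ == "__main__":
    # Made-up frequencies, purely to show the call.
    example_sites = [
        (0.2, 0.6, 0.5, 0.0),
        (0.1, 0.1, 0.9, 0.0),
    ]
    print(patterson_d(example_sites))
```

Under this convention, a D value near zero is consistent with no excess allele sharing, a positive value indicates excess sharing between P2 and P3, and a negative value indicates excess sharing between P1 and P3.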

Notes

Funding provided by: EMBO
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100003043
Award Number: ALTF 456-2016

Funding provided by: Norges Forskningsråd
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100005416
Award Number: 275869

Funding provided by: Swiss National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100001711
Award Number: 176039

Files

DRYAD_README.txt

Files (91.0 MB)

MD5 checksum                      Size
399331ad14b6b10b955a7a7c82e3f9c4  940 Bytes
9a0843f9ab83d19c47bf33b5c5423934  90.7 kB
3635f08856097603972e3dc0381cbb4f  6.4 kB
fed3ca4c8d984417ee02de535f98e644  49.8 MB
692c1ed71cff13dabc3d1e835c5c3080  4.8 kB
c11ad0cb8444456515812261bac76a03  980 Bytes
5ec72a568665d32c3b98f15387c6629a  9.7 MB
d60a7bc8470e6230a0904c0e7e7e16e2  31.3 MB
a09859c4ad49c68da6e7ebdd8463a51f  2.8 kB
a0545c583dce1a792f2197ad5f9ccd32  555 Bytes
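
The MD5 checksums listed above can be used to confirm that a download is intact. A minimal Python sketch follows, assuming the file has already been saved locally. The helper name `md5_of_file` and the path `downloaded_file` are placeholders for this example, since the listing does not pair checksums with file names.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of the file at `path`.

    The file is read in chunks so that large archives need not fit in memory.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path: str, expected_md5: str) -> bool:
    """Compare a file's MD5 digest with an expected value, ignoring case."""
    return md5_of_file(path) == expected_md5.strip().lower()
```

For example, `matches_checksum("downloaded_file", "399331ad14b6b10b955a7a7c82e3f9c4")` would return `True` only if the local file is the one that this checksum belongs to.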

Additional details

Related works