Published November 27, 2025 | Version v1
Dataset Open

Clade Size Distributions Under the Coalescent Diversification Model

  • 1. EDMO icon Simon Fraser University

Description

Characterizing patterns of biological diversity is a central goal of evolutionary biology. This requires understanding expectations for clade size—the relationship between the number of species in a clade and its age. Such expectations are key for identifying diversity outliers (e.g., specious or depauperate clades in macroevolution, or unusually large or small transmission clusters in epidemiology) and for testing alternative hypotheses about diversification. Here, we develop a general method for deriving closed-form expressions for the (joint) distribution of clade sizes under a given diversification model. We apply this approach to the constant- and variable-size coalescent model as well as the Yule model. Our results reveal that while the coalescent and Yule models produce qualitatively similar clade size patterns, they exhibit quantitative differences. Leveraging the flexibility of the coalescent framework, we further examine how transmission cluster size distributions differ between rapidly and slowly growing epidemics, finding—counterintuitively—that slowly growing epidemics are more likely to generate large clusters, a pattern often attributed to increased transmission.  

Here we provide the Supplementary Mathematica file and accompanying PDF for the analysis of clade size.

Files

CoalescentYule_11_21.pdf

Files (7.0 MB)

Name Size Download all
md5:25f6e7a70777a01e785803b851d82c54
6.4 MB Download
md5:3a5df45d44d50852c62398282006d543
538.3 kB Preview Download

Additional details

Funding

Natural Sciences and Engineering Research Council
CRC-2021-00276
Natural Sciences and Engineering Research Council
RGPIN-2022-03113
Natural Sciences and Engineering Research Council
RGPIN-2019-06624

Dates

Submitted
2025-11