Supplementary information for: Using networks to identify structure in phylogenetic tree sets
Creators
- 1. Louisiana State University
- 2. Florida State University
- 3. Xiamen University
- 4. North Carolina State University
- 5. University of Minnesota
Description
Modern phylogenomic studies produce large sets of trees that can represent variation in inferred phylogenies across genes, uncertainty in estimated phylogenies for a given gene, or both. Standard practice is to condense this variation down to a small set of point estimates or consensus trees in order to facilitate display and interpretation. However, doing so results in the loss of enormous amounts of information about the structure of the underlying tree set. Here, we propose new approaches to explore and detect structure in the tree set itself. These approaches rely on the well-developed mathematical foundations of community detection in networks and leverage two different network types. The first type uses nodes to represent trees and connects these nodes with edges whose weights are determined by the similarity (affinity) of the trees. The second type uses nodes to represent bipartitions and connects nodes with edges whose weights represent the covariance in bipartition presence/absence across trees in the set. These two network types carry information that is complementary, but not identical. A variety of methods may be applied to both networks in order to identify interesting community structure. These community detection approaches provide a rich view of the information contained in phylogenomic data sets and facilitate investigation into the forces driving inferred phylogenetic variation across genomes.
Notes
Files
Brown_etal_SystBiol_20_SuppInfo.pdf
Files
(1.0 MB)
Name | Size | Download all |
---|---|---|
md5:28558e4ef655c0251d38a0f1b3451571
|
1.0 MB | Preview Download |