Published December 2, 2020 | Version v1
Dataset Open

Supplementary information for: Using networks to identify structure in phylogenetic tree sets

  • 1. Louisiana State University
  • 2. Florida State University
  • 3. Xiamen University
  • 4. North Carolina State University
  • 5. University of Minnesota

Description

Modern phylogenomic studies produce large sets of trees that can represent variation in inferred phylogenies across genes, uncertainty in estimated phylogenies for a given gene, or both. Standard practice is to condense this variation down to a small set of point estimates or consensus trees in order to facilitate display and interpretation. However, doing so results in the loss of enormous amounts of information about the structure of the underlying tree set. Here, we propose new approaches to explore and detect structure in the tree set itself. These approaches rely on the well-developed mathematical foundations of community detection in networks and leverage two different network types. The first type uses nodes to represent trees and connects these nodes with edges whose weights are determined by the similarity (affinity) of the trees. The second type uses nodes to represent bipartitions and connects nodes with edges whose weights represent the covariance in bipartition presence/absence across trees in the set. These two network types carry information that is complementary, but not identical. A variety of methods may be applied to both networks in order to identify interesting community structure. These community detection approaches provide a rich view of the information contained in phylogenomic data sets and facilitate investigation into the forces driving inferred phylogenetic variation across genomes.
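As a rough illustration of the two network types described above, the following Python sketch builds both from a toy tree set. It is not the authors' implementation. The names `TREES`, `tree_affinity_network`, and `bipartition_covariance_network` are hypothetical. The affinity measure (one minus a normalized Robinson-Foulds distance) and the use of population covariance are assumptions chosen for simplicity, since the description only specifies "similarity (affinity)" and "covariance". Each tree is represented as the set of its non-trivial bipartitions, and each bipartition as the set of taxa on the side that excludes a reference taxon (here, E).

```python
from itertools import combinations

# Toy tree set on taxa A-E (hypothetical data, for illustration only).
# A tree is the set of its non-trivial bipartitions; a bipartition is the
# frozenset of taxa on the side that does NOT contain reference taxon E.
AB, CD, AC, BD, ABC = (frozenset(s) for s in ("AB", "CD", "AC", "BD", "ABC"))
TREES = [{AB, CD}, {AB, CD}, {AC, BD}, {AB, ABC}]


def tree_affinity_network(trees):
    """Nodes are tree indices; edge weight is 1 - normalized RF distance.

    Assumption: affinity is derived from the Robinson-Foulds distance,
    i.e. the size of the symmetric difference of the bipartition sets.
    """
    edges = {}
    for i, j in combinations(range(len(trees)), 2):
        max_rf = len(trees[i]) + len(trees[j])
        rf = len(trees[i] ^ trees[j])
        edges[(i, j)] = 1.0 - rf / max_rf if max_rf else 1.0
    return edges


def bipartition_covariance_network(trees):
    """Nodes are bipartitions; edge weight is the covariance of their
    presence/absence (1/0) indicators across the tree set.

    Assumption: population covariance (divide by n, not n - 1).
    """
    n = len(trees)
    biparts = sorted(set().union(*trees), key=sorted)
    presence = {b: [1 if b in t else 0 for t in trees] for b in biparts}
    mean = {b: sum(v) / n for b, v in presence.items()}
    edges = {}
    for a, b in combinations(biparts, 2):
        products = (
            (x - mean[a]) * (y - mean[b])
            for x, y in zip(presence[a], presence[b])
        )
        edges[(a, b)] = sum(products) / n
    return edges


affinity = tree_affinity_network(TREES)
covariance = bipartition_covariance_network(TREES)
```

In this toy set, identical trees receive the maximum affinity, while bipartitions that cannot occur in the same tree (such as AB and AC) receive negative covariance. Either edge dictionary could then be passed to a community detection routine of the reader's choice.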

Notes

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: DBI-1262571

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: DBI-1934156

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: DBI-1262476

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: DBI-1934182

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: DBI-1934157

Files

Brown_etal_SystBiol_20_SuppInfo.pdf

Size: 1.0 MB
md5:28558e4ef655c0251d38a0f1b3451571