Published February 12, 2024 | Version v1
Other Open

Data from: The fundamental role of character coding in Bayesian morphological phylogenetics

  • 1. Southeastern Louisiana University
  • 2. Arizona State University
  • 3. University of Bristol
  • 4. Ludwig Maximilian University of Munich

Description

Phylogenetic trees establish a historical context for the study of organismal form and function. Most phylogenetic trees are estimated using a model of evolution. For molecular data, modeling evolution is often based on biochemical observations about changes between character states. For example, there are four nucleotides, and we can make assumptions about the probability of transitions between them. By contrast, for morphological characters, we may not know a priori how many character states there are per character, as both extant sampling and the fossil record may be highly incomplete, which leads to an observer bias. For a given character, the state space may be larger than what has been observed in the sample of taxa collected by the researcher. In this case, how many evolutionary rates are needed to even describe transitions between morphological character states may not be clear, potentially leading to model misspecification. To explore the impact of this model misspecification, we simulated character data with varying numbers of character states per character. We then used the data to estimate phylogenetic trees using models of evolution with the correct number of character states and an incorrect number of character states. The results of this study indicate that this observer bias may lead to phylogenetic error, particularly in the branch lengths of trees. If the state space is wrongly assumed to be too large, then we underestimate the branch lengths, and the opposite occurs when the state space is wrongly assumed to be too small.

Notes

Funding provided by: National Science Foundation
ROR ID: https://ror.org/021nxhr62
Award Number: DEB 2045842

Funding provided by: National Science Foundation
ROR ID: https://ror.org/021nxhr62
Award Number: DBI 2113425

Funding provided by: Deutsche Forschungsgemeinschaft
ROR ID: https://ror.org/018mejw64
Award Number: HO 6201/1-1

Funding provided by: European Research Council
ROR ID: https://ror.org/0472cxd90
Award Number: GA 101043187

Methods

The datasets are simulated under Mk model. 

In this study, we examine the effectiveness of partitioning by state during a Bayesian morphological phylogenetic analysis. 

So, the datasets that are simulated are analysed under partitioning by state and unpartitioned models. 

Files

Q_Matrix_Size.pdf

Files (136.4 kB)

Name Size Download all
md5:5ee59342ef59bb92024cbebb703ce1c6
136.4 kB Preview Download

Additional details

Related works

Is cited by
10.1093/sysbio/syae033 (DOI)
Is derived from
10.5061/dryad.p2ngf1vvp (DOI)