Published January 31, 2019 | Version v1
Dataset Open

Data from: Allele phasing has minimal impact on phylogenetic reconstruction from targeted nuclear gene sequences in a case study of Artocarpus

  • 1. University of Florida
  • 2. Texas Tech University
  • 3. Northwestern University
  • 4. Department of Plant Sciences; Chicago Botanic Garden; 1000 Lake Cook Road Glencoe IL 60022 USA*

Description

Premise of the study: Untapped information about allelic diversity within populations and individuals (i.e., heterozygosity) could improve phylogenetic resolution and accuracy. Many phylogenetic reconstructions ignore heterozygosity because it is difficult to assemble allele sequences and to combine allelic data across unlinked loci, and because it is unclear how reconstruction methods accommodate variable sequences. We review the common methods of including heterozygosity in phylogenetic studies and present a novel method for assembling allele sequences from target-enriched Illumina sequencing libraries.

Methods: We perform supermatrix phylogeny reconstruction and species tree estimation of Artocarpus based on three methods of accounting for heterozygous sequences: a consensus method based on de novo sequence assembly, the use of ambiguity characters, and a novel method for phasing alleles. We characterize the extent to which highly heterozygous sequences impede phylogeny reconstruction and determine whether the use of allele sequences improves resolution or decreases topological uncertainty.

Key Results: We show that it is possible to infer phased alleles from target-enriched Illumina libraries. We find that highly heterozygous sequences do not contribute disproportionately to poor phylogenetic resolution and that the use of allele sequences for phylogeny reconstruction does not have a clear effect on phylogenetic resolution or topological consistency.

Conclusions: We provide a framework for inferring phased alleles from target enrichment data and for assessing the contribution of allelic diversity to phylogenetic reconstruction. In our dataset, the impact of allele phasing on phylogeny is minimal compared to the impact of using phylogenetic reconstruction methods that account for gene tree incongruence.
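As an illustration of the ambiguity-character approach named in the Methods, the sketch below merges two aligned allele sequences into one sequence using standard IUPAC nucleotide codes. It is a minimal example and not the authors' pipeline; the function name `encode_heterozygous` and the choice to emit `N` for any pair it cannot encode (for example, a base paired with a gap) are assumptions made here.

```python
# Standard IUPAC codes for the six possible two-base combinations.
IUPAC_PAIRS = {
    frozenset("AG"): "R",
    frozenset("CT"): "Y",
    frozenset("CG"): "S",
    frozenset("AT"): "W",
    frozenset("GT"): "K",
    frozenset("AC"): "M",
}


def encode_heterozygous(allele_a: str, allele_b: str) -> str:
    """Collapse two aligned alleles into one sequence with ambiguity codes.

    Hypothetical helper: positions where the alleles agree are copied,
    heterozygous positions get the matching IUPAC code, and any other
    mismatch (gaps, non-ACGT symbols) becomes "N".
    """
    if len(allele_a) != len(allele_b):
        raise ValueError("alleles must be aligned to the same length")
    merged = []
    for a, b in zip(allele_a.upper(), allele_b.upper()):
        if a == b:
            merged.append(a)
        else:
            merged.append(IUPAC_PAIRS.get(frozenset((a, b)), "N"))
    return "".join(merged)
```

A phased analysis would instead keep `allele_a` and `allele_b` as separate sequences, which is the contrast the study evaluates.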

Notes

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: DEB-1342873

Files

artocarpus_alignments.zip

Files (19.3 MB)

Checksum Size
md5:17fe4576e12d964e8ab9b087b42bf0a6 7.0 MB
md5:f13209548944db71e573af027079d34f 624.7 kB
md5:f735e734abdbd37c7ff89dbe3817258a 992.4 kB
md5:8caddb9ab65f4741804da3aa98122355 10.6 MB
md5:7e06dda63e447f7911c4de1770c6b0b7 1.0 kB
md5:9d7a49dd26a594938578540e0116a935 10.7 kB
md5:a3293bfddd79fc8c35f656c24c649666 2.7 kB
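The MD5 checksums listed above can be used to confirm that a downloaded file is intact. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_checksum` are hypothetical, and the record does not state which checksum belongs to which file name, so the pairing must be taken from the repository page itself.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path: str, expected: str) -> bool:
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected_hex = expected.strip().lower()
    if expected_hex.startswith("md5:"):
        expected_hex = expected_hex[len("md5:"):]
    return md5_of_file(path) == expected_hex
```

For example, `matches_checksum(local_path, "md5:17fe4576e12d964e8ab9b087b42bf0a6")` would return `True` only if `local_path` (a placeholder for wherever the file was saved) holds an exact copy of the first listed file.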

Additional details

Related works

Is cited by
10.1002/ajb2.1068 (DOI)