There is a newer version of the record available.

Published March 17, 2025 | Version v2
Peer review Open

The MUC19 Gene: An Evolutionary History of Recurrent Introgression and Natural Selection

  • 1. ROR icon University of Colorado Boulder
  • 2. EDMO icon Brown University
  • 3. ROR icon University of Washington
  • 4. ROR icon University of Copenhagen
  • 5. ROR icon Clemson University
  • 6. EDMO icon University of Padova, Department of Biology
  • 7. ROR icon University of Turin
  • 8. EDMO icon University of California, Berkeley
  • 9. CNRS Délégation Ile-de-France Sud
  • 10. ROR icon Universidad Nacional Autónoma de México

Description

We study the gene MUC19, for which some modern humans carry a Denisovan-like haplotype. MUC19 is a mucin, a glycoprotein that forms gels with various biological functions. We find diagnostic variants for the Denisovan-like MUC19 haplotype at high frequencies in admixed Latin American individuals, and at highest frequency in 23 ancient Indigenous American individuals, all predating population admixture with Europeans and Africans. We find that the Denisovan-like MUC19 haplotype is under positive selection and carries a higher copy number of a 30 base-pair variable number tandem repeat, and that copy numbers of this repeat are exceedingly high in American populations. Finally, some Neanderthals carry the Denisovan-like MUC19 haplotype, and that it was likely introgressed into human populations through Neanderthal introgression rather than Denisovan introgression.

All scripts for the entire project can be found here: https://github.com/David-Peede/MUC19/tree/main

Files

  • ZENODO_README.md: Description of datasets.
  • papuan_phased_hmmmix_haploid_tracts_per_ind.tar.gz: Unfiltered, decoded, and annotated hmmix autosomal tracts for all Papuans.
  • tgp_hmmmix_haploid_tracts_per_ind.tar.gz: Unfiltered, decoded, and annotated hmmix autosomal tracts for the 1KG individuals.
  • den_tracts_in_pap.tar.gz: Coordinates for the putative Denisovan tracts for the Papuan individuals.
  • tgp_hmmix_haplotype_tracts_chr12.csv.gz: Pre-processed Chr12 archaic tracts for the 1KG individuals.
  • tgp_hmmix_haplotype_tracts_muc19.csv.gz: Pre-processed archaic tracts overlapping the MUC19 region for the 1KG individuals.
  • papuan_hmmix_haplotype_tracts.csv.gz: Pre-processed autosomal archaic tracts for the Papuan individuals.
  • dataset_1.csv.gz: Sequence divergence results between the haplotypes in the 1KG and the four high-coverage archaic individuals at the focal 72kb MUC19 region.
  • dataset_2.csv.gz: Sequence divergence results between the haplotypes in the 1KG and the four high-coverage archaic individuals at the focal 742kb MUC19 region.
  • dataset_3.csv.gz: Sequence divergence results between the haplotypes in the 1KG and the phased late Neanderthal haplotypes at the focal 72kb MUC19 region.

 

Files

ZENODO_README.md

Files (169.3 MB)

Name Size Download all
md5:93353b06c82c3d6de6e40aef679dcf78
534.8 kB Download
md5:a763d745afba7f65a466a451a6ff26ca
567.5 kB Download
md5:aadf9c227d2607e70c5b470d8047d458
529.2 kB Download
md5:2b27f3ce906f094ea4e04ef6e91de899
46.3 kB Download
md5:0a87403b475796b890099fd88070afb2
784.2 kB Download
md5:5e0548388604cf95dd09954acdc19611
2.1 MB Download
md5:ab002f6ff715f9e2bb1fbd387265bbbd
3.3 MB Download
md5:2f2f41800667dd11517831992524e6d9
8.2 kB Download
md5:6db2473c9206e820f3675a53da53e5f3
161.5 MB Download
md5:4a8c078f1bc5e55e7108572739cae89f
15.0 kB Preview Download

Additional details