Published November 3, 2020 | Version v1.0
Dataset Open

Genome-wide characterization of human minisatellite VNTRs: population-specific alleles and gene expression differences

Contributors

Contact person:

  • 1. Benson

Description

This repository consists of minisatellite VNTR genotypes for 2,800 samples (2,770 individuals). The raw VCF files were produced using VNTRseek on xxx data sources: 30 high coverage WGS datasets from the 1000 Genomes Project phase 3, 2,504 unrelated genomes from New York Genome Center (NYGC), 253 genomes from Simons Diversity Genome Project (SGDP), two tumor-normal breast cancer samples from Illumina Basespace, haploid genomes CHM1 and CHM13, and seven genomes from the Personal Genome Project from the Genome In A Bottle Consortium (GIAB). Raw VCF files are provided for each data source separately.

The raw VCF files were preprocessed (preprocess.sh) to extract genotypes and provided in VNTRseek_preprocessed_data.tar.gz (uncompressed size 10G). The R Markdown code to analyze the preprocessed data and produce figures and tables is also provided (tables_and_figures.Rmd). For more information see the ReadMe file.

This work was supported in part by NSF grants IIS-1423022 and DBI-1559829.

Files

README.md

Files (23.0 GB)

Name Size Download all
md5:b95a875853db9149fdaa11efe8b89d01
238.9 MB Download
md5:2f55fe4545e146d9740d815d2a4a473d
25.7 MB Download
md5:96cdc5c1bd11094028d0bdb47aa1ce40
1.8 kB Download
md5:b005acde19ce6c986f77767302b0f4b0
19.9 MB Download
md5:b6d11df07079d3edbf19421078fa8224
13.2 MB Download
md5:b93debc3157cb50ec4ec0f7e36ea7460
55.2 MB Download
md5:740b65b25dd83999353a074c16ea82d4
15.3 MB Download
md5:600db45b436ace8375997a221ce255fb
1.8 GB Download
md5:cc426f8c17830e694ba7ada91c645ae6
1.8 GB Download
md5:03a28f56d457859aea8f59d527eaad49
1.8 GB Download
md5:ab5e8675212910f26a90888533bb69aa
1.8 GB Download
md5:b676eaa2c715e08bfa847c2e983127e0
1.8 GB Download
md5:97fddd9fec526f8c0e931a6c59876d62
1.8 GB Download
md5:da4c7b620bff85bf7626cb1727ff7298
1.8 GB Download
md5:a80ab4d9aad4f1fa7de115b7a1790c98
1.8 GB Download
md5:4f5a63474910904b1b957aa39f957467
1.8 GB Download
md5:de25319bb1b57bb1f591a9e6f1a8d2dc
1.8 GB Download
md5:d3578e06be2522b982adfe23c73221e2
2.0 kB Download
md5:df43e9eae9de6afd89f6326c114d9852
1.3 kB Preview Download
md5:12461623fca566d95c15c50c25b5cc17
1.3 GB Download
md5:5d17c96ae234d05b868d8d4e1218ed73
77.2 kB Preview Download
md5:e34994c50ee4adec93f78e93f0628692
1.6 GB Download
md5:8470e3fb8fdc7844cb27c844743236b4
80.2 kB Download
md5:bd63db99244abee1ba9cfede2f1b81d9
1.3 GB Download