Published April 13, 2026 | Version v1
Dataset Open

HipSTR-derived genome-wide STR genotypes from HGDP and SGDP high-coverage WGS data

Description

This dataset contains genome-wide Short Tandem Repeat (STR) genotypes derived from high-coverage whole-genome sequencing (WGS) data from the Human Genome Diversity Project (HGDP) and the Simons Genome Diversity Project (SGDP). The STR genotype calling was performed using the HipSTR algorithm on CRAM/BAM files retrieved from the International Genome Sample Resource (IGSR). Full details of the bioinformatic pipeline and quality control measures are described in the associated manuscript (see Related Identifiers).
 
Data Content
  • Format: VCF
  • Samples: 828 individuals from HGDP and 276 individuals from SGDP.
  • Reference Genome:  GRCh38 

Files

Files (10.7 GB)

Name Size Download all
md5:8c385343612ee35fab497d0898554dc1
382.7 MB Download
md5:f038165124ac3fe000f98eabab50a9fd
93.1 kB Download
md5:af54d0ade5848a68a63b5427099b4ce5
139.5 MB Download
md5:1c600726e1dbac706140fe6e349e2272
99.0 kB Download
md5:c344bae5b6c5bac663999c09e144604e
367.4 MB Download
md5:cec9543337f1c53759dc37c8927c7426
94.0 kB Download
md5:b1a47dddcfce38ec06e1385fb4ab39ab
134.3 MB Download
md5:bdabad38fce9aae74fd266d892595203
100.4 kB Download
md5:e1cb8b7d57a5a608d40da858f143e425
410.7 MB Download
md5:a77c8d5d63928dd1a4b87cf3329b5251
96.1 kB Download
md5:c285ad4d78bbf6d1e5135cefbacf80fa
148.9 MB Download
md5:22af7c0c9e85c27469dba126a07a5470
102.2 kB Download
md5:f4febd128228b0d544b24aa146645c0f
263.4 MB Download
md5:209e11518b306588acd6e1857b525f52
70.0 kB Download
md5:35c98c71e31a05cea450913481c4ff22
99.8 MB Download
md5:4ba2e946dea02b08074a8784bf71303e
75.2 kB Download
md5:e7fcd8fb8f2064ff08f2241f213fe0f7
262.2 MB Download
md5:b1563a3fa057feb3b3cd195664c538fe
64.8 kB Download
md5:c371265274e2934e9453ec52809a74bd
96.3 MB Download
md5:a0454e111921c2683ca37ae900123252
69.4 kB Download
md5:1fc5ceb5f4c3749d4669e9a7488518de
230.7 MB Download
md5:489e1285c015994098b101cc84894249
56.0 kB Download
md5:70d31111fa0ed282de1f147a09efa8a0
83.7 MB Download
md5:e92e80c0eefe3f460853b5a8148b006d
58.8 kB Download
md5:e48a63eb5a4e4a7ad87af92ddb85914e
256.7 MB Download
md5:fd35e122c00c437722136b333d66999f
54.7 kB Download
md5:bfd94c9f74deff0c8c315df7de8865e4
89.7 MB Download
md5:ff92d40e5c071bef93d0b9ef94510a4c
57.5 kB Download
md5:1d71c00a62549e974a9d02224854b0f7
272.9 MB Download
md5:0ac8f0921ec495a69573cc649cb6cfcc
56.5 kB Download
md5:3ec8eeb208984d1ab9f7568382e12611
95.4 MB Download
md5:6afa01883eb75690f5cba4463f3de300
58.7 kB Download
md5:2afdbcab32d79b837b0597e584283bf5
214.9 MB Download
md5:ef179a644e746be2026cbba76b026a13
54.7 kB Download
md5:23b28f5ba86524b395da6e2b6bb257ac
80.3 MB Download
md5:b5c5a68a1fa046943a8cc24af8d8ad1e
57.9 kB Download
md5:b471c74a9d5394f5c0e018baa2962d6b
238.2 MB Download
md5:e28d704662dcea9f4f6f1f9369fa463b
41.4 kB Download
md5:86ab202f22d6e1ba163cfa94dd72d397
80.2 MB Download
md5:c10cf1530b666e6a624d9e017008b9ca
43.0 kB Download
md5:006d122b69a6d7dfe14e327887d7d1b6
669.4 MB Download
md5:d87b6f0f4d371132040ee90f4b96666c
161.2 kB Download
md5:b620ca0b856bc4f65f2e211dbbac07c3
242.6 MB Download
md5:5c64ceb39ade9dd24fceb08ed7c4c8cd
171.7 kB Download
md5:8e67967dd2cc3818dc1039357c358f6b
188.9 MB Download
md5:de3f6199309dbb407568084b355ac06e
44.2 kB Download
md5:4a4634355d5ccefd28d7ed0240133579
66.4 MB Download
md5:6274590751dd9739b69c88c861544780
46.2 kB Download
md5:1a82c15042a2ab33c259e77beee338e0
96.0 MB Download
md5:b568306b9bd46605a8409d8918f7e1f4
23.7 kB Download
md5:d8f83fcba8bff824c13f7f54bdd180a2
35.2 MB Download
md5:ab711863dab1af41a1360687c1b5a0ee
24.7 kB Download
md5:890c4c8b2d6cb5019db6f693cd486515
108.9 MB Download
md5:bebc219f28ecbb5255c6156ace6addea
24.6 kB Download
md5:1f33db3202ead41931c4d76dabc8d289
38.2 MB Download
md5:207b4433603c18fea03b2fe900c2d73e
25.6 kB Download
md5:c15df09b2551b0b4e4a797837eff6d0b
661.0 MB Download
md5:aa927f7ccd91dcd7917033b1940e8155
170.4 kB Download
md5:cd14a13e6f15a35dfdb3511a8c5ee8af
245.9 MB Download
md5:a3f4f6dcf96aa9cabf55cd73811b1e43
183.0 kB Download
md5:5eb5847198c9bd6e3ddb10a58bc03cb6
556.3 MB Download
md5:f029bb748adad1f888ba5367e29ce0f3
143.1 kB Download
md5:1bbe700a8a4a14f909cb7c3d639c6021
206.2 MB Download
md5:90ca06cad30f060a9d416e493c3b3e5c
153.1 kB Download
md5:6a1824839b64d171529843fca7c77e15
514.6 MB Download
md5:4d329c46392d8f514db164bfdd91832a
136.1 kB Download
md5:f23a6a715782059752276574fcbeb2fb
194.2 MB Download
md5:efe75441664840d3b9e3e4c2036d299f
146.6 kB Download
md5:7dbd439d59d3fb1129e2b3bb68a04282
489.8 MB Download
md5:dc8fd4866d04dd9880deae309806a798
128.2 kB Download
md5:fc8d88b45252b028e7cff6a16f41a9e0
182.7 MB Download
md5:60ebf4c21a28ed4bf88ebedf8b0965d2
137.7 kB Download
md5:3c1c5e78e90be59645c10063ffc18b4a
487.3 MB Download
md5:5ab06a91271435bbd089546980371503
123.3 kB Download
md5:ed723d98e20dfc66af95acab7f0a17e9
178.5 MB Download
md5:d273fd8534c861a17eb34b21214d667f
132.0 kB Download
md5:7d818444cd0c07f8b5f94d2a899ea408
435.9 MB Download
md5:b0d31c4b00a749ca3dd546fc774f1312
109.3 kB Download
md5:d42c9b339e4b3d926badfa35115f1420
159.8 MB Download
md5:a6d0c2bf5fc455f046ac936eb9295ff8
116.3 kB Download
md5:3106f128d24a27dac7569ce9ff79ef4c
409.4 MB Download
md5:4a10521c17313380a0805013b7f14480
104.3 kB Download
md5:6b073573fa7d32b76b0ce748725a2c99
149.8 MB Download
md5:4479705aa587f4962477262ad0cbe0ea
110.9 kB Download
md5:15bdf41eeff726985eedd27394fae585
327.8 MB Download
md5:b38641927d0ad30c8654836d85e35c02
80.8 kB Download
md5:26ebc20dbfa2d1a9bce167ddee694fd9
118.4 MB Download
md5:b433f666cd4bb5e75eefef1907283c38
86.0 kB Download

Additional details

Related works

Is described by
Preprint: 10.64898/2026.02.20.707006 (DOI)

References

  • Bergström, A. et al. Insights into human genetic variation and population history from 929 diverse genomes. Science 367, eaay5012 (2020).
  • Mallick, S. et al. The Simons Genome Diversity Project: 300 genomes from 142 diverse populations. Nature 538, 201–206 (2016).
  • Fairley, S., Lowy-Gallego, E., Perry, E. & Flicek, P. The International Genome Sample Resource (IGSR) collection of open human genomic variation resources. Nucleic Acids Research 48, D941–D947 (2020).
  • Willems, T. et al. Genome-wide profiling of heritable and de novo STR variations. Nature methods 14, 590–592 (2017).