Published April 17, 2026 | Version v1.1.0
Dataset Open

SMaHT HapMap Mixture Benchmark Set

  • 1. ROR icon Washington University in St. Louis
  • 2. Washington University in Saint Louis School of Medicine
  • 3. McDonnell Genome Institute Washington University School of Medicine
  • 4. Van Andel Institute
  • 5. Washington University School of Medicine in St. Louis

Description

This directory contains highly accurate somatic benchmarking single nucleotide variants (SNVs), small indels (indels), structural variations (SVs), mobile element insertions (MEIs) and mitochondrial variants(mito) for the HapMap cell line mixture. The HapMap mixture consists of 83.5% of HG005 (Chinese male), 10% of HG02622 (European/Ashkenazi Jewish male), 2% of HG002 (African female), 2% of HG02257 (African male), 2% of HG02486 (East Asian female), and 0.5% HG00438 (Chinese male). The vcf.gz and bed files are intended to be used together to benchmark the performance of somatic variant callers. The benchmarking region is defined as the intersection of reliable regions across the 12 haploid assemblies from the 6 HapMap samples, encompassing 83.9% of GRCh38 and 84.0% of CHM13, respectively. Each benchmark set was produced from a minigraph-cactus graph constructed using: 1) The selected reference assembly (GRCh38, T2T-CHM13, HG005 maternal/paternal assemblies, or a personalized HapMap mixture assembly) and 2) Twelve haploid assemblies from the six HapMap samples. The following paper(s) can be cited for use of the benchmark variant set: Kong, Nahyun, et al. "A Pangenomic Method for Establishing a Somatic Variant Detection Resource in HapMap Mixtures." bioRxiv (2025): 2025-09. 

Files

HapMapMixture_GRCh38.zip

Files (1.6 GB)

Name Size Download all
md5:592289f5a89b31dd1906a7c800673f24
295.9 MB Preview Download
md5:6415127baf0e766ba098d9a695fe087a
221.4 MB Preview Download
md5:66ecb69437cd6d46e24e18af45078238
575.3 MB Preview Download
md5:a1b42e97ac8bf719e77483e68b5dcff3
504.4 MB Preview Download
md5:428251057144326f1b319993ca5ecf6c
1.4 kB Preview Download

Additional details

Related works

Is described by
Journal: 10.1101/2025.09.29.679336 (DOI)

Dates

Updated
2025-11-06
Missing files added to HG005 and personalized assembly sets