Published February 12, 2022 | Version v3
Dataset Open

Sample graphs and sequences for testing sequence-to-graph alignment

Creators

  • 1. DFCI & HMS

Description

File descriptions:

  • MHC-61.agc: 61 complete MHC sequences, including GRCh38, CHM13 and 59 haplotypes extracted from HPRC year-1 assemblies. Use AGC to extract individual haplotype sequences.
  • MHC-57.gfa.gz: sequence graph constructed with minigraph-0.18, excluding samples HG002 and HG005
  • C4-96.agc: 96 complete C4 sequences obtained from HPRC
  • C4-90.gfa.gz: sequence graph extracted from the full HPRC year-1 minigraph graph (v0.17) around the C4A/C4B genes. The graph takes GRCh38 as the reference and excludes samples HG002, HG005 and NA19240.
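
As a quick sanity check on the two graph files, the sketch below counts the segment (S) and link (L) records in a GFA file and sums the segment lengths. It assumes only the standard tab-separated GFA layout, with the segment sequence in the third column; the function name `summarize_gfa` is illustrative and is not part of minigraph or any other tool mentioned here.

```python
import gzip
from collections import Counter


def summarize_gfa(path):
    """Count S and L records and total segment length in a GFA file.

    Assumes tab-separated GFA records. Files ending in ".gz" are read
    through gzip; anything else is read as plain text.
    """
    opener = gzip.open if str(path).endswith(".gz") else open
    records = Counter()
    segment_bp = 0
    with opener(path, "rt") as fh:
        for line in fh:
            fields = line.rstrip("\n").split("\t")
            if not fields[0]:
                continue  # skip blank lines
            records[fields[0]] += 1
            # Column 3 of an S record holds the sequence ("*" if absent).
            if fields[0] == "S" and len(fields) > 2 and fields[2] != "*":
                segment_bp += len(fields[2])
    return {
        "segments": records["S"],
        "links": records["L"],
        "segment_bp": segment_bp,
    }
```

Calling `summarize_gfa("MHC-57.gfa.gz")` or `summarize_gfa("C4-90.gfa.gz")` on a downloaded file returns a small dictionary that can be compared between graph versions.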

Files (4.2 MB)

  • md5:af5d296c23147b2ddb4137c21c75863e (46.0 kB)
  • md5:46264aa31f907cfa9887abd734867608 (91.2 kB)
  • md5:c41da94791c41a781b5f8a06c2e17f89 (1.5 MB)
  • md5:9e702ea0f5e08bd40d7e3ee9fa73502e (18.1 kB)
  • md5:73c3baa79db2f9f6c90ddfe25cb89eab (2.5 MB)
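
The md5 checksums listed above can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib`. The helper names `md5_of_file` and `verify_md5` are illustrative, and because the listing does not show which checksum belongs to which file, the caller has to supply that pairing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected):
    """Compare a file's md5 with an expected value.

    The expected value may carry the "md5:" prefix used in the listing.
    """
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `verify_md5(local_path, "md5:af5d296c23147b2ddb4137c21c75863e")` returns `True` only if `local_path` (a placeholder for wherever the file was saved) has that checksum.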