There is a newer version of the record available.

Published February 12, 2022 | Version v2
Dataset Open

Sample graphs and sequences for testing sequence-to-graph alignment

Creators

  • 1. DFCI & HMS

Description

File descriptions:

  • MHC-61.agc: 61 complete MHC sequences, including GRCh38, CHM13 and 59 haplotypes extracted from HPRC year-1 assemblies. Use AGC to extract individual haplotype sequences.
  • MHC-57.gfa.gz: sequence graph constructed by minigraph-r434, excluding samples HG002 and HG005
  • C4-96.agc: 96 complete C4 sequences obtained from HPRC
  • C4-90.gfa.gz: sequence graph extracted from the full HPRC year-1 minigraph graph around the C4A/C4B genes. The graph takes GRCh38 as the reference and excludes samples HG002, HG005 and NA19240.
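
The following is a minimal Python sketch of how these files might be opened, not part of the record itself. It assumes the `agc` executable is installed and on the PATH, and uses its `listset` and `getset` subcommands to list sample names and extract one sample to FASTA. It also assumes the `.gfa.gz` files are gzip-compressed, tab-delimited GFA with `S` (segment) and `L` (link) lines. The function names `list_samples`, `extract_sample` and `summarize_gfa` are hypothetical, and the sample names stored inside the archives are not given here, so they should be read from `list_samples` rather than guessed.

```python
import gzip
import shutil
import subprocess
from collections import Counter


def list_samples(archive):
    """Return the sample names stored in an AGC archive (runs `agc listset`)."""
    if shutil.which("agc") is None:
        raise RuntimeError("agc executable not found on PATH")
    result = subprocess.run(
        ["agc", "listset", archive],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.split()


def extract_sample(archive, sample, out_fasta):
    """Write every contig of one sample to a FASTA file (runs `agc getset`)."""
    if shutil.which("agc") is None:
        raise RuntimeError("agc executable not found on PATH")
    with open(out_fasta, "wb") as out:
        # agc getset prints FASTA to stdout; redirect it into the output file.
        subprocess.run(["agc", "getset", archive, sample], stdout=out, check=True)


def summarize_gfa(path):
    """Count segments and links, and sum segment lengths, in a GFA file.

    Files ending in .gz are read through gzip; others are read as plain text.
    """
    opener = gzip.open if path.endswith(".gz") else open
    counts = Counter()
    total_bp = 0
    with opener(path, "rt") as fh:
        for line in fh:
            if not line.strip():
                continue  # skip blank lines
            fields = line.rstrip("\n").split("\t")
            counts[fields[0]] += 1
            # Column 3 of an S line is the sequence; "*" means it is not stored.
            if fields[0] == "S" and len(fields) > 2 and fields[2] != "*":
                total_bp += len(fields[2])
    return {"segments": counts["S"], "links": counts["L"], "total_bp": total_bp}
```

With the files downloaded to the working directory, `summarize_gfa("MHC-57.gfa.gz")` would give a quick size check of the graph, and `extract_sample("MHC-61.agc", name, "out.fa")` would write one haplotype for a `name` taken from `list_samples("MHC-61.agc")`.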

Files (4.1 MB)

Checksum and size of each file:

  • md5:cf1500af2f9163f22bb488c73a08d444, 44.4 kB
  • md5:46264aa31f907cfa9887abd734867608, 91.2 kB
  • md5:b82e77ae64700236c53922b9b2a27d5b, 1.5 MB
  • md5:ec881c1b0e7fb48eb29aff5ecf4636a5, 18.9 kB
  • md5:73c3baa79db2f9f6c90ddfe25cb89eab, 2.5 MB
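
A downloaded file can be checked against the md5 values listed above. The sketch below is one way to do this with Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_listed_md5` are hypothetical, and because the listing here does not show which file name belongs to which checksum, the expected value has to be matched to the file by the reader.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file against a listed value such as 'md5:cf15...'.

    The optional 'md5:' prefix is dropped before comparing.
    """
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For instance, `matches_listed_md5(path, "md5:cf1500af2f9163f22bb488c73a08d444")` should return `True` only for the 44.4 kB file.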