Published June 17, 2025 | Version 1.3
Dataset Open

Easy genomic regions for short-read variant calling

Description

Panmask easy regions:

  • pm151a.easy: excluding 151-mers occurring ≥1.01n times in the pangenome, where n is the number of genomes.
  • pm151b.easy: pm151a.easy, further excluding low-complexity regions (LCRs) longer than 18 bp identified by SDUST.
  • pm401a.easy: like pm151a.easy, but built from 401-mers instead of 151-mers. Not well evaluated.
  • pm401b.easy: pm401a.easy, further excluding LCRs longer than 30 bp.
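
The k-mer rule in the list above can be illustrated on toy data. The following is a minimal sketch, not panmask's implementation: the helper names `kmer_counts` and `is_excluded` are hypothetical, it counts forward-strand k-mers only (an assumption; how panmask treats reverse complements is not stated here), and it uses k=4 on three short strings instead of k=151 on a pangenome. The threshold is written with integers (count × 100 ≥ 101 × n) to avoid floating-point rounding at the boundary.

```python
from collections import Counter

def kmer_counts(genomes, k):
    """Count every k-mer occurrence across all genomes (forward strand only)."""
    counts = Counter()
    for seq in genomes:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts

def is_excluded(count, n_genomes, num=101, den=100):
    """True when count >= (num/den) * n_genomes, using integer arithmetic."""
    return count * den >= num * n_genomes

# Toy data: three "genomes" and k=4 (assumed values for illustration only).
genomes = ["ACGTACGTTT", "ACGTACGAAA", "ACGTCCCCCC"]
counts = kmer_counts(genomes, k=4)
n = len(genomes)

# k-mers seen at least 1.01 * n times in total are flagged for exclusion.
excluded = sorted(kmer for kmer, c in counts.items() if is_excluded(c, n))
print(excluded)
```

With n = 3 the threshold is 3.03, so a k-mer needs at least 4 occurrences in total to be flagged. A k-mer present exactly once in each genome (count = n) stays in the easy set.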

All four region sets exclude assembly gaps and non-chromosomal contigs. See panmask for details.
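
A typical use of such regions is to restrict variant calls to positions inside them. The sketch below assumes the regions are available as BED-like lines (chromosome, 0-based start, exclusive end) and that intervals on a chromosome do not overlap; neither the file format nor the file names are stated on this page, so treat both as assumptions. The helpers `load_regions` and `in_easy_region` are hypothetical names used for illustration.

```python
import bisect
from collections import defaultdict

def load_regions(lines):
    """Parse BED-like lines into {chrom: sorted [(start, end), ...]}."""
    regions = defaultdict(list)
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comments
        chrom, start, end = line.split()[:3]
        regions[chrom].append((int(start), int(end)))
    for chrom in regions:
        regions[chrom].sort()
    return regions

def in_easy_region(regions, chrom, pos0):
    """True if the 0-based position lies in a region (non-overlapping intervals assumed)."""
    intervals = regions.get(chrom)
    if not intervals:
        return False
    # Index of the last interval whose start is <= pos0.
    i = bisect.bisect_right(intervals, (pos0, float("inf"))) - 1
    return i >= 0 and intervals[i][0] <= pos0 < intervals[i][1]

# Toy intervals standing in for a real region file (assumed content).
toy_bed = ["chr1\t100\t200", "chr1\t300\t400"]
regions = load_regions(toy_bed)
print(in_easy_region(regions, "chr1", 150))
print(in_easy_region(regions, "chr1", 250))
```

VCF positions are 1-based, so a variant at VCF position p would be queried as `pos0 = p - 1` under the BED convention assumed here.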

Files (35.0 MB)

MD5 checksum                          Size
md5:78ea118a3e97ab8c6a3dc51d3b02d45b  2.4 MB
md5:0d1efde1ebd3ce3d8d49097530343d8a  9.2 MB
md5:6784264aefd34985180ecf137c8c7fa7  706.1 kB
md5:1f0205168da0ad9ac46e9b8d8e9e25fb  5.4 MB
md5:76ecce087dc66938dc6a91ab56cd588f  2.3 MB
md5:69dd49bfc77b612a6c72b55a7111e4c9  9.1 MB
md5:1047f01397bc96e72d8149ae0609826b  667.9 kB
md5:193c3ea9ff2e25b8a37cd0be8a74cfa5  5.2 MB
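
The MD5 values listed for the files can be used to check a download for corruption. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_stream` and `matches_listing` are hypothetical, and the demonstration hashes an in-memory byte string rather than one of the dataset files, whose names are not given here.

```python
import hashlib
import io

def md5_of_stream(stream, chunk_size=1 << 20):
    """Return the hex MD5 digest of a binary stream, read in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()

def matches_listing(stream, listed):
    """Compare a stream's digest with a listed value such as 'md5:<hex digest>'."""
    expected = listed.split(":", 1)[-1].lower()
    return md5_of_stream(stream) == expected

# In-memory demonstration. For a real download, pass open(path, "rb") instead,
# where path is wherever the file was saved locally (hypothetical).
demo = io.BytesIO(b"hello")
ok = matches_listing(demo, "md5:5d41402abc4b2a76b9719d911017c592")
print(ok)
```

Reading in chunks keeps memory use constant regardless of file size. MD5 is adequate for detecting accidental corruption but should not be relied on as a defense against deliberate tampering.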

Additional details

Related works

Is derived from
Dataset: 10.5281/zenodo.13948741 (DOI)
Is described by
Software: https://github.com/lh3/panmask (URL)