Published November 4, 2025 | Version v1
Dataset Open

Datasets used in "Singletrack: An Algorithm for Improving Memory Consumption and Performance of Gap-Affine Sequence Alignment"

  • 1. ROR icon Universidad de Zaragoza
  • 2. Instituto Universitario de Investigación en Ingenería de Aragón [I3A]
  • 3. ROR icon Barcelona Supercomputing Center
  • 4. ROR icon Universitat Politècnica de Catalunya
  • 5. ROR icon Universitat Autònoma de Barcelona

Description

Datasets used in the "Singletrack: An Algorithm for Improving Memory Consumption and Performance of Gap-Affine Sequence Alignment" paper. It includes:

File Size Source
Illumina.250.10000000.seq 10 million sequence pairs (5GB) NIST Genome in a Bottle (GIAB) project
ONT.PromethION-50K.seq 9 sequence pairs (0.5MB) Precision GDA Truth Challenge V2 (subset)
PacBio.HF.1000000.seq 1 million sequence pairs (25.7GB) Human Pangenome Reference Consortium


This dataset is derived from original data produced by third parties, as detailed above. All rights to the original data remain with the original authors or copyright holders. Users are responsible for ensuring compliance with the licensing terms of the original data sources.

Files

singletrack_datasets.zip

Files (5.0 GB)

Name Size Download all
md5:a081f89040ddffcbe2c354f98fb45437
5.0 GB Preview Download