Datasets used in "Singletrack: An Algorithm for Improving Memory Consumption and Performance of Gap-Affine Sequence Alignment"

López-Villellas, Lorién; Iñiguez Rodriguez, Cristian; Jiménez-Blanco, Albert; Aguado-Puig, Quim; Moretó, Miquel; Alastruey-Benedé, Jesús; Ibáñez, Pablo; Marco-Sola, Santiago

doi:10.5281/zenodo.17525721

Published November 4, 2025 | Version v1

Dataset Open

1. Universidad de Zaragoza
2. Instituto Universitario de Investigación en Ingeniería de Aragón [I3A]
3. Barcelona Supercomputing Center
4. Universitat Politècnica de Catalunya
5. Universitat Autònoma de Barcelona

Datasets used in the paper "Singletrack: An Algorithm for Improving Memory Consumption and Performance of Gap-Affine Sequence Alignment". The archive includes the following files:

File	Sequence pairs (size)	Source
Illumina.250.10000000.seq	10 million (5 GB)	NIST Genome in a Bottle (GIAB) project
ONT.PromethION-50K.seq	9 (0.5 MB)	Precision FDA Truth Challenge V2 (subset)
PacBio.HF.1000000.seq	1 million (25.7 GB)	Human Pangenome Reference Consortium

This dataset is derived from original data produced by third parties, as detailed above. All rights to the original data remain with the original authors or copyright holders. Users are responsible for ensuring compliance with the licensing terms of the original data sources.

Files

Files (5.0 GB)

Name	Size
singletrack_datasets.zip (md5:a081f89040ddffcbe2c354f98fb45437)	5.0 GB
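
Since the record lists an MD5 digest for the archive, a downloaded copy can be checked against it before unpacking. The following is a minimal sketch in Python, not part of the original record: the local path `singletrack_datasets.zip` and the helper names `md5_of_file` and `verify_archive` are assumptions made for illustration, and only the standard-library `hashlib` module is used.

```python
import hashlib
from pathlib import Path

# MD5 digest as listed for singletrack_datasets.zip in the file table above.
EXPECTED_MD5 = "a081f89040ddffcbe2c354f98fb45437"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that a multi-gigabyte archive never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected one
    (comparison is case-insensitive)."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical local path; adjust to wherever the archive was saved.
    archive = Path("singletrack_datasets.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

A mismatch most likely indicates an incomplete or corrupted download, in which case fetching the archive again is the usual remedy.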

