Published November 4, 2025 | Version v1 | Dataset | Open
Datasets used in "Singletrack: An Algorithm for Improving Memory Consumption and Performance of Gap-Affine Sequence Alignment"
Description
This record contains the datasets used in the paper "Singletrack: An Algorithm for Improving Memory Consumption and Performance of Gap-Affine Sequence Alignment". It includes the following files:
| File | Sequence pairs (size) | Source |
|---|---|---|
| Illumina.250.10000000.seq | 10 million (5 GB) | NIST Genome in a Bottle (GIAB) project |
| ONT.PromethION-50K.seq | 9 (0.5 MB) | PrecisionFDA Truth Challenge V2 (subset) |
| PacBio.HF.1000000.seq | 1 million (25.7 GB) | Human Pangenome Reference Consortium |
This dataset is derived from original data produced by third parties, as detailed above. All rights to the original data remain with the original authors or copyright holders. Users are responsible for ensuring compliance with the licensing terms of the original data sources.
Files
| Name | Size | MD5 |
|---|---|---|
| singletrack_datasets.zip | 5.0 GB | a081f89040ddffcbe2c354f98fb45437 |