Published April 1, 2026 | Version v1
Dataset Open

Newmap: minimum unique mappable read lengths: GRCh38.p14: range ≤8 bp to 65535 bp

  • 1. ROR icon University Health Network
  • 2. ROR icon University of Toronto
  • 3. ROR icon Vector Institute

Description

Minimum unique mappable read lengths: GRCh38.p14: range ≤8 bp to 65535 bp

Assembly: GRCh38.p14
Range: ≤8 bp to 65535 bp
Newmap version: 0.2

Files

GCA_000001405.15_GRCh38_no_alt_analysis_set.unique.zip: A zip archive containing a unique.uint16 file for each sequence in the no alternative loci scaffolds analysis set for GRCh38.p14.

Methods

For each position in the genome, Newmap determines the minimum length of a sequencing read starting at that position that would occur only once in the assembly. Newmap searches both the primary sequence and the reverse complement. Positions whose minimum unique mappable lengths are in [1, 8] are reported as 8. Positions whose unique lengths are greater than 65535 are reported as 0.

File names

All filenames begin with a sequence ID from the genome reference. The extension unique.uint16 indicates unique lengths encoded as a flat array of unsigned little-endian 16-bit integers.

Files

GCA_000001405.15_GRCh38_no_alt_analysis_set.unique.zip

Files (952.1 MB)

Additional details

Funding

Natural Sciences and Engineering Research Council
Computational methods for chromatin data RGPIN-2022-05134

Software

Repository URL
https://github.com/hoffmangroup/newmap
Programming language
Python , C
Development Status
Active