Newmap: minimum unique mappable read lengths: GRCh38.p14: range ≤8 bp to 65535 bp
Authors/Creators
Description
Minimum unique mappable read lengths: GRCh38.p14: range ≤8 bp to 65535 bp
Assembly: GRCh38.p14
Range: ≤8 bp to 65535 bp
Newmap version: 0.2
Files
GCA_000001405.15_GRCh38_no_alt_analysis_set.unique.zip: A zip archive containing a unique.uint16 file for each sequence in the no alternative loci scaffolds analysis set for GRCh38.p14.
Methods
For each position in the genome, Newmap determines the minimum length of a sequencing read starting at that position that would occur only once in the assembly. Newmap searches both the primary sequence and the reverse complement. Positions whose minimum unique mappable lengths are in [1, 8] are reported as 8. Positions whose unique lengths are greater than 65535 are reported as 0.
File names
All filenames begin with a sequence ID from the genome reference. The extension unique.uint16 indicates unique lengths encoded as a flat array of unsigned little-endian 16-bit integers.
Files
GCA_000001405.15_GRCh38_no_alt_analysis_set.unique.zip
Files
(952.1 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:22f2c7a11194b44f5622a2c87ea307bf
|
952.1 MB | Preview Download |
Additional details
Funding
- Natural Sciences and Engineering Research Council
- Computational methods for chromatin data RGPIN-2022-05134
Software
- Repository URL
- https://github.com/hoffmangroup/newmap
- Programming language
- Python , C
- Development Status
- Active