Published October 17, 2024 | Version v1.0.0
Dataset Open

NanoBaseLib: A Multi-Task Benchmark Dataset for Nanopore Sequencing

  • 1. ROR icon Aalto University
  • 2. ROR icon University of Eastern Finland

Description

NanoBaseLib is a multi-task benchmark dataset for Nanopore Sequencing. We compile and preprocess publicly available datasets using a unified pipeline to ensure consistency and quality across all tasks. The dataset is benchmarked for four key Nanopore sequencing tasks: base calling, polyA detection, segmentation and event alignment, and RNA modification detection.  NanoBaseLib is available at https://nanobaselib.github.io

Files

Files (190.2 GB)

Name Size Download all
md5:660cddccbfb18f1049abc809b4f77b81
3.6 GB Download
md5:c94aeb03bba9ba5a89ea9ec278b9b82d
319.1 MB Download
md5:6d812f2f3d9f658face55442b4a5d089
1.1 GB Download
md5:4627b533cd5588a70742ede1cd473ef4
6.6 GB Download
md5:f36b2d9085c0412d557226fd4935fa3b
357.9 MB Download
md5:af992500d7f1c3a1999e2414a1f7e414
1.4 GB Download
md5:9ee3d3b5874aacb801560f35be387f5a
6.0 GB Download
md5:b95e15df2d4ce9a3b59811abc93fbefb
307.7 MB Download
md5:726940efc0b436b3b0666aae89e8abce
81.9 GB Download
md5:70b89dda71debe0c8bfbf2e91b4d0b99
394.3 MB Download
md5:aae1f776197feaa0af032f27d5ed20ca
228.5 MB Download
md5:1dd4703896d42f84144bf9a5a9e82b48
376.4 MB Download
md5:40ab24640cf67ea11f9b7461c1b7b371
70.6 GB Download
md5:64aec2f6be456d37510bbed7f20da9ea
2.9 GB Download
md5:8ab01747d30490dc1e9a318fa6b659c6
110.5 MB Download
md5:9189dbc0ee8871250f1a4cbd604c704a
5.0 GB Download
md5:c0faaf7e9dc18abbed857988dd598a74
8.8 GB Download
md5:d62844cb5ce0583ce5376612397200c4
157.7 MB Download

Additional details