Published April 7, 2023 | Version 1.0.0
Dataset Open

Fibertools: fast and accurate m6A calling using single-molecule long-read sequencing (ML data)

  • 1. Department of Genome Sciences, University of Washington, Seattle, WA, USA
  • 2. Division of Medical Genetics, University of Washington School of Medicine, Seattle, WA, USA
  • 3. Department of Statistics, University of Washington, Seattle, WA, USA
  • 4. Department of Medical Chemistry, University of Washington, Seattle, WA, USA
  • 5. Department of Genome Sciences, University of Washington, Seattle, WA, USA; Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA

Description

Fibertools is a convolutional neural network that permits the fast and accurate identification of endogenous and exogenous N6-methyladenine (m6A)-marked bases using single-molecule long-read sequencing. This dataset (ML data) provides training and validation data for training fibertools supervised and semi-supervised CNN models for three long-read chemistries.

Files

Files (10.0 GB)

Name Size Download all
md5:c92eca89ba0b914ecc273c148348900f
10.0 GB Download