Published September 9, 2020 | Version 1.0
Dataset Open

Synthetic Noisy Reads Generated from Human Genomes

Authors/Creators

  • 1. University of Southern California

Description

This repository contains datasets with size 200K, 400K and 1 Million noisy reads generated from 5000 transcripts of GTcenters.fasta file. The assigned task is to recover the ground-truth transcripts (GTcenters) based on the given noisy reads.

Files

Cluster_ids_1M.csv

Files (1.9 GB)

Name Size Download all
md5:006a75ccb32254506f6a5fb4a268d6e1
5.8 MB Preview Download
md5:bf9ff9315ddd48464f1b1491e02db2e7
1.2 MB Preview Download
md5:c5c8ff774966201992451a9ee89b808e
2.3 MB Preview Download
md5:3fb135bd9c8c123d2872f3b2820bcca3
6.2 MB Download
md5:7975e20604e8ffb357870b62c5db2445
1.2 GB Download
md5:bda41693bef1610b3dba0f92fdaef534
237.8 MB Download
md5:3b5f6c698ec246238012730b3468acad
488.9 MB Download