Published November 24, 2020 | Version v1
Dataset Open

Benchmark data used in the evaluation of ChiRA tool-suite.

Authors/Creators

Description

The reads generated mimic CLASH experimental data. Each read is a fusion of a human hg38 miRBase mature miRNAs and some random TargetScan target sequences. The numbers 10, 12, 15, 18, and 20 in the file names represent the length of the chimeric arms. The are 1million reads in FASTA file. Files with "Insert" in their names contain a short 5 nt random sequence, whereas the "noInsert" files do not.  TargetScanSites_merged.fa.fasta and hg38_MIR_mature.fa.fasta contain the reference sequences. The sequence identifiers in the FASTA file are in the following format:

>hsa-miR-193b-5p_MIMAT0004767:1-20||chr7:114656010-114656173+:82-95

where,

  • hsa-miR-193b-5p_MIMAT0004767 is the sequence ID of the miRNA from which first chimeric arm of this read was derived from. This ID can be found in hg38_MIR_mature.fa.fasta.
  • 1-20 represents the 1-based start and end positions on hsa-miR-193b-5p_MIMAT0004767 representing the origin of the first chimeric arm.
  • chr7:114656010-114656173+ is the sequence ID of the TargetScan target site from which the second chimeric arm of this read was derived from. This ID can be found in TargetScanSites_merged.fa.fasta
  • 82-95 represents the 1-based start and end positions on chr7:114656010-114656173+ representing the origin of the second chimeric arm.

Files

Files (1.5 GB)

Name Size Download all
md5:811ae7e56fb1ffc6c4be279567b12af0
133.2 kB Download
md5:eda8321be3fab56034269e724f3564ca
92.7 MB Download
md5:81427bab7722ed9bf14d593e34854c42
87.7 MB Download
md5:11a2702f3b2f898c00f33f66d5e23896
97.5 MB Download
md5:f7aa0b54bbe3740e22ed36b39ff91b0f
91.5 MB Download
md5:1bfca9940f9f33e54c12ed3f02d6d1d5
102.5 MB Download
md5:1d7305500843bd782fdb21e1313aa005
96.8 MB Download
md5:1d7305500843bd782fdb21e1313aa005
96.8 MB Download
md5:f4189fa7682e0fc63ca636b25d0c332c
107.5 MB Download
md5:f4189fa7682e0fc63ca636b25d0c332c
107.5 MB Download
md5:ddd00154038d3fa46b66d267e7ac7beb
99.8 MB Download
md5:ddd00154038d3fa46b66d267e7ac7beb
99.8 MB Download
md5:8dc1b54666bdba32697210a6b72475c3
112.5 MB Download
md5:8dc1b54666bdba32697210a6b72475c3
112.5 MB Download
md5:c4c4ae2023d0371faa2d03f3ee4b57f6
101.6 MB Download
md5:c4c4ae2023d0371faa2d03f3ee4b57f6
101.6 MB Download
md5:1512126578f82e8d016b3f124375fd74
5.6 MB Download
md5:1512126578f82e8d016b3f124375fd74
5.6 MB Download