There is a newer version of the record available.

Published July 31, 2023 | Version v1
Dataset Open

Datasets of positive and negative miRNA-target interactions

  • 1. Ben-Gurion University of the Negev

Description

MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression post-transcriptionally via base-pairing with complementary sequences on messenger RNAs (mRNAs). Computational approaches that predict miRNA target interactions (MTIs) facilitate the process of narrowing down potential targets for experimental validation. The availability of new datasets of high-throughput, direct MTIs has led to the development of machine learning (ML) based methods for MTI prediction. To train an ML algorithm, there is a need to supply entries from all class labels (i.e., positive and negative). Currently, no high-throughput assays exist for capturing negative examples, hindering effective classifier construction. Therefore, current ML approaches must rely on artificially generated negative examples for training. Moreover, the lack of uniform standards for generating such data leads to biased results and hampers comparisons between studies.  We investigated the impact of different methods to generate negative data on the classification of true MTIs. The study relies on training ML models on a fixed positive dataset in combination with different negative datasets and evaluating their intra- and cross-dataset performance. As a result, we were able to examine each method independently and evaluate ML models’ sensitivity to the methodologies utilized in negative data generation.

This data include all the negative datasets that generated by the different methods and the positive data that was used.

Files

CLIP_non_CLASH.csv

Files (2.8 GB)

Name Size Download all
md5:3522d87e10be53e4e5f3be59bdea773d
1.9 GB Preview Download
md5:b13c49d0572af917aa436659b5825a86
34.8 MB Preview Download
md5:1344359e3ff21f6d449f897f59e9cf6b
31.0 MB Preview Download
md5:b9a8cead4e81dd467a322bd43e7c8421
31.0 MB Preview Download
md5:2f8f24c5eb743e0918b7c796ce364d8e
31.1 MB Preview Download
md5:a18dd4ea04640f3e56128b3d0c8ebdfb
31.2 MB Preview Download
md5:51970d0ef4269fca0520e1a2d5c00ce8
31.0 MB Preview Download
md5:bbd3a82fc0640f8e8f5f279b7ba86c81
31.0 MB Preview Download
md5:8b168016c949020836052517612513f0
31.1 MB Preview Download
md5:d5fa25b0b50c6d2f4facb513d0da1a75
30.0 MB Preview Download
md5:5094902902c48a54b14e48cdf2ea9452
29.8 MB Preview Download
md5:f82e054bb1a83d8fc30ccefdde3d8ac1
29.3 MB Preview Download
md5:60eaa59514c1b9fdaa262bf2c6fa9c23
41.1 MB Preview Download
md5:d334ce579cafe340a49f496ebc2e151e
243.9 MB Preview Download
md5:6a5e3f3b9b8ddba0dff50619752213ac
32.1 MB Preview Download
md5:2af980f508db818312b50c0cbde1ceef
214.8 MB Preview Download