Published March 6, 2025 | Version AIVIVE 1.0
Dataset Open

AIVIVE: A Novel AI Framework for Enhanced In Vitro to In Vivo Extrapolation (IVIVE) of Toxicogenomics Data

Description

The dataset consists of transcriptomic profiles from rat liver tissue, curated from Open TG-GATEs database (link in references), along with predictions generated by the AIVIVE generator model.

The transcriptomic profiles are derived from both in vitro and in vivo experiments involving single-dose treatments of various compounds. The data is preprocessed using the RMA (Robust Multi-array Average) method, which ensures that the data is adjusted for batch effects and other systematic variations.

  • Training Data: 80% of the data is used for training the machine learning models. This subset is based on the unique compounds, meaning each compound has corresponding transcriptomic data across different exposures.
  • Test Data: 20% of the data is held back as a test set to evaluate the model's performance and generalization ability.

The dataset was obtained from Download - Open TG-GATEs | LSDB Archive. RMA normalization was performed in R (version 4.4.1). Additionally, the predictions from the optimal AIVIVE generator model for both training and testing sets are included that were used for further analysis.

Files:

Files

generator1_encoded_prediction_9962160_VivoGenerator.csv

Files (8.9 GB)

Name Size Download all
md5:8d3e92b81192baa680b879eecd5d332b
2.6 GB Preview Download
md5:2d1d75d6634d96e34be6c27610fe8744
661.3 MB Preview Download
md5:139f24ef70d1af8730f3e08ae032d723
1.9 GB Preview Download
md5:217681c2335da3371182588ec1ff5a33
3.7 GB Preview Download

Additional details

Dates

Available
2025-04-09

Software

Repository URL
https://github.com/CHANDMX20/AIVIVE
Programming language
Python , R
Development Status
Active

References

  • Igarashi Y, Nakatsu N, Yamashita T, Ono A, Ohno Y, Urushidani T, Yamada H. Open TG-GATEs: a large-scale toxicogenomics database. Nucleic Acids Res. 2015 Jan;43(Database issue):D921-7. doi: 10.1093/nar/gku955. Epub 2014 Oct 13. PMID: 25313160; PMCID: PMC4384023.