Published November 8, 2025 | Version v2
Dataset Open

Deep learning models for predicting Toxicity and Bioactivity of the chemical exposome: a case study for the Blood Exposome Database

  • 1. Icahn School of Medicine at Mount Sinai

Description

Input datasets and result tables for the publication on predicting toxicity and bioactivity of compounds in the Blood Exposome Database.

 

Table 2: GHS classification (make an extra column with training vs. test tags; also add compound properties to this table). CIDs in this table have been deduplicated.

Table 3: Tox21 input data (see final output table and create a similar matrix just with train and test tags)

Table 4a: Tox21 compound data (properties of compounds)

Table 4b: Tox21 compound bioactivity data

Table 5: Tox21 assay list with AUC values and calculated optimal thresholds for activity classification (44 assays show AUC over 0.80)

Table 6a: Tox21 predictions for blood exposome compounds (raw values)

Table 6b: Tox21 predictions for blood exposome compounds (binary values with optimal thresholds applied). Note: Duplicate CIDs were included.

Table 7a: GHS model predictions for blood exposome compounds (raw values)

Table 7b: GHS model predictions for blood exposome compounds (binary values). Note: Duplicate CIDs were included.

Table 8: Compound frequency by assay

Table 9: Tox21 – Blood exposome – GHS overlap (table goes to Zenodo; should be based on CID, no need for SMILES standardization)
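
The relationship between the raw and binary prediction tables (6a and 6b) can be illustrated with a minimal sketch. It assumes each row holds a CID plus one raw score per assay, and that a compound is called active when its score is at or above that assay's optimal threshold from Table 5. The column names, assay names, threshold values, and the "at or above" convention are all hypothetical and are not taken from the files themselves; check them against the actual tables before use.

```python
from typing import Dict, List, Union

Row = Dict[str, Union[int, float]]


def binarize_row(row: Row, thresholds: Dict[str, float], id_key: str = "CID") -> Row:
    """Return a copy of one raw-prediction row with 0/1 calls per assay.

    Assumption: a score >= the assay threshold means "active" (1).
    """
    out: Row = {id_key: row[id_key]}
    for assay, cutoff in thresholds.items():
        out[assay] = int(float(row[assay]) >= cutoff)
    return out


def binarize_table(rows: List[Row], thresholds: Dict[str, float]) -> List[Row]:
    """Apply per-assay thresholds to every row; duplicate CIDs are kept as-is."""
    return [binarize_row(row, thresholds) for row in rows]


# Hypothetical assay names, thresholds, and scores, for illustration only.
thresholds = {"assay_A": 0.35, "assay_B": 0.60}
raw_rows = [
    {"CID": 101, "assay_A": 0.40, "assay_B": 0.10},
    {"CID": 202, "assay_A": 0.20, "assay_B": 0.75},
    {"CID": 101, "assay_A": 0.40, "assay_B": 0.10},  # duplicate CID retained
]
binary_rows = binarize_table(raw_rows, thresholds)
```

Duplicate CIDs are deliberately left in place here, mirroring the note on Tables 6b and 7b; deduplicate by CID first if one row per compound is needed.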

 

Files

zenodo_table4b.csv

Files (487.4 MB)

Checksum Size
md5:5668a376460a3e7fa2f40f78c896c968 3.0 MB
md5:a96d9c76c86efd27e5294a0d6aff493f 75.2 MB
md5:da3d777d7472dd315d51ed8e46930db3 42.9 MB
md5:00ff50e18cea28a0e5f84ef0e8d660d4 3.6 MB
md5:ffe47882249d54e4e2f6c4d38eba5966 132.0 MB
md5:5cf831d03f1c2fba3129bf4f38fcfc84 15.4 kB
md5:268d44dbc32e5fbf2ef40975a86dc2d8 177.0 MB
md5:572013c192e4509595a3f5acab9b50ea 33.1 MB
md5:e9fe84091985ee96e8eb75053ef1c5f0 2.4 MB
md5:e012b978f87c76ece332c1b394795fd6 2.0 MB
md5:669901c9dca8e1337bb686f223c24d6b 359.7 kB
md5:483d67334a1f88e3b188da3fb7d975e3 15.9 MB
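
A downloaded file can be checked against the md5 values listed above. The following is a minimal sketch using only the Python standard library. The function names are illustrative, and because the listing does not show which checksum belongs to which file name, the pairing of a local path with a listed value is left to the user.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path: str, listed: str) -> bool:
    """Compare a file's MD5 with a value written as 'md5:<hex>' or '<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listing("zenodo_table4b.csv", "md5:<value from the listing>")` returns True only if the local copy is byte-identical to the file that produced that checksum.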

Additional details

Funding

National Institutes of Health
Exposome Correlation and Interpretation Database (ECID) 5U24ES035386-02