Published March 28, 2024 | Version 0.1
Dataset Open

Modelling data and model results Tox21

  • 1. ROR icon Institute for Anthropological Research
  • 2. ROR icon Copenhagen Prospective Studies on Asthma in Childhood

Description

The data sets accompany the manuscript 

The data sets rely on previous work from 

M. Lovrić, "Razvoj i primjena modela za procjenu ekotoksikoloških rizika bioaktivnih kemijskih spojeva", Disertacija, Sveučilište u Zagrebu, Prirodoslovno-matematički fakultet, Zagreb, 2021. Dostupno na: https://urn.nsk.hr/urn:nbn:hr:217:730117

and

Lovrić M, Malev O, Klobučar G, Kern R, Liu JJ, Lučić B. Predictive Capability of QSAR Models Based on the CompTox Zebrafish Embryo Assays: An Imbalanced Classification Problem. Molecules. 2021 Mar 15;26(6):1617. doi: 10.3390/molecules26061617

the original data was downloaded from the Tox21 challenge https://tripod.nih.gov/tox21/challenge/data.jsp 

The files comprise of:

  1. tox21_desc_p194.csv   - descriptor data for the prepared Tox21 data set
  2. tox21_fp_all.csv - fingerprint and maccs data for the prepared Tox21 data set
  3. tox21_target_all.csv - 12 binary Tox21 target(labels) 
  4. standardizer_pipeline.xml - structure standardizer pipeline for ChemAxon Standardizer 
  5. tox21_models_metadata.json - model metadata such as sklearn hyperparameters and descriptor feature lists

 

 

Files

standardizer_pipeline.xml

Files (38.9 MB)

Name Size Download all
md5:e8e2f1c5f26f81c460dfd3040b6ed552
1.1 kB Preview Download
md5:6c4dbd415fabaff9ec287604540c8a93
14.7 MB Preview Download
md5:e2e723696ba2a41257ff421d7166b515
23.7 MB Preview Download
md5:b6466e52a92796c1c815a82b784f062a
15.1 kB Download
md5:47f84d6b8b49df0a507f64b1eeea94ce
11.1 kB Preview Download
md5:7da3ed8eedddf55c22a3ff4758f5cb27
456.1 kB Preview Download

Additional details

Related works

Continues
Publication: 10.3390/molecules26061617 (DOI)
Dissertation: urn:nbn:hr:217:730117 (URN)

Software

Programming language
Python