Published December 16, 2025 | Version v1
Dataset Open

Curated chemogenomics datasets for drug sensitivity prediction

Authors/Creators

Contributors

Data curator:

Description

Two files are provided: 'rma_data_Dec2025.pkl' and 'sensitivity_data_Dec2025.pkl'

'rma_data_Dec2025.pkl' -  rma gene expression data extracted and curated from the Genomics of Drug Sensitivity in Cancer (GDSC) repository, reporting the cosmic ID, the cell line name, the tissue site and the spesific histology type.

'sensitivity_data_Dec2025.pkl' - sensitivity data expressed as LN IC50, AUC and Z-score normalized IC50 values. Each cell line-drug interaction is annotated with the the additional following information: cell line cosmic ID, cell line name, drug id, drug name, putative  drug target, pathway involved, nonstereo aromatic smile of the drug.

Files

Files (139.6 MB)

Name Size Download all
md5:a53664604b7cd8a72c127e65f7b1bd43
126.6 MB Download
md5:9ac164bd611bd43a6964a2ed87c7a0c0
13.0 MB Download

Additional details

Dates

Created
2025-12-16