Published November 22, 2025 | Version PubChemTrans-0.2.2
Dataset Open

Transformations in PubChem - Full Dataset

  • 1. LCSB, Uni Luxembourg
  • 2. NCBI/NLM/NIH
  • 3. University of Amsterdam

Description

This is an archive of the data contained in the "Transformations" section in PubChem for integration into patRoon and other workflows.

For further details see the ECI GitLab site: README and main "tps" folder.

Credits:

Concepts: E Schymanski, E Bolton, J Zhang, T Cheng;

Code (in R): E Schymanski, R Helmus, P Thiessen

Transformations: E Schymanski, J Zhang, T Cheng and many contributors to various lists!

PubChem infrastructure: PubChem team

Acknowledgements: ECI team who contributed to related efforts, especially: J. Krier, A. Lai, M. Narayanan, T. Kondic, P. Chirsir, E. Palm, Bashir Mayahi. All contributors to the NORMAN-SLE transformations!

March 2025 released as v0.2.0 since the dataset grew by >3000 entries! Nov 2025 updated to new SMILES fields.

The stats are: 

# 22 Nov. 2025

Unique Transformation Entries: 11190
Unique Reactions by CID: 9323
Unique Reactions by IK: 9310
Unique Reactions by IKFB: 8742
Unique NORMAN-SLE Compounds by CID: 8377
Unique ChEMBL Compounds by CID: 1419
Unique Compounds (all) by CID: 9435
Unique Predecessors (all) by CID: 3811
Unique Successors (all) by CID: 7454
Range of XlogP Differences: -12.5,10
Range of Mass Differences: -957.97490813,820.227106427

Notes

These files are in active development; formats may change in future versions

Files

PubChem_all_transformations_wExtraInfo.csv

Files (19.2 MB)

Name Size Download all
md5:04320e5423d97f143915da108933f299
5.4 MB Preview Download
md5:eac195d8479658a749d2ae65753a241e
10.0 MB Preview Download
md5:85ddfe61925a3f8f545f0c0a14795070
3.8 MB Preview Download

Additional details

Related works

Is derived from
https://pubchem.ncbi.nlm.nih.gov/ (URL)
Is supplement to
10.1186/s13321-018-0277-8 (DOI)