Published June 11, 2025 | Version v1
Dataset Open

Co-occurrences of chemicals, genes, proteins, and diseases in patent records in PubChem

Description

This Zenodo record contains the data and analysis source codes used in the work described in the paper “Summarizing relationships between chemicals, genes, proteins, and diseases in PubChem using analysis of their co-occurrences in patents” by Zaslavsky et al., submitted for publication in Journal of Cheminformatics. The  content in this Zenodo record is for archival purposes to support the publication, with no expectation of future updates or that issue reports will be addressed.

Users can explore the co-occurrence data interactively through the summary pages of individual compound and gene records or download it programmatically from  PubChem (https://pubchem.ncbi.nlm.nih.gov) as described in the paper.

Files

COOCCURRENCE_IN_PATENTS.zip

Files (6.3 GB)

Name Size Download all
md5:5812fab675f76b91ee3b0a49c525f1ff
6.3 GB Preview Download

Additional details

Software

Programming language
Java