There is a newer version of the record available.

Published January 14, 2020 | Version PubChemLite.0.2.0
Dataset Open

PubChemLite tier0 and tier1

  • 1. NIH/NLM/NCBI
  • 2. LCSB, Uni Luxembourg

Description

PubChemLite is a subset of PubChem (https://pubchem.ncbi.nlm.nih.gov/) selected from major categories of the Table of Contents page at the PubChem Classification Browser (https://pubchem.ncbi.nlm.nih.gov/classification/#hid=72). So far we are providing two "flavours":

tier0 is 316,810 compounds (14 Jan 2020) compiled from 7 categories: AgroChemInfo, DrugMedicInfo, FoodRelated, PharmacoInfo, SafetyInfo, ToxicityInfo, KnownUse

tier1 is 363,911 compounds (14 Jan 2020) compiled from 8 categories (tier0 + BioPathway): AgroChemInfo, BioPathway, DrugMedicInfo, FoodRelated, PharmacoInfo, SafetyInfo, ToxicityInfo, KnownUse

PubChemCIDs have been collapsed by InChIKey first block, reporting the structure from the most annotated CID, plus related CIDs. Entries that will be ignored by MetFrag (salts, disconnected substances) or cause errors (e.g. transition metals) have been removed. The Patent and PubMed ID counts are extracted from files on the PubChem FTP site. The "AnnoTypeCount" term counts how many of the categories are represented, the subsequent column (named per category) counts the number of annotation categories available in the next sub-category of the TOC entry.

These files can be used "as is" as localCSV for MetFrag Command Line (https://ipb-halle.github.io/MetFrag/) - please do NOT upload these files directly to the web interface, they are too large and will be available in a drop-down menu.

Further details are described in Schymanski et al. (2021) DOI:10.1186/s13321-021-00489-0.

NOTE: The latest PubChemLite for Exposomics version can be downloaded at DOI:10.5281/zenodo.5995885 (currently updating monthly).

Files

PubChemLite_14Jan2020_tier0.csv

Files (352.1 MB)

Name Size Download all
md5:9fd25cfca58138f5a957bc6894a410ae
160.6 MB Preview Download
md5:980fa01be7a675430ce425159c1effca
191.5 MB Preview Download

Additional details