Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.

There is a newer version of the record available.

Published June 27, 2022 | Version 1.11.0
Dataset Open

PubChemLite for Exposomics

  • 1. NIH/NLM/NCBI
  • 2. LCSB, University of Luxembourg

Description

This is the repository for regular updates of the PubChemLite for Exposomics data collection. PubChemLite for Exposomics is a subset of PubChem selected from major categories of the Table of Contents page at the PubChem Classification Browser, described in DOI:10.1186/s13321-021-00489-0.

PubChemLite for Exposomics is compiled from 10 categories: AgroChemInfo, BioPathway, DrugMedicInfo, FoodRelated, PharmacoInfo, SafetyInfo, ToxicityInfo, KnownUse, DisorderDisease, Identification.

PubChemCIDs have been collapsed by InChIKey first block, reporting the structure from the most annotated CID, plus related CIDs. Entries that will be ignored by MetFrag (salts, disconnected substances) or cause errors (e.g. transition metals) have been removed. The Patent and PubMed ID counts are extracted from files on the PubChem FTP site. The `AnnoTypeCount' term counts how many of the categories are represented, the subsequent column (named per category) counts the number of annotation categories available in the next sub-category of the TOC entry.

These files can be used `as is' as localCSV for MetFrag Command Line.

Notes

These files can be used "as is" for MetFrag (command line) and other high throughput workflows.

Please do NOT upload these files directly to the MetFrag web interface, they are too large. The latest updates will be available in a drop-down menu.

Files

PubChemLite_exposomics_20220624.csv

Files (195.0 MB)

Name Size Download all
md5:8de1f3e90c1eb9fd0e58a86d49559f80
195.0 MB Preview Download

Additional details

References

  • Schymanski, E.L., Kondić, T., Neumann, S. et al. Empowering large chemical knowledge bases for exposomics: PubChemLite meets MetFrag. J Cheminform 13, 19 (2021). https://doi.org/10.1186/s13321-021-00489-0