Published October 23, 2023 | Version v1
Journal article Open

Per- and Polyfluoroalkyl Substances (PFAS) in PubChem: 7 Million and Growing

  • 1. ROR icon University of Luxembourg
  • 2. ROR icon National Center for Biotechnology Information

Description

Paper published in Environmental Sceince and Technology

Abstract

Per- and polyfluoroalkyl substances (PFAS) are of high concern, with calls to regulate them as a class. In 2021, the Organisation for Economic Co-operation and Development (OECD) revised the definition of PFAS to include any chemical containing at least one saturated CF2 or CF3 moiety. The consequence is that one of the largest open chemical collections, PubChem, with 116 million compounds, now contains over 7 million PFAS under this revised definition. These numbers are several orders of magnitude higher than previously established PFAS lists (typically thousands of entries) and pose an incredible challenge to researchers and computational workflows alike. This article describes a dynamic, openly accessible effort to navigate and explore the >7 million PFAS and >21 million fluorinated compounds (September 2023) in PubChem by establishing the “PFAS and Fluorinated Compounds in PubChem” Classification Browser (or “PubChem PFAS Tree”). A total of 36500 nodes support browsing of the content according to several categories, including classification, structural properties, regulatory status, or presence in existing PFAS suspect lists. Additional annotation and associated data can be used to create subsets (and thus manageable suspect lists or databases) of interest for a wide range of environmental, regulatory, exposomics, and other applications.

Files

schymanski-et-al-2023-per-and-polyfluoroalkyl-substances-(pfas)-in-pubchem-7-million-and-growing.pdf

Additional details

Funding

European Commission
ZeroPM - ZeroPM: Zero pollution of Persistent, Mobile substances 101036756