Published July 22, 2025 | Version v2
Dataset Open

Dataset Supporting Publication: A Curated Geospatial Dataset of Chemicals Used in Hydraulic Fracturing and their Functions

  • 1. United States Environmental Protection Agency
  • 2. Environmental Protection Agency
  • 3. ChemConnector
  • 4. US Environmental Protection Agency Office of Research and Development

Description

Dataset Files Supporting the publication: A Curated Geospatial Dataset of Chemicals Used in Hydraulic Fracturing and their Functions. 

Kristin K. Isaacs1,*, Antony J. Williams1, Katherine Phillips1, Michael Jayne2, and Sean Thimons2

1. Center for Computational Toxicology and Exposure, Office of Research and Development, United States Environmental Protection Agency, 109 T.W. Alexander Drive, P.O. Box 12055, Research Triangle Park, NC 27711

2. Center for Environmental Solutions and Emergency Response, Office of Research and Development, United States Environmental Protection Agency, 26 Martin Luther King Dr W, Cincinnati, OH 45220

*Corresponding author

Submitted for publication as of 7/22/2025. 

Abstract:

Produced water (PW) is wastewater generated by the oil and gas industry as a byproduct of conventional extraction and hydraulic fracturing (HF) activities. Chemicals used in HF may affect PW quality and must be evaluated for risk to humans and ecological species when considering the conditions of PW reuse. FracFocus (FF) is a public database of chemicals used in HF, to which well operators report information either voluntarily or as mandated under specific state statutes. However, data reported to FF are not standardized in terms of either chemical identifier or chemical function during HF. Here, we develop and provide a curated geospatial database of chemicals used in HF and reported to FF, harmonized to standard chemical identifiers and function categories used by the U.S. Environmental Protection Agency, which facilitates direct linkage to its chemical exploration tools such as the CompTox Chemicals Dashboard. This harmonized dataset provides an enhanced resource to risk assessors and environmental programs when evaluating the potential impact of HF chemicals in PW reuse.

Associated code available at: https://github.com/HumanExposure/ProducedWaterChemicalCuration

Included files:

finalcuratedalldata.zip: Final dataset of curated chemical and function records in CSV format (compressed)

Input: Input files for running code (stub in code repo) 

Output: Intermediate output files (stub in code repo). The final dataset in CSV format is provided separately, but this folder does include the final dataset of curated chemical and function records in R RDS format (finalcuratedalldata.RDS)
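As a quick way to inspect the final CSV dataset without unpacking the whole archive, the following is a minimal sketch using only the Python standard library. The function name preview_csv_in_zip is hypothetical, and the sketch assumes the archive contains at least one member ending in .csv encoded as UTF-8; neither the member name nor the encoding is stated in this record.

```python
import csv
import io
import zipfile
from itertools import islice


def preview_csv_in_zip(zip_path, max_rows=5):
    """Return (member_name, header, rows) for the first CSV member of a zip archive.

    Assumption: the CSV is UTF-8 encoded (a leading byte-order mark is tolerated).
    """
    with zipfile.ZipFile(zip_path) as archive:
        # Pick the first member whose name ends in .csv, whatever it is called.
        csv_members = [name for name in archive.namelist()
                       if name.lower().endswith(".csv")]
        if not csv_members:
            raise ValueError(f"no CSV member found in {zip_path}")
        member = csv_members[0]
        with archive.open(member) as raw:
            text = io.TextIOWrapper(raw, encoding="utf-8-sig", newline="")
            reader = csv.reader(text)
            header = next(reader, [])
            # Read only the first few records; the full file is large.
            rows = list(islice(reader, max_rows))
    return member, header, rows
```

Calling preview_csv_in_zip("finalcuratedalldata.zip") on the downloaded archive (the name shown under Files) would return the column names and the first five records, which is usually enough to decide how to load the full file.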

The zipped Input and Output folders can be unzipped and placed in the same directory as the code (see link above) to re-run the curation workflow. Note, however, that some intermediate files were created manually. The original FracFocus data would also need to be obtained from the FracFocus site.
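The unpacking step described above can be sketched as follows. This is an illustration under stated assumptions, not part of the published workflow: the function name extract_archives is hypothetical, and the archive file names used in the usage note (Input.zip, Output.zip) are assumed, since this record does not list them.

```python
import zipfile
from pathlib import Path


def extract_archives(archive_paths, code_dir):
    """Extract each zip archive into code_dir and return the extracted member names."""
    code_dir = Path(code_dir)
    code_dir.mkdir(parents=True, exist_ok=True)
    extracted = []
    for archive_path in archive_paths:
        with zipfile.ZipFile(archive_path) as archive:
            # Members keep their internal folder structure under code_dir.
            archive.extractall(code_dir)
            extracted.extend(archive.namelist())
    return extracted
```

For example, extract_archives(["Input.zip", "Output.zip"], "ProducedWaterChemicalCuration") would place the archive contents inside a local clone of the code repository, assuming those are the downloaded file names and that each archive has its folder at the top level.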

Disclaimer:

The United States Environmental Protection Agency, through its Office of Research and Development, funded and managed the research and data described here. The data included herein do not represent and should not be construed to represent any Agency determination or policy. Further, any reference to commercial products or services does not constitute endorsement. The referenced manuscript has been subject to Agency review and approved for publication.

Files

finalcuratedalldata.zip

Files (1.1 GB)

md5:1771317afa35202a397ad2d5d2b71c4c (452.4 MB)
md5:b688add532f474c3269763597e5ab98d (7.3 MB)
md5:c97dd0c1c49ce0a63c5dbd92c2bcd53a (650.0 MB)
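The MD5 checksums listed above can be used to confirm that a download completed intact. The following is a minimal sketch using Python's standard hashlib module; the function names md5_of_file and matches_listed_md5 are hypothetical. The listing does not show which checksum belongs to which file name, so match each download to its checksum by file size.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads keep memory use flat for files of several hundred MB.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or plain hex."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, matches_listed_md5(path_to_download, "md5:b688add532f474c3269763597e5ab98d") should return True for an intact copy of the 7.3 MB file.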

Additional details

Dates

Created
2025-05-13