Published December 18, 2025 | Version v1
Dataset Open

Dataset: Earthquake Seismogram Denoiser (EQS-Denoiser)

Authors/Creators

  • 1. ETH Zurich
  • 1. ROR icon ETH Zurich
  • 2. EDMO icon ETH Zürich

Description

This dataset contains preprocessed earthquake event and noise waveforms, supplementary to the publication:

"Towards Operational Earthquake Seismogram Denoising"
by Nikolaj Dahmen, John Clinton, Men-Andrin Meier, Luca Scarabello
Bulletin of the Seismological Society of America

Waveform data are provided in physical units [m/s],  Z12 components and with a sampling rate of 100 Hz. Event signals have been preprocessed according to the methods described in the publication to reduce background noise (see Figure 2). Event signals have 18k samples with P arrival aligned at sample 6000, noise signals have 8k samples. Dataset statistics for event and noise waveforms are summarized in Figures S1 and S2. The dataset is separated into training, validation, and testing dataset, and are here available for publicly accessible networks. Metadata includes information on recording station, earthquake, and identified phase arrivals. A subset of the test dataset with fixed event and noise waveform pairs is provided in the benchmark dataset. 

Data are provided in HDF5 format with accompanying metadata, compatible with SeisBench.

 

Proposed dataset structure:

  • Event waveforms:

    • Event_data/waveforms.hdf5

    • Event_data/metadata.csv

  • Noise waveforms:

    • Noise_data/waveforms_noise.hdf5

    • Noise_data/metadata_noise.csv

 

Importing dataset with SeisBench:

import seisbench.data as sbd

# load noise dataset
noise_ds = sbd.WaveformDataset("/Noise_data/", sampling_rate=100, component_order="Z12")
# Training examples: 274,240
# Development examples: 81,416
# Test examples: 40,096

# load event dataset
event_ds = sbd.WaveformDataset("/Event_data/", sampling_rate=100, component_order="Z12")
# Training examples: 19,022
# Development examples: 5,385
# Test examples: 2,695

Data Sources:

Network Source Year Link
CH National Seismic Networks of Switzerland (SED, ETH Zurich) 1983 Link
9S Landslide deployments, Switzerland 2012 Link
8D Aftershock deployments, Switzerland 2025 Link
G2 GEOBEST geothermal project networks 2006 Link
S Seismology at School Program 2008 Link
C4 CERN Seismic Network 2016 Link
1I Matterhorn Cryosphere Observatory 2015 Link
Z3 AlpArray Temporary Network 2015 Link
GU Regional Network NW Italy 1967 Link
OX Northeast Italy OGS 2016 Link
IV Rete Sismica Nazionale (INGV) 2005 Link
8C Mont-Blanc & Vallorcine Monitoring (RESIF-SISMOB) 2020 Link
FR EPOS-France Broad-Band Network (RLBP) 1995 Link
RA RESIF-RAP French Accelerometric Network 1995 Link
BW BayernNetz, Germany 2001 Link
GR German Regional Seismic Network (GRSN) 1976 Link
LE Erdbebendienst SW Baden-Württemberg & RP 2009 Link
OE Austrian Seismic Network (ZAMG) 1987 Link
MN Mediterranean Very Broadband Seismographic Network (MedNet) 1990 Link

 

Dataset for benchmarking (Benchmark.zip), contains event and noise signals part of "test" partition in hdf5 files.

Contains event and noise signal combinations as used in performance test in publication, with randomly sampled noise (TestDataSet.zip) and impulsive noise signals (TestDataSet_ImpulsiveNoise.zip). The ReadMe File documents test procedure.

 

Related code

 

Files

Benchmark.zip

Files (91.6 GB)

Name Size Download all
md5:9c65e16066150a1d0b6bd6da82f70a06
3.9 GB Preview Download
md5:5a8809843f15b932c1919eca94a906fa
8.3 MB Preview Download
md5:9b6667030a2a9de2f767e90ec3bc7698
31.4 MB Preview Download
md5:93e9f65345493987602b7c893c3d113f
11.7 GB Download
md5:29586f61066bc34912898d23fab42968
76.0 GB Download

Additional details

Related works

References
Software: 10.5281/zenodo.17935347 (DOI)

Funding

European Commission
TRANSFORM2 - TowaRds AdvaNced multidiSciplinary Fault ObseRvatory systeMs² 101188365