Dataset: Earthquake Seismogram Denoiser (EQS-Denoiser)
Contributors
Researcher (3):
Description
This dataset contains preprocessed earthquake event and noise waveforms, supplementary to the publication:
"Towards Operational Earthquake Seismogram Denoising"
by Nikolaj Dahmen, John Clinton, Men-Andrin Meier, Luca Scarabello
Bulletin of the Seismological Society of America
Waveform data are provided in physical units [m/s], Z12 components and with a sampling rate of 100 Hz. Event signals have been preprocessed according to the methods described in the publication to reduce background noise (see Figure 2). Event signals have 18k samples with P arrival aligned at sample 6000, noise signals have 8k samples. Dataset statistics for event and noise waveforms are summarized in Figures S1 and S2. The dataset is separated into training, validation, and testing dataset, and are here available for publicly accessible networks. Metadata includes information on recording station, earthquake, and identified phase arrivals. A subset of the test dataset with fixed event and noise waveform pairs is provided in the benchmark dataset.
Data are provided in HDF5 format with accompanying metadata, compatible with SeisBench.
Proposed dataset structure:
-
Event waveforms:
-
Event_data/waveforms.hdf5 -
Event_data/metadata.csv
-
-
Noise waveforms:
-
Noise_data/waveforms_noise.hdf5 -
Noise_data/metadata_noise.csv
-
Importing dataset with SeisBench:
import seisbench.data as sbd
# load noise dataset
noise_ds = sbd.WaveformDataset("/Noise_data/", sampling_rate=100, component_order="Z12")
# Training examples: 274,240
# Development examples: 81,416
# Test examples: 40,096
# load event dataset
event_ds = sbd.WaveformDataset("/Event_data/", sampling_rate=100, component_order="Z12")
# Training examples: 19,022
# Development examples: 5,385
# Test examples: 2,695
Data Sources:
| Network | Source | Year | Link |
|---|---|---|---|
| CH | National Seismic Networks of Switzerland (SED, ETH Zurich) | 1983 | Link |
| 9S | Landslide deployments, Switzerland | 2012 | Link |
| 8D | Aftershock deployments, Switzerland | 2025 | Link |
| G2 | GEOBEST geothermal project networks | 2006 | Link |
| S | Seismology at School Program | 2008 | Link |
| C4 | CERN Seismic Network | 2016 | Link |
| 1I | Matterhorn Cryosphere Observatory | 2015 | Link |
| Z3 | AlpArray Temporary Network | 2015 | Link |
| GU | Regional Network NW Italy | 1967 | Link |
| OX | Northeast Italy OGS | 2016 | Link |
| IV | Rete Sismica Nazionale (INGV) | 2005 | Link |
| 8C | Mont-Blanc & Vallorcine Monitoring (RESIF-SISMOB) | 2020 | Link |
| FR | EPOS-France Broad-Band Network (RLBP) | 1995 | Link |
| RA | RESIF-RAP French Accelerometric Network | 1995 | Link |
| BW | BayernNetz, Germany | 2001 | Link |
| GR | German Regional Seismic Network (GRSN) | 1976 | Link |
| LE | Erdbebendienst SW Baden-Württemberg & RP | 2009 | Link |
| OE | Austrian Seismic Network (ZAMG) | 1987 | Link |
| MN | Mediterranean Very Broadband Seismographic Network (MedNet) | 1990 | Link |
Dataset for benchmarking (Benchmark.zip), contains event and noise signals part of "test" partition in hdf5 files.
Contains event and noise signal combinations as used in performance test in publication, with randomly sampled noise (TestDataSet.zip) and impulsive noise signals (TestDataSet_ImpulsiveNoise.zip). The ReadMe File documents test procedure.
Files
Benchmark.zip
Files
(91.6 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:9c65e16066150a1d0b6bd6da82f70a06
|
3.9 GB | Preview Download |
|
md5:5a8809843f15b932c1919eca94a906fa
|
8.3 MB | Preview Download |
|
md5:9b6667030a2a9de2f767e90ec3bc7698
|
31.4 MB | Preview Download |
|
md5:93e9f65345493987602b7c893c3d113f
|
11.7 GB | Download |
|
md5:29586f61066bc34912898d23fab42968
|
76.0 GB | Download |
Additional details
Related works
- References
- Software: 10.5281/zenodo.17935347 (DOI)