Dispformer Datasets: Synthetic and Real-World datasets for Surface Wave Dispersion Curve Inversion
Description
Dispformer: Synthetic and Real-World datasets for Surface Wave Dispersion Curve Inversion
This dataset is designed for surface wave dispersion curve inversion, particularly suited for deep learning-based inversion studies. It provides both synthetic and real-world datasets, including global and local models, enabling the evaluation of various inversion methods such as zero-shot and few-shot strategies. Researchers can refer to the DispFormer framework for details on the model and the corresponding code used for inversion tasks. The related paper for DispFormer can be found here.
For further details on how to use the datasets and the associated neural network models, visit the GitHub repository: https://github.com/liufeng2317
Datasets Overview
The LITHO1.0 global synthetic dataset is primarily used for pre-training the model. The Central and Western US Dataset (CWD) and Continental China Dataset (CCD) are used to validate the effectiveness of zero-shot and few-shot strategies. Finally, the datasets retrieved from the China Seismological Reference Model (CSRM) are used to test the model's performance on real-world data.
| Dataset | Samples | Period | Max Depth | Tags | Reference | 
| LITHO1.0 | 40,962 | 1-100 s | 200 km | Global Synthetic | Masters et al., 2014 | 
| CWD | 6,803 | 10 - 60 s | 120 km | Local Synthetic | Shen et al., 2013 | 
| CCD | 4,527 | 5 - 80 s | 200 km | Local Synthetic | Shen et al., 2016 | 
| CSRM | 12,705 | 8 - 70 s | 120 km | Local Real | Xiao et al., 2024 | 
Data Files
- depth_vp_vs_rho.npz: Contains a 1-D velocity model, including depth, P-wave velocity, S-wave velocity, and density.
- depth_vs.npz: Contains a 1-D velocity model with depth and S-wave velocity (used for training the DispFormer).
- lon_lat.npz,- lat_glat_lon.npz: Contain the station locations for the observed data.
- period_phase_group.npz: Contains synthetic and observed dispersion curves, including period, phase velocity, and group velocity.
- Folders train_data,valid_data, andtest_data: Contain data directly used for training, validating, and testing the model
Example of Loading Data Using Python
Here is an example of how to load the data using Python:
import numpy as np data_path = "" # Specify your data path all_disp_loc = np.load(data_path)["data"] # Load data from the file
For more details on how to use the data, please refer to the DispFormer GitHub repository.
References
- Liu, F., Deng, B., Su, R., Bai, L. & Ouyang, W.. DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications[J]. arXiv preprint arXiv:2501.04366, 2025.
- W. Shen, M. H. Ritzwoller, and V. Schulte‐Pelkum, “A 3‐D model of the crust and uppermost mantle beneath the Central and Western US by joint inversion of receiver functions and surface wave dispersion,” JGR Solid Earth, vol. 118, no. 1, pp. 262–276, Jan. 2013, doi: 10.1029/2012JB009602.
- S. C. Griffiths, B. R. Cox, E. M. Rathje, and D. P. Teague, “Surface-wave dispersion approach for evaluating statistical models that account for shear-wave velocity uncertainty,” J. Geotech. Geoenviron. Eng., vol. 142, no. 11, p. 4016061, Nov. 2016, doi: 10.1061/(ASCE)GT.1943-5606.0001552.
- M. E. Pasyanos, T. G. Masters, G. Laske, and Z. Ma, “LITHO1.0: An updated crust and lithospheric model of the Earth,” JGR Solid Earth, vol. 119, no. 3, pp. 2153–2173, Mar. 2014, doi: 10.1002/2013JB010626.
- X. Xiao et al., “CSRM‐1.0: A China Seismological Reference Model,” JGR Solid Earth, vol. 129, no. 9, p. e2024JB029520, Sep. 2024, doi: 10.1029/2024JB029520.
Files
      
        DispFormer.zip
        
      
    
    
      
        Files
         (1.2 GB)
        
      
    
    | Name | Size | Download all | 
|---|---|---|
| md5:83ce99b12a63d6ecbae8990a01da96d0 | 1.2 GB | Preview Download | 
| md5:6eabc47d61963335b4820c2c9279807c | 3.6 kB | Preview Download | 
Additional details
Additional titles
- Subtitle (English)
- DispFormer Training Datasets
Identifiers
Dates
- Created
- 
      2025
              
                Software
              
            
          - Repository URL
- https://github.com/liufeng2317/DispFormer
- Programming language
- Python
- Development Status
- Active