Published August 30, 2021 | Version 1.1
Dataset Open

ASVspoof2019LA-Sim: Augmented Dataset for An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems

  • 1. University of Rochester
  • 2. Beijing Institute of Technology

Description

This is the dataset we augmented to study the channel effects for anti-spoofing. For more details, please refer to our Interspeech 2021 paper: "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems".

Proceeding: https://www.isca-speech.org/archive/interspeech_2021/zhang21ea_interspeech.html

Arxiv: https://arxiv.org/pdf/2104.01320.pdf

Code: https://github.com/yzyouzhang/Empirical-Channel-CM

Contact: you.zhang@rochester.edu

Version 1.0 contains the training and the development set. We have added the evaluation set in version 1.1 but deleted the training set due to the size limitation, but you can still access the training set in version 1.0.

Please check it out.

To extract the files, please use the following commands:

cat eval.tar.gz-part* > eval.tar.gz

tar -xvzf *.tar.gz

After concatenation, to make sure the download is complete, you can check with the following:

md5sum *.tar.gz

15dea7d28b126994bb6b159778f706af  dev.tar.gz
0615052b34ca6c7f58505eaa8647844f  eval.tar.gz
3058dd9d407f3c9ae697acca8c34a6c3  train.tar.gz

Thanks.

Files

Files (98.0 GB)

Name Size Download all
md5:15dea7d28b126994bb6b159778f706af
26.4 GB Download
md5:ae8361ed93f91b9b62f84afdc1dd328d
48.3 GB Download
md5:4034eda6e3b6c9a9393dece30143ef73
23.2 GB Download

Additional details

Funding

BIGDATA: F: Audio-Visual Scene Understanding 1741472
National Science Foundation