Published September 19, 2025 | Version 0
Dataset Open

CodecDeepfakeDetection

Description

A dataset for research on codec-based deepfake detection. A related paper is under review.

A Hugging Face version of the dataset is also available.

Files

README.txt

Files (14.0 GB)

Checksum                                Size
md5:0970486b8f4fda320b4f00199dbf47d1    1.2 GB
md5:567ee72806e754629db84264e9b54ba5    692.2 MB
md5:bfca1af1e7a00b8a5cc91d034f6735f3    919.2 MB
md5:ca0725ff436bf649a61f9ec9447d9164    4.0 GB
md5:8a06eb8e24dbb8ba6824c8bc083ef4c5    7.2 GB
md5:b9ed403c7b69b912742369f6f15cdc8e    330.2 kB
md5:db1d6d874d345b9b1ee29cbd866eb386    6.3 kB
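
The MD5 checksums listed above can be used to check that a download completed without corruption. The following is a minimal sketch using only the Python standard library. The function names and the local file path are hypothetical, since the record does not show file names next to the checksums; substitute the path of whichever file you downloaded.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read until an empty bytes object signals end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected):
    """Compare a file's digest with a listed value such as 'md5:0970...'."""
    # Accept the value with or without the 'md5:' prefix used in the file list.
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex


# Hypothetical usage; "downloaded_file" is a placeholder, not a real file name:
# verify_md5("downloaded_file", "md5:0970486b8f4fda320b4f00199dbf47d1")
```

Reading in chunks keeps memory use bounded, which matters for the multi-gigabyte files in this record. MD5 is adequate here for detecting transfer errors, but it is not a safeguard against deliberate tampering.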

Additional details

References

  • [1] Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md Sahidullah, Tomi Kinnunen, Nicholas Evans, Kong Aik Lee, Junichi Yamagishi, Myeonghun Jeong, Ge Zhu, Yongyi Zang, You Zhang, Soumi Maiti, Florian Lux, Nicolas Müller, Wangyou Zhang, Chengzhe Sun, Shuwei Hou, Siwei Lyu, Sébastien Le Maguer, Cheng Gong, Hanjie Guo, Liping Chen, and Vishwanath Singh. 2026. ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech. Computer Speech & Language, vol. 95, article 101825. https://www.sciencedirect.com/science/article/pii/S0885230825000506
  • [2] Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md Sahidullah, Tomi Kinnunen, Nicholas Evans, Kong Aik Lee, and Junichi Yamagishi. 2024. ASVspoof 5: Crowdsourced speech data, deepfakes, and adversarial attacks at scale. In ASVspoof Workshop 2024, pp. 1-8. https://doi.org/10.21437/ASVspoof.2024-1