Published May 26, 2025 | Version 1.0
Dataset Open

PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing

  • 1. ROR icon University of Rochester
  • 2. ROR icon Brno University of Technology

Description

This is part of the dataset we curated based on VCTK to study partial speech deepfake detection in the era of neural speech editing. For more details, please refer to our Interspeech 2025 paper: "PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing".

In the paper, we curated four subsets: E1: VoiceCraft, E2: SSR-Speech, E3: Audiobox-Speech, and E4: Audiobox. Adhering to Audiobox's license, we cannot release the E3 and E4 subsets.

The folder structure is as follows:

PartialEdit/
├── PartialEdit_E1E2.csv
├── E1/
│ ├── p225/
│ │ ├── p225_001_edited_partial_16k.wav
│ │ ├── p225_002_edited_partial_16k.wav
│ │ └── ...
│ ├── p231/
│ │ ├── p231_001_edited_partial_16k.wav
│ │ ├── p231_002_edited_partial_16k.wav
│ │ └── ...
│ └── ...
├── E1-Codec/
│ └── (same structure as E1)
├── E2/
│ └── (same structure as E1)
├── E2-Codec/
│ └── (same structure as E1)
└── modified_txt/
├── p225/
│ ├── p225_001_modified.txt
│ ├── p225_002_modified.txt
│ ├── p225_003_modified.txt
│ └── ...
├── p231/
│ ├── p231_001_modified.txt
│ ├── p231_002_modified.txt
│ └── ...
└── ...

This is version 1.0, and we will include links to the paper and demo page soon.

The `PartialEdit_E1E2.csv` file contains information about the edited regions in each audio file. Each row represents the following columns:

- `filename`: The name of the audio file.
- `start of the edited region (s)`: The starting time (in seconds) of the first edited region.
- `end of the edited region (s)`: The ending time (in seconds) of the first edited region.
- `total duration (s)`: The total duration (in seconds) of the audio file.

If there are two edited regions within a file, the row format expands to include:

- `filename`: The name of the audio file.
- `start of the edited region (s)`: The starting time (in seconds) of the first edited region.
- `end of the edited region (s)`: The ending time (in seconds) of the first edited region.
- `start of the second edited region (s)`: The starting time (in seconds) of the second edited region.
- `end of the second edited region (s)`: The ending time (in seconds) of the second edited region.
- `total duration (s)`: The total duration (in seconds) of the audio file.

To make sure the download is complete, you can check the MD5 code with the following command:

md5sum *

Files

PartialEdit_E1E2.csv

Files (21.1 GB)

Name Size Download all
md5:d14854d7a454f60a4e748ab0082549fe
3.4 GB Download
md5:aa8f1636ec374d5101eef6c51afd757f
3.5 GB Download
md5:fc30eb9380a841a37465357dd9c4ef68
9.0 GB Download
md5:702d8be8cdbbe37dd507abd87ebdd6c6
5.2 GB Download
md5:4660ce5d6e38c77448d8eb15d7ae1402
1.9 MB Download
md5:e667aa0bfd728cf26618f6dd6cc683d0
5.8 MB Preview Download

Additional details

Related works

Is derived from
Dataset: 10.7488/ds/2645 (DOI)