PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing
Description
This is part of the dataset we curated based on VCTK to study partial speech deepfake detection in the era of neural speech editing. For more details, please refer to our Interspeech 2025 paper: "PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing".
In the paper, we curated four subsets: E1: VoiceCraft, E2: SSR-Speech, E3: Audiobox-Speech, and E4: Audiobox. Adhering to Audiobox's license, we cannot release the E3 and E4 subsets.
The folder structure is as follows:
PartialEdit/
├── PartialEdit_E1E2.csv
├── E1/
│ ├── p225/
│ │ ├── p225_001_edited_partial_16k.wav
│ │ ├── p225_002_edited_partial_16k.wav
│ │ └── ...
│ ├── p231/
│ │ ├── p231_001_edited_partial_16k.wav
│ │ ├── p231_002_edited_partial_16k.wav
│ │ └── ...
│ └── ...
├── E1-Codec/
│ └── (same structure as E1)
├── E2/
│ └── (same structure as E1)
├── E2-Codec/
│ └── (same structure as E1)
└── modified_txt/
├── p225/
│ ├── p225_001_modified.txt
│ ├── p225_002_modified.txt
│ ├── p225_003_modified.txt
│ └── ...
├── p231/
│ ├── p231_001_modified.txt
│ ├── p231_002_modified.txt
│ └── ...
└── ...
This is version 1.0, and we will include links to the paper and demo page soon.
The `PartialEdit_E1E2.csv` file contains information about the edited regions in each audio file. Each row represents the following columns:
- `filename`: The name of the audio file.
- `start of the edited region (s)`: The starting time (in seconds) of the first edited region.
- `end of the edited region (s)`: The ending time (in seconds) of the first edited region.
- `total duration (s)`: The total duration (in seconds) of the audio file.
If there are two edited regions within a file, the row format expands to include:
- `filename`: The name of the audio file.
- `start of the edited region (s)`: The starting time (in seconds) of the first edited region.
- `end of the edited region (s)`: The ending time (in seconds) of the first edited region.
- `start of the second edited region (s)`: The starting time (in seconds) of the second edited region.
- `end of the second edited region (s)`: The ending time (in seconds) of the second edited region.
- `total duration (s)`: The total duration (in seconds) of the audio file.
To make sure the download is complete, you can check the MD5 code with the following command:
md5sum *
Files
PartialEdit_E1E2.csv
Files
(21.1 GB)
Name | Size | Download all |
---|---|---|
md5:d14854d7a454f60a4e748ab0082549fe
|
3.4 GB | Download |
md5:aa8f1636ec374d5101eef6c51afd757f
|
3.5 GB | Download |
md5:fc30eb9380a841a37465357dd9c4ef68
|
9.0 GB | Download |
md5:702d8be8cdbbe37dd507abd87ebdd6c6
|
5.2 GB | Download |
md5:4660ce5d6e38c77448d8eb15d7ae1402
|
1.9 MB | Download |
md5:e667aa0bfd728cf26618f6dd6cc683d0
|
5.8 MB | Preview Download |
Additional details
Related works
- Is derived from
- Dataset: 10.7488/ds/2645 (DOI)