DAPS (Device and Produced Speech) Dataset

Mysore, Gautham J.

doi:10.5281/zenodo.4660670

Published May 20, 2014 | Version 1.0

Dataset Open

Mysore, Gautham J.¹

1. Adobe Research

The DAPS (Device and Produced Speech) dataset is a collection of aligned versions of professionally produced studio speech recordings and recordings of the same speech on common consumer devices (tablet and smartphone) in real-world environments. It has 15 versions of audio (3 professional versions and 12 consumer device/real-world environment combinations). Each version consists of about 4 1/2 hours of data (about 14 minutes from each of 20 speakers). Please see this paper for a detailed description of the dataset:

Gautham J. Mysore, “Can We Automatically Transform Speech Recorded on Common Consumer Devices in Real-World Environments into Professional Production Quality Speech? - A Dataset, Insights, and Challenges,” IEEE Signal Processing Letters, Vol. 22, No. 8, August 2015.

The primary goal of the dataset is to help develop methods that automatically convert real-world device recordings into professional-sounding recordings. It can also be used for other applications such as voice conversion, traditional speech enhancement, and automatic production of studio recordings.

Files

Name	Size	MD5
daps.tar.gz	16.1 GB	303c130b7ce2e02b59c7ca5cd595a89c
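Because the archive is large, it can be worth checking the download against the MD5 checksum listed above before unpacking it. The following is a minimal sketch, assuming the archive was saved as daps.tar.gz in the current directory; the function names md5_of_file and verify_archive are hypothetical helpers introduced for illustration and are not part of the dataset.

```python
import hashlib
from pathlib import Path

# MD5 checksum as listed for daps.tar.gz on this page.
EXPECTED_MD5 = "303c130b7ce2e02b59c7ca5cd595a89c"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected checksum."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local file name; adjust to wherever the archive was saved.
    archive = Path("daps.tar.gz")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in 1 MiB chunks keeps memory use flat regardless of the archive size. A mismatch usually points to an incomplete or corrupted download.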

Additional details

Is supplement to: Journal article, doi:10.1109/LSP.2014.2379648

