Towards objective evaluation of audio time-scale modification methods

Leonardo Fierro; Vesa Välimäki

doi:10.5281/zenodo.3898936

Published June 17, 2020 | Version v1

Conference paper Open

Towards objective evaluation of audio time-scale modification methods

The need for high-quality time-scale modification of audio is increasing, as media streaming services are providing new related functionalities to their users. The main goal of a time-stretching method is to preserve the pitch and the subjective quality of the different components of the audio signal, namely transients, noise, and tonal components. Many solutions have been proposed throughout the years, with various results depending on the kind of processed audio input. This paper introduces an evaluation method for audio time-scaling algorithms based on a recent fuzzy time-frequency decomposition, which reveals the energy of the tonal, transient, and noise components in the original and stretched sounds. From the energy curves, typical impairments, such as transient smearing and the loss of tonality, can be observed. This analysis approach is compared with the subjective preferences of different techniques. This leads to suggestions for possible improvements of future algorithms. The ultimate goal is having an objective evaluation method which matches the subjective quality assessment.

Files

SMCCIM_2020_paper_133.pdf

Files (981.9 kB)

Name	Size	Download all
SMCCIM_2020_paper_133.pdf md5:48992cb61ee95d5b6eebd25d42eebd2c	981.9 kB	Preview Download

	All versions	This version
Views	38	38
Downloads	28	28
Data volume	30.4 MB	30.4 MB

Towards objective evaluation of audio time-scale modification methods

Creators

Description

Files

SMCCIM_2020_paper_133.pdf

Files (981.9 kB)