ACOUSTIC SCENE CLASSIFICATION: FROM A HYBRID CLASSIFIER TO DEEP LEARNING

Anastasios Vafeiadis; Dimitris Kalatzis; Konstantinos Votis; Dimitrios Giakoumis; Dimitrios Tzovaras; Liming Chen; Raouf Hamzaoui

doi:10.5281/zenodo.3760448

Published November 20, 2017 | Version v1

Conference paper Open

1. Center for Research and Technology Hellas
2. De Montfort University

This report describes our contribution to the 2017 Detection and
Classification of Acoustic Scenes and Events (DCASE) challenge.
We investigated two approaches to the acoustic scene classification
task. First, we combined time-domain and frequency-domain features
with a hybrid Support Vector Machine - Hidden Markov Model (SVM-HMM)
classifier, achieving an average accuracy over four folds of 80.9% on
the development dataset and 61.0% on the evaluation dataset. Second,
by exploiting data augmentation techniques and using the whole segment
(as opposed to splitting it into sub-sequences) as input, we boosted
the accuracy of our Convolutional Neural Network (CNN) system to 95.9%.
However, owing to the small number of kernels used in the CNN and its
failure to capture the global information of the audio signals, it
achieved an accuracy of only 49.5% on the evaluation dataset. Our two
approaches outperformed the DCASE baseline method, which uses log-mel
band energies for feature extraction and a Multi-Layer Perceptron (MLP)
and achieves an average accuracy over four folds of 74.8%.
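
For readers unfamiliar with the log-mel band energies mentioned above, the following is a minimal sketch of how such features can be computed for a single audio frame. It is not the DCASE baseline implementation. The function names, the naive DFT, the HTK-style mel formula, the 8 bands and the 64-sample frame are all assumptions chosen to keep the example short and dependency-free.

```python
import cmath
import math


def hz_to_mel(hz):
    """Convert Hz to mel (HTK-style formula, assumed here)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    """Naive DFT power spectrum of one frame, bins 0..N/2."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def mel_filterbank(n_bands, n_fft, sample_rate):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    mel_max = hz_to_mel(sample_rate / 2.0)
    # n_bands + 2 edge frequencies (in Hz) from 0 up to the Nyquist frequency
    edges = [mel_to_hz(mel_max * i / (n_bands + 1)) for i in range(n_bands + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_fft // 2 + 1)]
    bank = []
    for b in range(n_bands):
        lo, centre, hi = edges[b], edges[b + 1], edges[b + 2]
        weights = []
        for f in bin_hz:
            if lo < f <= centre:
                weights.append((f - lo) / (centre - lo))  # rising slope
            elif centre < f < hi:
                weights.append((hi - f) / (hi - centre))  # falling slope
            else:
                weights.append(0.0)
        bank.append(weights)
    return bank


def log_mel_energies(frame, sample_rate, n_bands=8, eps=1e-10):
    """Log of the mel-weighted power in each band; eps avoids log(0)."""
    spec = power_spectrum(frame)
    bank = mel_filterbank(n_bands, len(frame), sample_rate)
    return [math.log(sum(w * p for w, p in zip(weights, spec)) + eps)
            for weights in bank]


# Illustrative input: a 1 kHz sine tone sampled at 8 kHz, 64 samples long.
sample_rate = 8000
frame = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(64)]
energies = log_mel_energies(frame, sample_rate)
```

In a full system, a vector like `energies` would be computed for every frame of a recording and passed to the classifier. Practical implementations normally use an FFT and a window function rather than the naive DFT shown here.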

Files

Name	Size
DCASE2017Workshop_Vafeiadis_135.pdf (md5:2c8fd0952c3d16f9b14e6f04739095e8)	388.9 kB

Additional details

Funding

European Commission
ACROSSING - Advanced TeChnologies and PlatfoRm fOr Smarter ASsisted LivING (grant 676157)
