Published October 21, 2019 | Version v1
Conference paper · Open Access

Multimodal Fusion of Appearance Features, Optical Flow and Accelerometer Data for Speech Detection

Description

In this paper we examine the task of automatically detecting speech without microphones, using an overhead camera and wearable accelerometers. For this purpose, we propose extracting hand-crafted appearance and optical-flow features from the video modality, and time-domain features from the accelerometer data. We evaluate the performance of the individual modalities on a large dataset of over 25 hours of standing conversation between multiple individuals. Finally, we show that applying a multimodal late-fusion technique leads to a performance boost in most cases.
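Late fusion of this kind typically combines the per-modality classifier outputs at the score level. As a minimal sketch (the paper's actual fusion rule, classifiers, and weights are not specified in this abstract; the function name, inputs, and equal weights below are illustrative assumptions), a weighted average of speech posteriors from the three streams could look like:

```python
import numpy as np

def late_fusion(prob_appearance, prob_flow, prob_accel,
                weights=(1 / 3, 1 / 3, 1 / 3), threshold=0.5):
    """Score-level late fusion of three modality classifiers.

    Each input is an array of shape (n_frames,) holding that modality's
    posterior probability that the subject is speaking. The weights and
    threshold are hypothetical placeholders, not values from the paper.
    Returns the binary speech decision and the fused score per frame.
    """
    probs = np.stack([prob_appearance, prob_flow, prob_accel])  # (3, n_frames)
    w = np.asarray(weights, dtype=float)[:, None]               # (3, 1)
    fused = (w * probs).sum(axis=0) / w.sum()                   # weighted mean
    return fused >= threshold, fused
```

For example, frame-level scores of 0.9 (appearance), 0.2 (optical flow), and 0.7 (accelerometer) fuse to 0.6 under equal weights, yielding a "speaking" decision; the fused score can also smooth over frames where a single modality fails (e.g., visual occlusion).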

Files

giannakeris2019noaudio.pdf (459.8 kB)
md5:8b918e5baaa6f73dcdb1a386ece9eb21

Additional details

Funding

SUITCEYES – Smart, User-friendly, Interactive, Tactual, Cognition-Enhancer that Yields Extended Sensosphere - Appropriating sensor technologies, machine learning, gamification and smart haptic interfaces 780814
European Commission