MARVEL - D3.1: Multimodal and privacy-aware audio-visual intelligence – initial version

Alexandros Iosifidis

doi:10.5281/zenodo.6821318

There is a newer version of the record available.

Published July 12, 2022 | Version v1

Project deliverable Open

Alexandros Iosifidis¹

1. AU

This document describes the initial version of the methodologies proposed by MARVEL partners towards the realisation of the Audio, Visual and Multimodal AI Subsystem of the MARVEL architecture. These include methods for Sound Event Detection, Sound Event Localisation and Detection, Automated Audio Captioning, Visual Anomaly Detection, Visual Crowd Counting, Audio-Visual Crowd Counting, as well as methodologies for improving the training and efficiency of AI models under supervised, unsupervised, and cross-modal contrastive learning settings. The effectiveness of these methods is compared against recent baselines, towards achieving the AI methodology-related objectives of the MARVEL project.

Files

Name	Checksum	Size
MARVEL-d3.1.pdf	md5:d8880d78a7afc4f30cee300324f8bfbb	9.1 MB

Additional details

European Commission
MARVEL - Multimodal Extreme Scale Data Analytics for Smart Cities Environments 957337

	All versions	This version
Views	405	291
Downloads	519	390
Data volume	4.9 GB	3.8 GB

DOI

10.5281/zenodo.6821318

Resource type

Project deliverable

Publisher

Zenodo

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: July 12, 2022
Modified: July 16, 2024
