A Modular Deep Learning Framework for Scene Understanding in Augmented Reality Applications

Li, Vladislav; Villarini, Barbara; Nebel, Jean-Christophe; Argyriou, Vasileios

doi:10.1109/IAICT59002.2023.10205667

Published August 9, 2023 | Version v1

Conference paper Open

A Modular Deep Learning Framework for Scene Understanding in Augmented Reality Applications

1. Kingston University

Taking as input natural images and videos, augmented reality (AR) applications aim to enhance the real world with superimposed digital contents, enabling interaction between the user and the environment. One important step in this process is automatic scene analysis and understanding, which should be performed both in real time and with a good level of object recognition accuracy. In this work, an end-to-end framework based on the combination of a Super Resolution network with a detection and recognition deep network has been proposed to increase performance and lower processing time. This novel approach has been evaluated on two different datasets: the popular COCO dataset, whose real images are used for benchmarking many different computer vision tasks, and a generated dataset with synthetic images recreating a variety of environmental, lighting, and acquisition conditions. The evaluation analysis is focused on small objects, which are more challenging to correctly detect and recognise. The results show that the Average Precision is higher for small and low-resolution objects for the proposed end-to-end approach in most of the selected conditions.

Files

A_Modular_Deep_Learning_Framework_for_Scene_Understanding_in_Augmented_Reality_Applications.pdf

Files (5.5 MB)

Name	Size	Download all
A_Modular_Deep_Learning_Framework_for_Scene_Understanding_in_Augmented_Reality_Applications.pdf md5:3de2d0b4a48e44ac6801afa14306b1da	5.5 MB	Preview Download

Additional details

European Commission
TALON - Autonomous and Self-organized Artificial Intelligent Orchestrator for a Greener Industry 4.0 101070181

	All versions	This version
Views	13	13
Downloads	12	12
Data volume	82.5 MB	82.5 MB

A Modular Deep Learning Framework for Scene Understanding in Augmented Reality Applications

Authors/Creators

Description

Files

A_Modular_Deep_Learning_Framework_for_Scene_Understanding_in_Augmented_Reality_Applications.pdf

Files (5.5 MB)

Additional details

Funding