TRACK: A New Method from a Re-examination of Deep Architectures for Head Motion Prediction in 360-degree Videos

doi:10.1109/TPAMI.2021.3070520

Published April 5, 2021 | Version v1

Journal article Open

TRACK: A New Method from a Re-examination of Deep Architectures for Head Motion Prediction in 360-degree Videos

1. Université Côte d'Azur, CNRS, I3S
2. Université Côte d'Azur, CNRS, I3S and IUF
3. Université Côte d'Azur, CNRS, INRIA, I3S

We consider predicting the user's head motion in 360° videos, with 2 modalities only: the past user's positions and the video content (not knowing other users' traces). We make two main contributions. First, we re-examine existing deep-learning approaches for this problem and identify hidden flaws from a thorough root-cause analysis. Second, from the results of this analysis, we design a new proposal establishing state-of-the-art performance.
First, re-assessing the existing methods that use both modalities, we obtain the surprising result that they all perform worse than baselines using the user’s trajectory only. A root-cause analysis of the metrics, datasets and neural architectures shows in particular that (i) the content can inform the prediction for horizons longer than 2 to 3 sec. (existing methods consider shorter horizons), and that (ii) to compete with the baselines, it is necessary to have a recurrent unit dedicated to process the positions, but this is not sufficient.
Second, from a re-examination of the problem supported with the concept of Structural-RNN, we design a new deep neural architecture, named TRACK. TRACK achieves state-of-the-art performance on all considered datasets and prediction horizons, outperforming competitors by up to 20% on focus-type videos and horizons 2-5 seconds.

The entire framework (codes and datasets) is online and received an ACM reproducibility badge https://gitlab.com/miguelfromeror/head-motion-prediction

Files

final_TPAMI_2021.pdf

Files (1.7 MB)

Name	Size	Download all
final_TPAMI_2021.pdf md5:25236be7cbb21f0aabc76b3687dbd645	1.7 MB	Preview Download

Additional details

AI4Media – A European Excellence Centre for Media, Society and Democracy 951911: European Commission

	All versions	This version
Views	388	384
Downloads	339	333
Data volume	617.1 MB	606.7 MB

TRACK: A New Method from a Re-examination of Deep Architectures for Head Motion Prediction in 360-degree Videos

Creators

Description

Files

final_TPAMI_2021.pdf

Files (1.7 MB)

Additional details

Funding