Published March 17, 2023 | Version v1
Conference paper Open

SegmentCodeList: Unsupervised Representation Learning for Human Skeleton Data Retrieval

  • 1. Masaryk University, Brno, Czech Republic
  • 2. ISTI-CNR, Pisa, Italy

Description

Recent progress in pose-estimation methods enables the extraction of sufficiently-precise 3D human skeleton data from ordinary videos, which offers great opportunities for a wide range of applications. However, such spatio-temporal data are typically extracted in the form of a continuous skeleton sequence without any information about semantic segmentation or annotation. To make the extracted data reusable for further processing, there is a need to access them based on their content. In this paper, we introduce a universal retrieval approach that compares any two skeleton sequences based on temporal order and similarities of their underlying segments. The similarity of segments is determined by their content-preserving low-dimensional code representation that is learned using the Variational AutoEncoder principle in an unsupervised way. The quality of the proposed representation is validated in retrieval and classification scenarios; our proposal outperforms the state-of-the-art approaches in effectiveness and reaches speed-ups up to 64x on common skeleton sequence datasets.

Notes

This research was supported by ERDF "CyberSecurity, CyberCrime and Critical Information Infrastructures Center of Excellence" (No. CZ.02.1.01/0.0/0.0/16_019/0000822), by AI4Media - A European Excellence Centre for Media, Society, and Democracy (EC, H2020 n. 951911), and by SUN - Social and hUman ceNtered XR (EC, Horizon Europe n. 101092612).

Files

preprint.pdf

Files (596.0 kB)

Name Size Download all
md5:64b6b05a6b16dc22c978e696b6302c6e
596.0 kB Preview Download

Additional details

Funding

European Commission
AI4Media - A European Excellence Centre for Media, Society and Democracy 951911
European Commission
SUN - Social and hUman ceNtered XR 101092612