Published May 26, 2021 | Version v1
Dataset Open

The 3D-video Point Cloud Musicians Dataset

  • 1. University of Music and Performing Arts Vienna

Description

DESCRIPTION
The 3D-video Point Cloud Musicians Dataset provides 3D-video point clouds of several performers playing five different instruments: cello, doublebass, guitar, saxophone, and violin. The full 3D-video recordings span 1 hour of duration with an average of 12 different performers for each instrument. All musicians point clouds are centered in the origin and the axes have the following meaning: the body face direction is the z-axis, stature direction is the y-axis, and the side direction is the x-axis. The provided musician point clouds are down-sampled from their full resolution by using a voxel size of 0.01.

RECORDINGS
Recordings were conducted using a single Azure Kinect DK placed one meter above the floor and capturing a frontal view of the musician at a distance of two meters. The Azure Kinect DK depth camera was capturing a 75°x65° field of view with a 640x576 resolution while the Azure Kinect DK color camera was capturing with a 1920x1080 resolution. Both cameras were recording at 15 fps and Open3D library was then used to align depth and color streams and generate a point cloud for each frame. Each frame is saved as .ply format file.

NAMING CONVENTION
The folders in the dataset follow this structure: Instrument -> Person. The captured frames are named according to their temporal position using five numerical characters, i.e. %5d.ply

Files

frames.zip

Files (14.9 GB)

Name Size Download all
md5:5ddf9d56f5e5ef059481c79c9c8f5946
14.9 GB Preview Download

Additional details

Funding

VRACE – VRACE - Virtual Reality Audio for Cyber Environments 812719
European Commission