Published June 1, 2024 | Version v1
Dataset Open

PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance

Description

Artificial intelligence techniques for education have recently received increasing attention, yet designing effective musical instrument instruction systems remains an open problem. Although key presses can be derived directly from sheet music, the transitional movements between key presses require more extensive guidance in piano performance. In this work, we construct a piano-hand motion generation benchmark to guide hand movements and fingerings for piano playing. To this end, we collect an annotated dataset, PianoMotion10M, consisting of 116 hours of piano playing videos captured from a bird's-eye view, with 10 million annotated hand poses. We also introduce a powerful baseline model that generates hand motions from piano audio through a position predictor and a position-guided gesture generator. Furthermore, a series of evaluation metrics is designed to assess the performance of the baseline model, including motion similarity, smoothness, positional accuracy of the left and right hands, and overall fidelity of the movement distribution. Although piano key presses corresponding to music scores or audio are already accessible, PianoMotion10M aims to provide guidance on piano fingering for instructional purposes.

Files

annotation.zip

Files (7.0 GB)

Checksum, Size
md5:db46c11370e63ea7ef318d0ddf207b8e, 4.4 GB
md5:25da26e68686ee3f5ff97fefea6850a9, 2.6 GB
md5:0e07b3abf054be649b45e743e368853a, 13.9 MB
md5:c3c4f717e748cd59344a13f1ba350c6b, 21.9 kB
md5:2f73c06d5185540d009433aa87ca73a3, 19.7 kB
md5:ce945d0185006848ea42b29bd5400329, 2.2 kB
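
The md5 values listed with each file can be used to confirm that a download completed without corruption. The following is a minimal sketch using only the Python standard library. The helper names `md5_of_file` and `matches_listed_checksum` are illustrative and are not part of the dataset or any official tooling. Because the file names do not appear in the listing above, the path in the usage note is a placeholder.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks.

    Chunked reading keeps memory use flat even for multi-gigabyte archives.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    # Drop an optional 'md5:' prefix and normalise case before comparing.
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listed_checksum("path/to/downloaded_file", "md5:db46c11370e63ea7ef318d0ddf207b8e")` would return `True` only if the placeholder path points to the file that checksum belongs to; a `False` result suggests an incomplete or corrupted download, or a mismatch between file and checksum.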

Additional details

Related works

Is described by
Dataset: arXiv:2406.09326 (arXiv)