Planned intervention: On Thursday 19/09 between 05:30-06:30 (UTC), Zenodo will be unavailable because of a scheduled upgrade in our storage cluster.
Published November 13, 2023 | Version v1
Conference paper Open

Reconstructing Human Expressiveness in Piano Performances with a Transformer Network

Description

Capturing intricate and subtle variations in human expressiveness in music performance using computational approaches is challenging. In this paper, we propose a novel approach for reconstructing human expressiveness in piano performance with a multi-layer bi-directional Transformer encoder. To address the needs for large amounts of accurately captured and score-aligned performance data in training neural networks, we use transcribed scores obtained from an existing transcription model to train our model. We integrate pianist identities to control the sampling process and explore the ability of our system to model variations in expressiveness for different pianists. The system is evaluated through statistical analysis of generated expressive performances and a listening test. Overall, the results suggest that our method achieves state-of-the-art in generating human-like piano performances from transcribed scores, while fully and consistently reconstructing human expressiveness poses further challenges. Our codes are released at https://github.com/BetsyTang/RHEPP-Transformer.

Files

cmmr2023_2b-3.pdf

Files (1.7 MB)

Name Size Download all
md5:68b7b9cb06f55d73f81fd1e2ae41be85
1.7 MB Preview Download