Scaling Self-Supervised Representation Learning for Symbolic Piano Performance

Louis Bradshaw; Alexander Spangher; Honglu Fan; Stella Biderman; Simon Colton

doi:10.5281/zenodo.17706484

There is a newer version of the record available.

Published September 21, 2025 | Version v1

Conference paper Open

Scaling Self-Supervised Representation Learning for Symbolic Piano Performance

We study the capabilities of generative autoregressive transformer models trained on large amounts of symbolic solo-piano transcriptions. After first pre-training on approximately 60,000 hours of music, we use a comparatively smaller, high-quality subset, to fine-tune models to produce coherent musical generations, perform symbolic classification tasks, and by adapting the SimCLR framework to symbolic music, produce general purpose contrastive MIDI embeddings. The resulting models perform well on a variety of standard benchmarks, demonstrating the generalizability of the autoregressive representations learned during pre-training, often requiring only a few hundred gradient updates to fully specialize to different generative and MIR tasks.

Files

000052.pdf

Files (549.1 kB)

Name	Size	Download all
000052.pdf md5:a0a5568fe45a0c52579b5a8b4cd0f5a2	549.1 kB	Preview Download

191

Views

156

Downloads

Show more details

	All versions	This version
Views	191	112
Downloads	156	128
Data volume	90.1 MB	73.6 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 26th International Society for Music Information Retrieval Conference, 465-473. Daejeon, South Korea.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2025) , Daejeon, South Korea and Online, September 21-25, 2025

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: November 25, 2025
Modified: November 25, 2025

Scaling Self-Supervised Representation Learning for Symbolic Piano Performance

Authors/Creators

Description

Files

000052.pdf

Files (549.1 kB)