Published July 16, 2025 | Version v1
Model Open

Pre-trained models for paper MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling

Description

README

This repository contains the pre-trained models for our ISMIR 2025 paper:

MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modeling
by Jinging Tang, Xin Wang, Zhe Zhang, Junichi Yamagishi, Geraint Wiggins, and Geroge Fazekas.

Code and instructions for using these pretrained models can be found in the official git repository: https://github.com/nii-yamagishilab/MIDI-VALLE

Please follow the README in the git repository to use the pre-trained models.

COPYING

This pretrained model is licensed under the Creative Commons License: Attribution 4.0 International http://creativecommons.org/licenses/by/4.0/legalcode

Please see LICENSE.txt for the terms and conditions of this pretrained model.

 

ACKNOWLEDGMENTS

This work was supported by the UKRI Centre for Doctoral Training in Artificial Intelligence and Music [grant number EP/S022694/1] and the National Institute of Informatics (NII), Japan. J. Tang is a research student jointly funded by the China Scholarship Council [grant number 202008440382] and Queen Mary University of London. G. Wiggins received funding from the Flemish Government under the "Onderzoeksprogramma Artificiële Intelligentie (AI) Vlaanderen". We thank the reviewers for their valuable feedback, which helped improve the quality of this work.

Files

Files (1.7 GB)

Name Size Download all
md5:42b8038d2c067e325984e9349e02f732
1.5 GB Download
md5:1725963d35d361842afcff3599509218
236.0 MB Download
md5:527dc6cad772ccb187d5bfe5af738204
18.7 kB Download