Published July 2025 | Version v2
Dataset Open

MixAssist

  • 1. University of Utah

Description

This dataset contains the complete audio recordings for the MixAssist project, as detailed in the paper "MixAssist: An Audio-Language Dataset for Co-Creative AI Assistance in Music Mixing."

This repository includes two main types of audio files:

  • Raw Session Recordings: The complete, unprocessed audio recordings from the 7 co-creative mixing sessions between expert and amateur music producers. These recordings contain the full dialogue and simultaneous playback from the Digital Audio Workstation (DAW). They are provided to support research in areas like end-to-end conversational speech recognition or fine-grained interaction analysis.

  • Processed Audio Segments: The music-only audio segments, extracted and temporally aligned with the conversational turns in the main MixAssist dataset. Each segment is the specific audio the participants were discussing at that point in the conversation. Download these if you wish to train or fine-tune an audio language model on the MixAssist dataset, because the audio paths provided within the dataset refer to these segments.

The full conversational MixAssist dataset is available on Hugging Face.
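Because the dataset's audio paths point at the processed segments, it can help to confirm that every referenced segment is present locally before training. The following is a minimal sketch, not the authors' tooling: it assumes the paths stored in the dataset are relative to the folder where the segments archive was extracted, and the function names and the example folder layout are hypothetical.

```python
from pathlib import Path


def resolve_audio_path(segments_root, relative_path):
    """Join a dataset-provided path onto the local segments folder.

    Assumption: the dataset stores paths relative to the extraction folder.
    """
    root = Path(segments_root).resolve()
    candidate = (root / relative_path).resolve()
    # Reject paths that would point outside the extraction folder.
    if candidate != root and root not in candidate.parents:
        raise ValueError(f"path escapes the segments folder: {relative_path}")
    return candidate


def missing_segments(segments_root, relative_paths):
    """Return the dataset paths that have no matching local audio file."""
    return [
        path
        for path in relative_paths
        if not resolve_audio_path(segments_root, path).is_file()
    ]
```

Calling `missing_segments("segments/", paths)` with the list of path strings taken from the conversational dataset returns the entries that still need to be downloaded or extracted; an empty list means every referenced segment was found.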

Please cite our paper if you use this dataset in your research.

Files

Full_Group_Convos.zip

Files (2.5 GB)

Checksum                               Size
md5:69cb7a4b24cdead7654571bbbd6e5527   29.5 MB
md5:d689adfd666300ad65111a34824733f4   37.8 MB
md5:c0830d20a32b96b52a99bfb509fb5a24   18.3 MB
md5:e5edee56bce071a9d58476ab7378c799   22.9 MB
md5:41d3383c1877f0d469927525538c9ad8   15.7 MB
md5:5be66ec852474857a84e8e7d5eb6220e   23.6 MB
md5:8dd24062cd5059b7a4dbfe08ca66f61e   10.8 MB
md5:0b98643ea28630bbd318f61452897f8c   2.3 GB
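The md5 values listed above can be used to check that a download completed without corruption, which is worth doing for the 2.3 GB file in particular. The sketch below uses only Python's standard `hashlib` module; the function names are illustrative, and the local file path in the usage note is a placeholder, since the listing does not pair checksums with file names.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads keep memory use flat even for multi-gigabyte files.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 with a listed value, with or without 'md5:'."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex
```

For example, `matches_checksum("downloaded_file.zip", "md5:0b98643ea28630bbd318f61452897f8c")` returns `True` only if the local file (placeholder name) is byte-for-byte identical to the 2.3 GB file in the listing.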

Additional details

Dates

Available
2025-07