There is a newer version of the record available.

Published April 14, 2022 | Version 0.2
Dataset Restricted

Kassel State of Fluency - Challenge

  • 1. Technische Hochschule Nürnberg
  • 2. Kasseler Stottertherapie
  • 3. Friedrich Alexander Universität Erlangen-Nürnberg


  • 1. Technische Hochschule Nürnberg


The stuttering Kassel State of Fluency corpus KSoF-C, provided is derived from
the Kassel State of Fluency (KSoF) corpus. The original corpus fea-
tures some 5500 typical and nontypical (stuttering) 3-sec segments
from 37 German speakers with an overall duration of 4,6 hours. The
segments contain speech of persons who stutter (PWO). The record-
ings from which the segments were extracted were recorded before,
during, and after PWOs underwent stuttering therapy.

KSOF-C only features the 4601 non-ambiguously labeled segments.
The task proposed in this challenge is the classification of speech
segments as one of the 8 classes - the seven stuttering-related classes
mentioned above and an eighth “garbage” class, denoting unintelligible segments, segments containing no speech, or segments that
are negatively affected by loud background noise. The dataset is split by speaker (train, 23 spk, devel, 6 spk, test, 8 spk)



The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

Please sign the EULA for the KSF-C dataset first and register your team for the ACM Multimedia 2022 Computational Paralinguistics Challenge (ComParE) challenge.

The EULA for the KSF-C dataset can be obtained from 

Access will be granted upon receiving the signed EULA.

You are currently not logged in. Do you have an account? Log in here

Additional details


  • Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Christian Bergler, Maurice Gerczuk, Natalie Holz, Pauline Larrouy-Maestri, Sebastian Bayerl, Korbinian Riedhammer, Adria Mallol-Ragolta, Maria Pateraki, Harry Coppock, Ivan Kiskin, Stephen Roberts: "The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitos", Proceedings ACM Multimedia 2022, ACM, Lisbon, Portugal, 2022.
  • Sebastian P. Bayerl, Alexander Wolff von Gudenberg, Florian Hönig, Elmar Nöth, Korbinian Riedhammer: "KSoF: The Kassel State of Fluency Dataset - A Therapy Centered Dataset of Stuttering", Proceedings LREC 2022, Marseille, France, 2022.