Published October 22, 2024 | Version v1
Presentation Open

DELAD/CLARIN Workshop - Collecting and Sharing Corpora for Language and Speech Disorders

  • 1. Radboud Universiteit Nijmegen
  • 1. Radboud Universiteit Nijmegen
  • 2. Adam Mickiewicz University in Poznań
  • 3. ROR icon Maastricht University
  • 4. ROR icon University of Warsaw
  • 5. ROR icon Radboud University Nijmegen
  • 6. ROR icon King's College London
  • 7. ROR icon Delft University of Technology
  • 8. ROR icon University College Cork
  • 9. ROR icon University of Helsinki
  • 10. ROR icon University of Strathclyde

Description

A DELAD/CLARIN workshop on collecting and sharing corpora for language and speech disorders was organized in conjunction with the 21st International Congress of Linguists (ICL), which took place from 8 to 14 September in Poznan, Poland. The hybrid workshop was held on 11 and 12 September 2024, and the presentations have been recorded and published on the CLARIN YouTube channel.

Researchers specialising in speech disorders corpora were invited to present their work and discuss their data-sharing approaches, including any challenges they faced. The programme featured contributions from DELAD representatives, highlighting the sharing of corpora of speech disorders through DELAD and introducing recent updates, such as a new CLARIN Resource Family page dedicated to corpora involving communication disorders. Additionally, the programme addressed key topics like metadata, which is essential for ensuring dataset findability. It also included a panel discussion on the potential role of large language models (e.g. ChatGPT) in advancing research in this field.

The workshop was organised with financial support from CLARIN ERIC.

Workshop Programme

11 September 2024 

View the recording

  • A CLARIN Resource Family for Corpora of Communication Disorders & Questionnaire about data sharing (Henk van den Heuvel & Satu Saalasti)
  • Challenges in data sharing from a clinical perspective: a use case of voice data from patients with COPD (Loes van Bemmel)
  • Using a portable system for multi-channel audio data acquisition and processing (Anita Lorenc et al.)
  • Corpus-based research into intra- and interpersonal language variation in people with aphasia (Marina Ruiter et al.)
  • Dysarthric speech database in Dutch and English for personalized dysarthric speech recognition (Zhang Yue et al.
  • The Icelandic Language Biobank: Data Collection through a Clinical Analysis Platform (Iris Nowenstein et al.)
  • STAR - A Speech Therapy Animation and Imaging Resource (Eleanor Lawson et al.)

12 September 2024

View the recording

  • Sensitive Data in HPC – How secure can it be? (Matthiesen)
  • The impact of AI on research and treatment of language & speech impairments
    30-minute Introduction (Zhengjun Yue) & 30-minute Panel discussion

 

Notes (En)

DELAD is an initiative that facilitates the sharing of speech corpora among researchers of individuals with communication disorders (CSD). We do this in a GDPR-compliant way and at secure repositories in the CLARIN infrastructure. See our website.

DELAD regularly organises workshops around the following themes:

  • Guidelines for collecting and sharing CSD
  • Ethics and legal aspects
  • Levels of anonymisation
  • Layered access to data
  • Integration of CSD in the CLARIN infrastructure
  • Formats
  • Relevant metadata

For themes and reports of our previous workshops, visit our website https://delad.ruhosting.nl/wordpress/delad-workshops-2017-2020/.

Files

00-DELAD-Poznan-intro.pdf

Files (14.1 MB)

Name Size Download all
md5:dea871da2f9a4ec19066788dae9e22c9
362.6 kB Preview Download
md5:184c25050933650ad2bda9fabb065601
583.6 kB Preview Download
md5:b5e32377783cecdb9fc0d03778aa8944
1.2 MB Preview Download
md5:c2b4472c19a6099a5759d8b398050809
3.4 MB Preview Download
md5:0619412546b17e899a96b18a71f07108
1.0 MB Preview Download
md5:98cae103096247844d5e9267f85777aa
1.3 MB Preview Download
md5:7f97d1d366137f3e7af9d2d3f7c61d89
1.6 MB Preview Download
md5:c294a925599e8d8695053ea0224ce9ab
959.1 kB Preview Download
md5:3b0d63aeb5cd72e77f1ed7db31f9ea62
3.7 MB Preview Download

Additional details

Additional titles

Subtitle (En)
DELAD/CLARIN Workshop at the ICL Conference, Poznan