Presentation Open Access
Henk van den Heuvel;
Christoph Draxler
{ "publisher": "Zenodo", "DOI": "10.5281/zenodo.3694223", "language": "eng", "title": "SSHOC Webinar: CLARIN Hands-on Tutorial on Transcribing Interview Data", "issued": { "date-parts": [ [ 2020, 3, 2 ] ] }, "abstract": "<p><strong>A high-quality orthographic transcript is the basis for all types of analyses of spoken language data. However, transcribing speech is a time-consuming and tedious task. But automatic speech recognition as well as NLP and text annotation tools can make this task much quicker and save you a lot of time and frustration.</strong></p>\n\n<p>In this first of a series of <a href=\"https://www.sshopencloud.eu/\">SSHOC</a> webinars, organised by the consortium partner <a href=\"https://www.clarin.eu/\"><strong>CLARIN ERIC,</strong></a> we will discuss the theoretical basis and the technology available for transcribing spoken language. In particular, we will focus on the role of automatic speech recognition – what are the opportunities, what are the pitfalls and to where can it be applied successfully.</p>\n\n<p><strong>ABOUT THE WEBINAR</strong></p>\n\n<p><em><strong>Introduction to working with interview data.</strong></em> Henk van den Heuvel - Head of the Humanities Lab at the Faculty of Arts, Radbound University Nijmegen - will briefly introduce the topic of using interviews as research instrument, and the cross-disciplinary nature of using interview data. He will also give some background on the Transcription Chain initiative which originated from oral history research but has a much larger potential. </p>\n\n<p><em><strong>Demonstration of automatic transcription of speech.</strong></em> Christoph Draxler - Researcher at the Institute of Phonetics and Speech Processing at Ludwig Maximilian University Munich - will demonstrate the web portal for the automatic transcription of speech. This portal currently supports three languages (English, German, Dutch), with Italian and Czech in the pipeline. The portal provides a user-friendly interface, so that researchers without a technical background may use state-of-the-art recognizers, optimized annotation editors and powerful segmentation services to result in high-quality time-aligned transcripts. These transcripts are the basis for the following in-depth scientific analysis, e. g. topic modeling, linguistic structures, named entity recognition.</p>", "author": [ { "family": "Henk van den Heuvel" }, { "family": "Christoph Draxler" } ], "type": "speech", "id": "3694223" }
All versions | This version | |
---|---|---|
Views | 615 | 615 |
Downloads | 319 | 318 |
Data volume | 1.4 GB | 1.4 GB |
Unique views | 549 | 549 |
Unique downloads | 290 | 289 |