Published April 14, 2025 | Version v1
Presentation Open

LiRI Corpus Platform: Data analysis using text, audio, and video corpora

Authors/Creators

  • 1. University of Zurich

Description

The LiRI Corpus Platform (LCP) provides an integrated environment for working with annotated linguistic data across multiple modalities. It supports not only large text corpora, but also resources that include time-aligned annotations on audio and video streams.

In this session, participants explore how LCP enables multimodal analysis, from examining syntactic and semantic structures in text to navigating spoken interaction and gesture captured in time-based media. Using example corpora, we demonstrate how annotations are represented and queried across modalities. The session also shows how the structure of the data relates to expressions in DQD, LCP's query language.
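
As a rough illustration of what querying across modalities can involve, the following Python sketch models annotations as labeled time spans and finds the tokens that overlap a gesture in time. It does not reproduce LCP's data model or DQD syntax; the `Annotation` class, the layer names, and the sample values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """A hypothetical time-aligned annotation (not LCP's actual schema)."""
    layer: str    # assumed layer name, e.g. "token" or "gesture"
    label: str
    start: float  # seconds from the start of the recording
    end: float


def overlapping(anchor, candidates):
    """Return the candidates whose time span overlaps the anchor's span."""
    return [c for c in candidates if c.start < anchor.end and anchor.start < c.end]


# Invented sample data: two spoken tokens and one gesture.
tokens = [
    Annotation("token", "look", 0.0, 0.4),
    Annotation("token", "there", 0.4, 0.9),
]
gestures = [Annotation("gesture", "pointing", 0.5, 1.2)]

# Pair each token with every gesture that co-occurs with it in time.
co_occurring = [(t.label, g.label) for t in tokens for g in overlapping(t, gestures)]
```

Here only the second token shares time with the gesture, so it is the only pair returned. A corpus platform would express the same constraint declaratively in its query language rather than in procedural code.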

The CLARIN-CH Training Sessions 2025: Exploring Swiss Language Resources and Tools

This training series took place during the spring semester of 2025 and was organized by members of the CLARIN-CH ecosystem of infrastructures. Its aim was to introduce participants to the Swiss national infrastructure for language data and to deepen their knowledge of it.

Files

CLARIN-CH_Training_20250414_Vukovic_Presentation.pdf

Files (1.5 GB)

Additional details

Related works

Is documented by
Report: 10.5281/zenodo.15826674 (DOI)

Funding

Swissuniversities