Planned intervention: On Thursday 19/09 between 05:30-06:30 (UTC), Zenodo will be unavailable because of a scheduled upgrade in our storage cluster.
Published January 15, 2021 | Version 1.1
Software Open

LingMetaX

  • 1. Leiden University

Description

This script converts linguistic metadata from a .csv format produced by KoBoToolbox to the XML format used by the Lameta linguistic metadata editor. This is useful for linguists who are using the ODK Metadata Method to create linguistic metadata during fieldwork.

The script is available in two versions:

  • Google Colab Notebook - This interactive online version can be run without any installation or setup.
  • Python Script - This version must be downloaded and run on your own computer. You must have Python and the pandas library installed.

Dependencies: pandas

Data Preparation: The script accepts two .csv files as input:

  • Participants.csv - Contains metadata about the participants who have contributed to the creation of linguistic data.
  • Sessions.csv - Contains metadata about recording sessions involving participants.

Metadata must be formatted according to the provided templates provided in this project. See the CSV Template Descriptions, the list of speech genres, and the list of participant roles for more information. If you have used one of the XLS form templates from the ODK Metadata Method and exported data from KoBoToolbox, then the data will already be in the correct format.

Using the downloadable script: The downloadable Python script version also has the capability of automatically organizing resource files together with metadata. For this feature to work correctly, the following must hold true:

  1. Filenames for all resource files should be identical to the corresponding session ID in the session metadata
  2. Resource files for consent should consist of the participant’s full name followed by “_Consent” (e.g. Mariamu Anyawire_Consent.wav)
  3. Photos of participants should consist of the participant’s full name followed by “_Photo” (e.g. Mariamu Anyawire_Photo.jpg).

Download the script and place it in the same folder as the metadata files, and either in the same folder as the resource files or in the parent folder of the folders which contain the resource files. Python and pandas must be installed in order to successfully run the script.

Files

CSV Template Descriptions.md

Files (25.8 kB)

Name Size Download all
md5:9a88dd4225c40127aeb118421d71d620
6.8 kB Preview Download
md5:9cd928e65ae80c94914f42843374c348
13.9 kB Download
md5:e9d9485c640db0059f6556329260b2e9
433 Bytes Preview Download
md5:c8df7179254841163f9bf6cb7a9151d8
490 Bytes Preview Download
md5:ad061e209bfad134d734f269d95ec4c6
2.9 kB Preview Download
md5:f99738a1a89f5dd426b6c90f5894d551
1.0 kB Preview Download
md5:88b874ba3c4e10839e90c0555bb350ce
332 Bytes Preview Download

Additional details