Published August 16, 2022 | Version v1
Dataset Open

GENEA Challenge 2022 Dataset Files

  • 1. Ghent University

Description

This Zenodo repository contains the main dataset for the GENEA 2022 challenge, which is based on the Talking With Hands 16.2M data.

Contents:

The "trn" and "val" zip files contain audio files (in WAV format), time-aligned transcriptions (in TSV format), and motion files (in BVH format) for the training and validation datasets, respectively.

The "tst" zip file contains audio files (in WAV format) and transcriptions (in TSV format) for the test set, but no motion. The corresponding test motion is available at:

https://doi.org/10.5281/zenodo.6976463

Each zip file also contains a "metadata.csv" file that contains information for all files regarding the speaker ID and whether or not the motion files contain finger motion.

Note that the speech audio in the data sometimes has been replaced by silence for the purpose of anonymisation.

Data processing scripts:

We provide a number of optional scripts for encoding and processing the challenge data:

Audio: Scripts for extracting basic audio features, such as spectrograms, prosodic features, and mel-frequency cepstral coefficients (MFCCs) can be found at this link.

Text: A script to encode text transcriptions to word vectors using FastText is available. tsv2wordvectors.py

Motion: If you wish to encode the joint angles from the BVH files to and from an exponential map representation, you can use scripts by Simon Alexanderson based on the PyMo library, which are available here:

Attribution:

If you use this material, please cite our latest paper on the GENEA Challenge 2022. At the time of writing (2022-08-16), that is our ACM ICMI 2022 paper:

Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Carla Viegas, Teodor Nikolov, Mihail Tsakov, and Gustav Eje Henter. 2022. The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation. In Proceedings of the ACM International Conference on Multimodal Interaction (ICMI '22). ACM.

You can find the latest information and a BibTeX file on the project website:

https://youngwoo-yoon.github.io/GENEAchallenge2022/

Also cite the paper about the original dataset from Meta Research:

Gilwoo Lee, Zhiwei Deng, Shugao Ma, Takaaki Shiratori, Siddhartha S. Srinivasa, and Yaser Sheikh. 2019. Talking With Hands 16.2M: A large-scale dataset of synchronized body-finger motion and audio for conversational motion analysis and synthesis. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV ’19). IEEE, 763–772.

 

The motion and audio files are based on the Talking With Hands 16.2M dataset at https://github.com/facebookresearch/TalkingWithHands32M/. All material is available under a CC BY NC 4.0 international license, with the text provided in LICENSE.txt.

 

To find more GENEA Challenge 2022 material on the web, please see:

* https://youngwoo-yoon.github.io/GENEAchallenge2022/

* https://genea-workshop.github.io/2022/challenge/

If you have any questions or comments, please contact:

* The GENEA Challenge & Workshop organisers <genea-contact@googlegroups.com

Files

LICENSE.txt

Files (4.7 GB)

Name Size Download all
md5:f5a53c7ab38ba3772e879f1407d3d412
19.3 kB Preview Download
md5:78e71c4ced6f7192cb2a79bb43a2fdbc
2.4 kB Preview Download
md5:ef2902118354f87be86fac35037b5499
4.4 GB Preview Download
md5:e5f79b08c1f7d4a41ac06e87e39c7a5b
114.1 MB Preview Download
md5:e33fdd45744b6780dbd64614828926ae
183.8 MB Preview Download