Published July 27, 2021 | Version The data are licensed under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0). This license allows you to use the data free of charge for non-commercial purposes. You may modify and redistribute the data as long as you keep the attribution to the original in all the files, publish your work under the same licencse and cite Stephanie Gross, Matthias Hirschmanner, Brigitte Krenn, Friedrich Neubarth, Michael Zillich:
Dataset Open

Action Verb Corpus (AVC)

  • 1. Austrian Research Institute for Artificial Intelligence
  • 2. Technical University Vienna (ACIN)

Description

The Action Verb Corpus comprises multimodal data of 46 episodes (recordings) conducted by 12 humans with in total 390 instances of simple actions -- take, put, and push. Recorded are audio, video and motion data (hand and arm) while participants perform an action and describe what they do. Details about how the data was collected can be found in Stephanie Gross, Matthias Hirschmanner, Brigitte Krenn, Friedrich Neubarth, Michael Zillich: Action Verb Corpus. LREC 2018. An extension to AVC focusing on visual action recognition is available here.

The dataset consists of the following information:

  • the merged output of the object trackers, including the object poses and their reliability estimate calculated by the object tracker, whether an object is touched by or is in the hand of the instructor and whether the object touches the table, for the coordinate system applied see picture "coord_system.png" (one csv file per episode/recording, named Objects.csv),
  • the head, hand and arm positions, including per frame the 3D positions of the joints in the elbow, wrist, and knuckles of the instructor's hands (one csv file per episode/recording, named Hands.csv), the interpretation of the figures is as followes:
    • HandID: 0 right, 1 left
    • FingerID: 0 thumb, 1 index, 2 middle, 3 ring, 4 pinky
    • BoneID: 0 metacarpal, 1 proximal, 2 intermediate, 3 distal
  • the merged hand and object positions (one file per episode/recording, named Merged.csv),
  • the videos from leap motion showing the hand movements and objects (one avi file per episode/recording, named HandsObjects_libm.avi),
  • an animation of the merged hand and object tracking (one avi file per episode/recording),
  • the following list of annotations synchronized with the real-time animation of the hand and object tracking and with the speech stream (one eaf (ELAN) and one csv file per episode/recording)
    • manual orthographic transcriptions and translitations of utterances,
    • part-of-speech tags, automatically generated with the Tree-Tagger (Schmid 1995) and manually corrected,
    • lemmata, automatically generated with the Tree-Tagger and manually corrected,
    • information which object is currently moved, and where it is moved to (manually annotated),
    • information whether the left or right hand touches a particular object (manually annotated),
    • information whether a particular object touches the ground/table (automatically identified by the object tracker and manually corrected),
    • position of stationary objects in the scene (automatically calculated from output of object tracker),

File list in avc.tar.gz

  • Readme
  • Picture: Experimental setup and coordinate system. (Note, the origin of the coordinate system for the positions in the Hands and Objects files is on the table infront of the instructor.)
  • Objects Files: AVC_Objects.zip
  • Hands Files: AVC_Hands.zip
  • Hands and objects merged in one file: AVC_merged.zip
  • Videos from the leap motion showing the hand movements and objects: AVC_video_libm.zip
  • Animations of the merged hand and object trackings: AVC_video_schematic.zip
  • Elan files: AVC_eaf.zip
  • Csv files exported from Elan: AVC_csv.zip
  • Reference paper: Stephanie Gross, Matthias Hirschmanner, Brigitte Krenn, Friedrich Neubarth, Michael Zillich: Action Verb Corpus. LREC 2018.

Acknowledgments

Corpus creation and annotation was supported by the WWTF project RALLI and the CHIST-ERA HLU project ATLANTIS. The dataset was recorded at ACIN, TUW.

Files

AVC_csv.zip

Files (590.7 MB)

Name Size Download all
md5:13ff5c0142a9ea8881281f0733f6ebab
162.0 kB Preview Download
md5:6d0a73294fe190f91a382d16aa80d782
413.8 kB Preview Download
md5:262b82a5c5cd3d50563505f19795d890
137.4 MB Preview Download
md5:2a282fb70377fcda6f38b43414591459
147.8 MB Preview Download
md5:de7df1594925c8d74932a58771a16a87
7.1 MB Preview Download
md5:cd67292d5998c86df57befdb9b1f9435
192.6 MB Preview Download
md5:c22df5491128f15418d2467690db75e4
104.7 MB Preview Download
md5:ee6685811130622dfb7ef7d22a0c88e9
207.5 kB Preview Download
md5:254ed986a8e0b5c483a6ccf9f87853f1
280.7 kB Preview Download
md5:b880f37e022bf905a8cc8eed6e58fd64
3.4 kB Preview Download