Published November 30, 2022 | Version v1
Dataset Open

Vimeo Creative Commons Collection (V3C) Whisper Transcripts

  • 1. University of Zurich
  • 2. University of Basel

Description

Automatic transcript for every video in the Vimeo Creative Commons Collection (V3C) generated using OpenAI's Whisper using the 'small' model.

Files

V3C1.zip

Files (153.4 MB)

Name Size Download all
md5:a439f70eec99056983817f6228f92edc
40.0 MB Preview Download
md5:8c483311f160c08b5ecf69cfb9cd19cd
51.9 MB Preview Download
md5:fdf36a498fab50e4de9e0223cf7aaa6d
61.6 MB Preview Download

Additional details

References

  • Rossetto, L., Schuldt, H., Awad, G., & Butt, A. A. (2019, January). V3C–a research video collection. In International Conference on Multimedia Modeling (pp. 349-360). Springer, Cham.
  • Radford, A., Kim, J. W., Xu, T., Brockman, G., McLeavey, C., & Sutskever, I. (2022). Robust speech recognition via large-scale weak supervision. OpenAI Blog.