Vimeo Creative Commons Collection (V3C) Whisper Transcripts

Published November 30, 2022 | Version v1

Dataset Open

Automatic transcript for every video in the Vimeo Creative Commons Collection (V3C) generated using OpenAI's Whisper using the 'small' model.

Files

Rossetto, L., Schuldt, H., Awad, G., & Butt, A. A. (2019, January). V3C–a research video collection. In International Conference on Multimedia Modeling (pp. 349-360). Springer, Cham.
Radford, A., Kim, J. W., Xu, T., Brockman, G., McLeavey, C., & Sutskever, I. (2022). Robust speech recognition via large-scale weak supervision. OpenAI Blog.

782

Views

382

Downloads

Show more details

DOI

Resource type

Dataset

Publisher

Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more