PTVD: A Large-Scale Plot-Oriented Multimodal Dataset Based on Television Dramas

Chen Li; Xutan Peng; Teng Wang; Yixiao Ge; Mengyang Liu; Xuyuan Xu; Yexin Wang; Ying Shan

doi:10.5281/zenodo.8012075

Published June 7, 2023 | Version v1

Dataset Open

PTVD: A Large-Scale Plot-Oriented Multimodal Dataset Based on Television Dramas

1. ARC Lab, Tencent PCG
2. The University of Sheffield
3. AI Technique Center of Tencent Video

Art forms such as movies and television (TV) dramas are reflections of the real world, which have attracted much attention from the multimodal learning community recently. However, existing corpora in this domain share three limitations: i. annotated in a scene-oriented fashion, they ignore the coherence within plots; ii. their text lacks empathy and seldom mentions situational context; iii. their video clips fail to cover long-form relationship due to short duration. To address these fundamental issues, using 1,106 TV drama episodes and 24,875 informative plot-focused sentences written by professionals, with the help of 449 human annotators, we constructed PTVD, the first plot-oriented multimodal dataset in the cinema domain.

Files

BSC_annotations.zip

Files (4.9 GB)

Name	Size
BSC_annotations.zip md5:1ca8fc7d2a78522bd81131c6e33f2283	7.1 MB	Preview Download
feature_clip.zip md5:911686bdb91954aa13a71c488639aba0	4.2 GB	Preview Download
metadata.zip md5:7763cb157c65ff70e3a852f22506eb3f	603.9 MB	Preview Download
plot_annotations.zip md5:74c21b56881e906fe3bfbe8cd85c459d	2.0 MB	Preview Download
subtitle_annotations.zip md5:2883ee0675c88a32645f1af7ef0b9a00	18.2 MB	Preview Download

	All versions	This version
Views	175	175
Downloads	153	153
Data volume	168.8 GB	168.8 GB

PTVD: A Large-Scale Plot-Oriented Multimodal Dataset Based on Television Dramas

Authors/Creators

Description

Files

BSC_annotations.zip

Files (4.9 GB)