There is a newer version of the record available.

Published May 30, 2023 | Version v1
Dataset Open

MeetingBank: A Benchmark Dataset for Meeting Summarization

Description

MeetingBank, a benchmark dataset created from the city councils of 6 major U.S. cities to supplement existing datasets. It contains 1,366 meetings with over 3,579 hours of video, as well as transcripts, PDF documents of meeting minutes, agenda, and other metadata. On average, a council meeting is 2.6 hours long and its transcript contains over 28k tokens, making it a valuable testbed for meeting summarizers and for extracting structure from meeting videos. The datasets contains 6,892 segment-level summarization instances for training and evaluating of performance.

Files

MeetingBank-Metadata.zip

Files (93.2 MB)

Name Size Download all
md5:9c1ccd37f8d7f4a1839cc4834198166a
93.2 MB Preview Download

Additional details

Related works

Is published in
Conference paper: https://arxiv.org/abs/2305.17529 (URL)