Published June 11, 2024 | Version v1
Dataset Open

ViLCo: VIdeo Language COntinual learning Benchmark

  • 1. University of New South Wales
  • 2. ROR icon DEVCOM Army Research Laboratory

Description

We introduce the first VIdeo Language COntinual learning Benchmark (ViLCo-Bench). Video language continual learning involves continuously adapting to information from video and text inputs, enhancing a model’s ability to handle new tasks while retaining prior knowledge. This field is a relatively under-explored area, and establishing appropriate datasets is crucial for facilitating communication and research in this field. In this study, we present the first dedicated benchmark, ViLCo-Bench, designed to evaluate continual learning models across a range of video-text tasks. The dataset comprises ten-minute-long videos and corresponding language queries collected from publicly available datasets.

Additionally, we introduce a novel memory-efficient framework that incorporates self-supervised learning and mimics long-term and short-term memory effects. This framework addresses challenges including memory complexity from long video clips, natural language complexity from open queries, and text-video misalignment. We posit that ViLCo-Bench, with greater complexity compared to existing continual learning benchmarks, would serve as a critical tool for exploring the video-language domain, extending beyond conventional class-incremental tasks, and addressing complex and limited annotation issues.

More detailed information can also be found on our url: https://github.com/cruiseresearchgroup/ViLCo

Files

ViLCo_data.zip

Files (15.9 GB)

Name Size Download all
md5:8cce9a6ef2df4f2414c82b0c334891fc
734.0 MB Download
md5:610a5d0ed3f487646ef823b2f7213f03
734.0 MB Download
md5:397d58a41f88409d17bd716fbc1a0569
734.0 MB Download
md5:5316332401938d2a182306347bd5ec72
734.0 MB Download
md5:01ca9e8e350319146ccc71ec687f75fb
734.0 MB Download
md5:0c17b868c36021d400404da67af1cd74
734.0 MB Download
md5:46b0961b3ef3b07afa9c2b0c12f72334
734.0 MB Download
md5:da1ee8ec0da220f0583cc13c3a12b09a
734.0 MB Download
md5:b2f8c2173f95fb0024c5e5d8fd46ae9d
734.0 MB Download
md5:ea74a117b4832b36069d705a63d5a0ff
734.0 MB Download
md5:b4e7c8081a0020ae007872f76af8a320
734.0 MB Download
md5:19d40f92adb1c0dbb527d05e3ebd31e5
734.0 MB Download
md5:bd882e753cde3fa9b59a8ed1c41454f3
734.0 MB Download
md5:4fe352c6c8522e5f71b2e57a12f50d25
734.0 MB Download
md5:75d2b9cae90576ee09fffcfda788e245
734.0 MB Download
md5:b9f8ef9cf8faaad34bde344cfa245479
734.0 MB Download
md5:f2eee7c41864511927152bfd12d3220f
734.0 MB Download
md5:2e05348d3e39b1a4f9c7ba9809d52746
734.0 MB Download
md5:9319ae05b6723950b65fb6c8eb8e4b98
734.0 MB Download
md5:564708db0ee6fd17c5a76538d7e02bfb
734.0 MB Download
md5:09ebb7b916ab0db843c827e11e1a5d95
734.0 MB Download
md5:f80dd798277f7114ce0ed305e3ae5a53
511.3 MB Preview Download

Additional details

Software

Repository URL
https://github.com/cruiseresearchgroup/ViLCo
Programming language
Python