Published June 1, 2021 | Version 1.0

SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning Over Traffic Events

  • 1. SUTD

Description

SUTD-TrafficQA is a dataset that takes the form of Video QA based on 10,080 in-the-wild videos and annotated 62,535 QA pairs, for benchmarking the cognitive capability of causal inference and event understanding models in complex traffic scenarios. Specifically, the dataset proposes 6 challenging reasoning tasks corresponding to various traffic scenarios, so as to evaluate the reasoning capability over different kinds of complex yet practical traffic events.

Files

vid_filename_to_id.json

Files (20.5 GB)

Name Size
md5:d5556bce4f7236545eadf89d489a25f4
10.8 MB Download
md5:792313620470ec2353bc6f34f19e4d1c
1.1 MB Download
md5:c5a5067874c341cd6eaa2d2d4701b960
9.7 MB Download
md5:2942eed698ea35f1412e2a704bcf5539
6.6 GB Download
md5:fc06a8d2774eed1be27a5a706c9592f9
668.2 MB Download
md5:90e0d7a299831db0850d9d64a327d6fb
10.6 GB Download
md5:37b44dc8ee55a61ff52f189890cf616c
2.6 GB Download
md5:50ed7714ba9f16c4a79d9c8e7234faf1
394.1 kB Preview Download
md5:f60ffe9d36e409bbcd1dbbd4603fb9a3
414.2 kB Preview Download

Additional details

Related works

Is described by
Conference paper: 10.48550/arXiv.2103.15538 (DOI)