Published May 18, 2019 | Version v3
Dataset Open

Audio Caption Dataset (Hospital & Car)

  • 1. Shanghai Jiao Tong University

Description

This dataset contains the Hospital scene of our Audio Caption dataset. Details are given in our paper "Audio Caption: Listen and Tell", published at ICASSP 2019.

It also contains the Car scene, detailed in "Audio Caption in a Car Setting with a Sentence-Level Loss", published at ISCSLP 2021.

The original captions are in Mandarin Chinese; English translations are provided.

Files

Car_Label.zip

Files (14.7 GB)

Checksum Size
md5:ac2cd98347b5c312c9636d4ce1e2006f 904.6 kB
md5:c8c4d5393c27de6a850d500e2619659a 6.5 GB
md5:9bafc100b8be4b48b643368ba1390e2c 479.2 kB
md5:96efbec847bbeca51d0dd61da1f98b8c 8.3 GB
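
Since the two large archives are several gigabytes each, it may be worth checking a download against the md5: values listed above before unpacking. The following is a minimal sketch in Python, not part of the dataset itself; the function names md5_of_file and matches_listing are hypothetical helpers introduced here for illustration, and the sketch assumes the listed values are plain hex MD5 digests with an "md5:" prefix.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte archives do not need to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a local file with a listed value such as 'md5:ac2c...'.

    Assumption: the part after the optional 'md5:' prefix is a hex digest.
    The comparison ignores case and surrounding whitespace.
    """
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, matches_listing("some_download.zip", "md5:96efbec847bbeca51d0dd61da1f98b8c") would return True only if the local file (a placeholder path here) has that digest. The listing above does not show which file name belongs to which checksum, so the pairing has to be taken from the download page.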