Published March 2, 2024 | Version v3
Dataset | Open

MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations

Creators

Description

MIntRec2.0 is a large-scale benchmark dataset for multimodal intent recognition in multi-party conversations. It contains 1,245 high-quality dialogues with 15,040 samples, each annotated within a new intent taxonomy of 30 fine-grained classes across the text, video, and audio modalities. In addition to more than 9,300 in-scope samples, it includes over 5,700 out-of-scope samples that appear in multi-turn contexts. Such samples occur naturally in real-world open scenarios, so including them makes the dataset more practically applicable. This dataset will be released under the CC BY-NC-SA 4.0 license.

Files

in-scope_text_data.zip

Files (7.9 GB)

MD5 checksum | Size
md5:81d075ffd7fba4c0abdfa72fb77aa53c | 4.1 GB
md5:3f346f8e32fc333c03f4faf124ca29db | 313.6 kB
md5:c9516cade914af6f9fca2b92004208b4 | 652.0 MB
md5:540d558650809dbade61804ce4c4b1fd | 103 Bytes
md5:036b7f9b58b34a8f2dce31151537ad82 | 2.7 GB
md5:12d1458712a1750ce16c2606bdf81c36 | 192.1 kB
md5:fd5ae3ac5df8bcfaabe1bdadcefe2047 | 421.8 MB
md5:717eb9b5f40fbc30867bee25a3c75df4 | 1.0 kB
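
Since each file is listed with an MD5 checksum, a downloaded archive can be checked against the listing before it is unpacked. The following is a minimal sketch in Python, not part of the dataset's own tooling: the helper names `md5_of_file` and `matches_listing` are illustrative, and any local file path passed to them is an assumption, because the listing above does not show which file name belongs to which checksum.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte archives are never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a local file against a checksum written as 'md5:<hex>'.

    A bare hex string without the 'md5:' prefix is accepted as well.
    """
    expected = listed_checksum.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listing("downloaded_archive.zip", "md5:81d075ffd7fba4c0abdfa72fb77aa53c")` would return `True` only if the local file (a hypothetical name here) is byte-for-byte identical to the 4.1 GB file in the listing. A mismatch usually points to an interrupted or corrupted download.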