Published March 2, 2024
| Version v3
Dataset
Open
MIntRec2. 0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations
Creators
Description
MIntRec2.0, a large-scale benchmark dataset for multimodal intent recognition in multi-party conversations. It contains 1,245 high-quality dialogues with 15,040 samples, each annotated within a new intent taxonomy of 30 fine-grained classes, across text, video, and audio modalities. In addition to more than 9,300 in-scope samples, it also includes over 5,700 out-of-scope samples appearing in multi-turn contexts, which naturally occur in real-world open scenarios, enhancing its practical applicability. This dataset will be released under the CC BY-NC-SA 4.0 license.
Files
in-scope_text_data.zip
Files
(7.9 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:81d075ffd7fba4c0abdfa72fb77aa53c
|
4.1 GB | Download |
|
md5:3f346f8e32fc333c03f4faf124ca29db
|
313.6 kB | Preview Download |
|
md5:c9516cade914af6f9fca2b92004208b4
|
652.0 MB | Download |
|
md5:540d558650809dbade61804ce4c4b1fd
|
103 Bytes | Preview Download |
|
md5:036b7f9b58b34a8f2dce31151537ad82
|
2.7 GB | Download |
|
md5:12d1458712a1750ce16c2606bdf81c36
|
192.1 kB | Preview Download |
|
md5:fd5ae3ac5df8bcfaabe1bdadcefe2047
|
421.8 MB | Download |
|
md5:717eb9b5f40fbc30867bee25a3c75df4
|
1.0 kB | Preview Download |