Japanese FAQ dataset for e-learning system
Authors/Creators
- Tokyo Metropolitan University
Description
This dataset contains FAQ data and their categories for training a chatbot specialized for the e-learning system used at Tokyo Metropolitan University. The accuracy of the chatbot is reported in the following paper:
Yasunobu Sumikawa, Masaaki Fujiyoshi, Hisashi Hatakeyama, and Masahiro Nagai "Supporting Creation of FAQ Dataset for E-learning Chatbot", Intelligent Decision Technologies, Smart Innovation, IDT'19, Springer, 2019, to appear.
The dataset is based on real Q&A data about how to use the e-learning system, asked by students and teachers who use it in actual classes. The Q&A data were collected from April 2015 to July 2018.
File contents:
- FAQ data (*.csv)
- Answer2Category.csv: Categories of answers.
- Answer2Tag.csv: Titles of answers.
- Answers.csv: IDs for answers and texts of answers.
- Categories.csv: Names of categories for answers.
- Questions.csv: Texts of questions and their corresponding answer IDs.
- Statistics (*.tsv)
Results of statistical analyses of the dataset. To measure the quality of the dataset, we used the Calinski-Harabasz index, mutual information, the Jaccard index, TF-IDF+KL divergence, and TF-IDF+JS divergence. In these analyses, we regard each answer as a cluster of questions. We also performed the same analyses for categories, regarding each category as a cluster of answers.
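The idea of regarding each answer as a cluster of questions can be sketched in a few lines of Python. This is only a minimal illustration, not the procedure used to produce the published statistics: the column names `question` and `answer_id`, the sample rows, and the use of character bigrams as tokens are all assumptions made for the example, and the actual layout of Questions.csv should be checked before reusing it.

```python
import csv
import io
from collections import defaultdict
from itertools import combinations

# Hypothetical sample standing in for Questions.csv; the real column names
# and layout may differ, so check the file header before reusing this.
SAMPLE_QUESTIONS_CSV = """question,answer_id
パスワードを忘れました,1
パスワードを再設定したい,1
課題を提出する方法は,2
課題の提出方法を教えて,2
"""


def char_bigrams(text):
    """Return the set of character bigrams (a crude stand-in for word tokens)."""
    return {text[i:i + 2] for i in range(len(text) - 1)}


def jaccard(a, b):
    """Jaccard index |A & B| / |A | B|; defined as 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def group_by_answer(csv_file, text_col="question", id_col="answer_id"):
    """Group question texts by answer ID, treating each answer as a cluster."""
    clusters = defaultdict(list)
    for row in csv.DictReader(csv_file):
        clusters[row[id_col]].append(row[text_col])
    return dict(clusters)


def mean_intra_cluster_jaccard(questions):
    """Average Jaccard index over all question pairs in one cluster."""
    pairs = list(combinations(questions, 2))
    if not pairs:
        return 0.0  # a single-question cluster has no pairs to compare
    total = sum(jaccard(char_bigrams(q1), char_bigrams(q2)) for q1, q2 in pairs)
    return total / len(pairs)


clusters = group_by_answer(io.StringIO(SAMPLE_QUESTIONS_CSV))
for answer_id, questions in clusters.items():
    print(answer_id, round(mean_intra_cluster_jaccard(questions), 3))
```

To try this on the real data, the `io.StringIO` sample could be replaced with something like `open("Questions.csv", encoding="utf-8", newline="")`, with `text_col` and `id_col` adjusted to the actual header; the UTF-8 encoding is itself an assumption. A higher average within a cluster suggests that the questions mapped to one answer are lexically similar to each other.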
Grants: JSPS KAKENHI Grant Number 18H01057
Files (1.7 MB)
| Checksum | Size |
|---|---|
| md5:7b4a6df4d21cf0224afc59036e28489a | 957 Bytes |
| md5:6bbf666f172e70554378dda505c7718a | 2.0 kB |
| md5:483ca145d99b7a1a7c2cbaba52573a89 | 42.9 kB |
| md5:49241c18fa7fffd147fb192569c87def | 525 Bytes |
| md5:b050b427368cd9ab7b8794234ffcef0b | 239 Bytes |
| md5:f425a911808363e77e14d439f559eb23 | 573 Bytes |
| md5:77f5047fb39a3f7d8a4248ae7339845a | 78.8 kB |
| md5:0b041e1427b28da3f9bfb914fb80f9d5 | 7.2 kB |
| md5:494b1b4066db795b2c1191bc8893ed9d | 10.5 kB |
| md5:833761f0bb63e6d081336aff9be65208 | 1.5 MB |
| md5:74c92d0ea256cf44e58122558d269f0c | 7.2 kB |
| md5:88b928168fb587efce824a5e266ee06f | 35.3 kB |
| md5:29199fbaa1e64503ee571c6b7f0afe2a | 1.4 kB |