There is a newer version of the record available.

Published February 5, 2019 | Version v1
Dataset Open

Japanese FAQ dataset for e-learning system

  • 1. Tokyo Metropolitan University

Description

This dataset contains FAQ data and their categories for training a chatbot specialized for the e-learning system used at Tokyo Metropolitan University. We report the accuracy of the chatbot in the following paper.

Yasunobu Sumikawa, Masaaki Fujiyoshi, Hisashi Hatakeyama, and Masahiro Nagai, "Supporting Creation of FAQ Dataset for E-learning Chatbot", Intelligent Decision Technologies, Smart Innovation, IDT'19, Springer, 2019, to appear.

This dataset is based on real Q&A data about how to use the e-learning system, asked by students and teachers who use it in actual classes. The Q&A data were collected from April 2015 to July 2018.

File contents:

  • FAQ data (*.csv)
    1. Answer2Category.csv: Categories of answers.
    2. Answer2Tag.csv: Titles of answers.
    3. Answers.csv: IDs for answers and texts of answers.
    4. Categories.csv: Names of categories for answers.
    5. Questions.csv: Texts of questions and their corresponding answer IDs.

  • Statistics (*.tsv)

     Results of statistical analyses of the dataset. To measure the quality of the dataset, we used the Calinski-Harabasz index, mutual information, the Jaccard index, TF-IDF with KL divergence, and TF-IDF with JS divergence. In these analyses, we regard each answer as a cluster of questions. We also perform the same analyses for categories, regarding each category as a cluster of answers.
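
The idea of treating each answer as a cluster of questions can be sketched as follows. This is only an illustration, not the analysis code used for the dataset: the column names `question` and `answer_id`, the English sample rows, the whitespace tokenizer, and the smoothed IDF formula are all assumptions. Only the Jaccard index and TF-IDF with JS divergence are shown; the Calinski-Harabasz index, mutual information, and KL divergence are omitted.

```python
import csv
import io
import math
from collections import Counter, defaultdict

# Hypothetical stand-in for Questions.csv. The real column names and the
# (Japanese) question texts are not documented here, so these are assumptions.
QUESTIONS_CSV = """question,answer_id
how do i reset my password,1
i forgot my password,1
where can i submit my report,2
how do i upload a report file,2
"""

def load_clusters(text):
    """Group tokenized questions by answer ID (one cluster per answer)."""
    clusters = defaultdict(list)
    for row in csv.DictReader(io.StringIO(text)):
        # Whitespace splitting is a placeholder; Japanese text would need
        # a morphological analyzer instead.
        clusters[row["answer_id"]].append(row["question"].split())
    return dict(clusters)

def jaccard(a, b):
    """Jaccard index |A & B| / |A | B| of two token collections."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0

def tfidf_distribution(cluster, all_clusters, vocab):
    """TF-IDF weights of one cluster, normalized to sum to 1."""
    tf = Counter(tok for q in cluster for tok in q)
    n = len(all_clusters)
    weights = []
    for term in vocab:
        # Document frequency: number of clusters that contain the term.
        df = sum(any(term in q for q in c) for c in all_clusters)
        idf = math.log((1 + n) / (1 + df)) + 1  # smoothed IDF (assumed form)
        weights.append(tf[term] * idf)
    total = sum(weights)
    return [w / total for w in weights]

def js_divergence(p, q):
    """Jensen-Shannon divergence in bits (0 = identical, 1 = disjoint)."""
    def kl(x, y):
        return sum(xi * math.log2(xi / yi) for xi, yi in zip(x, y) if xi > 0)
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

clusters = load_clusters(QUESTIONS_CSV)
groups = list(clusters.values())
vocab = sorted({tok for c in groups for q in c for tok in q})
tokens_1 = [tok for q in clusters["1"] for tok in q]
tokens_2 = [tok for q in clusters["2"] for tok in q]
dists = [tfidf_distribution(c, groups, vocab) for c in groups]

print("Jaccard between clusters:", jaccard(tokens_1, tokens_2))
print("JS divergence between clusters:", js_divergence(dists[0], dists[1]))
```

Under this reading, a low Jaccard index and a high JS divergence between two answers suggest that their question sets use distinct vocabulary, which is one way to interpret the dataset as well separated.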

Grants: JSPS KAKENHI Grant Number 18H01057

Files

Files (1.7 MB)

Checksum                               Size
md5:7b4a6df4d21cf0224afc59036e28489a   957 Bytes
md5:6bbf666f172e70554378dda505c7718a   2.0 kB
md5:483ca145d99b7a1a7c2cbaba52573a89   42.9 kB
md5:49241c18fa7fffd147fb192569c87def   525 Bytes
md5:b050b427368cd9ab7b8794234ffcef0b   239 Bytes
md5:f425a911808363e77e14d439f559eb23   573 Bytes
md5:77f5047fb39a3f7d8a4248ae7339845a   78.8 kB
md5:0b041e1427b28da3f9bfb914fb80f9d5   7.2 kB
md5:494b1b4066db795b2c1191bc8893ed9d   10.5 kB
md5:833761f0bb63e6d081336aff9be65208   1.5 MB
md5:74c92d0ea256cf44e58122558d269f0c   7.2 kB
md5:88b928168fb587efce824a5e266ee06f   35.3 kB
md5:29199fbaa1e64503ee571c6b7f0afe2a   1.4 kB
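
A downloaded file can be checked against one of the MD5 values listed above with a short script such as the one below. This is a minimal sketch: the helper names `md5_of` and `verify` are hypothetical, and because the listing does not show which checksum belongs to which file, the demonstration uses a temporary file rather than a real dataset file.

```python
import hashlib
import os
import tempfile

def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def verify(path, listed):
    """Compare a file's MD5 with a listed value of the form 'md5:<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of(path) == expected

# Demonstration on a temporary file (not part of the dataset).
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"hello")
    demo_path = tmp.name

demo_ok = verify(demo_path, "md5:" + md5_of(demo_path))
demo_mismatch = verify(demo_path, "md5:7b4a6df4d21cf0224afc59036e28489a")
os.remove(demo_path)

print("matches own checksum:", demo_ok)
print("matches an unrelated listed checksum:", demo_mismatch)
```

For a real check, pass the path of the downloaded file and the checksum shown for that file on the record page.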