There is a newer version of the record available.

Published June 3, 2019 | Version v1
Dataset Open

Past Causalities and Event Categories for Connecting Similar Past and Present Causalities

  • 1. The University of Tokyo
  • 2. Tokyo Metropolitan Universitytan

Description

This dataset includes past causalities and their categories to connect similar past and present causalities. We report how to use this dataset in the following papers.

Ryohei Ikejiri, Yasunobu Sumikawa: "Developing world history lessons to foster authentic social participation by searching for historical causation in relation to current issues dominating the news". Journal of Educational Research on Social Studies 84, 37–48 (2016). (in Japanese).

Yasunobu Sumikawa and Ryohei Ikejiri, "Mining Historical Social Issues", Intelligent Decision Technologies, Smart Innovation, IDT'15, Systems and Technologies, Vol. 39, Springer, pp. 587--597, 2015.

This dataset is based on some textbooks that are popular ones in Japanese high-school. We first collect past causalities by referencing the textbooks. We then select the causalities if they can be useful for considering solutions for present social issues. To enhance the analogy, we describe each causality in three kinds of texts: background including problems, solution ways, and their results. From the selected causalities and an Encyclopedia of Historiography, we define categories for them. Finally, the created dataset contains 138 past causalities and 13 categories. Each past causality has more than one categories.

 

File contents:

  • FAQ data (*.csv)
    1. historical_causalities_data.tsv: Detail of stored causalities.
    2. historical_causalities_regions.tsv: Regions where the causalities happened.
    3. historical_causalities_categories.tsv: Categories of the causalities.

  • Statistics (Statistics.tsv)

     Results of statistical analyses for the dataset. We used Calinski and Harabaz method, mutual information, Jaccard Index, TF-IDF+JS divergence, and Meta-data Similarity that counts how many common categories two causalities share in order to measure qualities of the dataset.

Grants: JSPS KAKENHI Grant Number 26750076 and 17K12792

Files

Files (85.6 kB)

Name Size Download all
md5:7a68f67e64ab4e3167f60a9539d615e1
4.7 kB Download
md5:0d732388c68a61a975e5b25c354b548b
67.3 kB Download
md5:e4879578c66bcf042d906fd72ee8e50d
1.4 kB Download
md5:480fbb5fd26f520f29c4c5c8275f0c7d
12.1 kB Download