Dataset Open Access

# CauseNet: Towards a Causality Graph Extracted from the Web

Heindorf, Stefan; Scholten, Yan; Wachsmuth, Henning; Ngonga Ngomo, Axel-Cyrille; Potthast, Martin

Causal knowledge is seen as one of the key ingredients to advance artificial intelligence. Yet, few knowledge bases comprise causal knowledge to date, possibly due to significant efforts required for validation. Notwithstanding this challenge, we compile CauseNet, a large-scale knowledge base of claimed causal relations between causal concepts. By extraction from different semi- and unstructured web sources, we collect more than 11 million causal relations with an estimated extraction precision of 83% and construct the first large-scale and open-domain causality graph. We analyze the graph to gain insights about causal beliefs expressed on the web and we demonstrate its benefits in basic causal question answering. Future work may use the graph for causal reasoning, computational argumentation, multi-hop question answering, and more.

When using the data, please make sure to refer to it as follows:

@inproceedings{heindorf2020causenet,
author    = {Stefan Heindorf and
Yan Scholten and
Henning Wachsmuth and
Axel-Cyrille Ngonga Ngomo and
Martin Potthast},
title     = {CauseNet: Towards a Causality Graph Extracted from the Web},
booktitle = {{CIKM}},
pages     = {3023--3030},
publisher = {{ACM}},
year      = {2020}
}

Files (2.0 GB)
Name Size
causenet-full.jsonl.bz2
md5:78273d177c5096f89d2367a876b64645
1.8 GB
causenet-precision.jsonl.bz2
md5:8bf12257e71713a63403bc8fe8bf71bf
137.9 MB
causenet-sample.json
md5:662cefd755f046751d1a345fe6abbdf7
55.2 kB
• Stefan Heindorf, Yan Scholten, Henning Wachsmuth, Axel-Cyrille Ngonga Ngomo, and Martin Potthast. CauseNet: Towards a Causality Graph Extracted from the Web. In CIKM 2020, pages 2023-3030. ACM.

57
24
views