Published December 3, 2019 | Version v1
Dataset Open

100 Multilingual Wikipedia Articles on Mathematics

  • 1. ROR icon University of Wuppertal
  • 2. ROR icon FIZ Karlsruhe – Leibniz Institute for Information Infrastructure
  • 3. ROR icon University of Göttingen

Description

The dataset contains the latest dump of 100 select Wikipedia articles that deal with mathematical topics in all available languages.

See the report "Mathematical World Knowledge Contained in the Multilingual Wikipedia Project" for a detailed description of the method and the dataset.

 

The script can be accessed via

swh:1:dir:f417f07e04e46374a31eb16f8a1e550dc8bcbd1e;
origin=https://github.com/gipplab/ss19-sem-most-common-formula-across-wikipedia-languages;
visit=swh:1:snp:273653fc5a981ad3269b5a348c37d559ada9d30d;
anchor=swh:1:rev:3b23dea39732f166bbd97657e0a481fd136e523f 

Notes

Except as discussed below, all original textual content is licensed under the  GNU Free Documentation License (GFDL) and the Creative Commons Attribution-Share-Alike 4.0 License. Some text may be available only under the Creative Commons license; see our Terms of Use for details. Text written by some authors may be released under additional licenses or into the public domain. 

https://dumps.wikimedia.org/legal.html

Files

math-qid.zip

Files (74.2 MB)

Name Size Download all
md5:8b090ac28f4cddfdd7d9cf182a060206
74.2 MB Preview Download

Additional details

Related works

Is derived from
Dataset: 10.5281/zenodo.15058129 (DOI)
Is documented by
Conference paper: 10.1007/978-3-030-52200-1_35 (DOI)