Published June 11, 2021 | Version v1
Video/Audio Open

Crowdsourcing historical text and data with the Chinese Text Project

  • 1. Durham University


Paper presented on Friday 11 June 2021 at the Digital Medievalist Global Symposium The past, present, and future of Digital Medieval Studies for the Asia & Oceania Panel, in the session Engaging in Chinese Literature.

The Chinese Text Project ( is a crowdsourced digital library of premodern Chinese writing, containing over 35 million pages of scanned primary source material and billions of words of transcribed text. In this talk I describe the implementation of a crowdsourced semantic annotation system for these texts, as well as the joint construction of a crowdsourced knowledge graph recording data covering close to 3000 years of Chinese history.


Files (1.0 GB)

Name Size Download all
1.0 GB Download