Published July 5, 2022 | Version 1.0
Dataset Open

T-X corpus

  • 1. University of Heidelberg


This corpus includes the Taishō and Xuzangjing/Zokuzōkyō collections of Chinese Buddhist texts, as digitised by CBETA, processed so that they are ready for use with the text-analysis tool TACL or the TACL GUI

NOTE: This corpus was modified in March 2023 to fix some problems in the way the TACL code was processing the CBETA XML. Those problems, and the corresponding fixes, are described in this document



Files (739.9 MB)

Name Size Download all
739.9 MB Preview Download