There is a newer version of the record available.

Published July 5, 2022 | Version 1.0
Dataset Open

Radich Taisho corpus

  • 1. University of Heidelberg

Description

NOTE: THIS VERSION HAS BEEN DEPRECATED (BUT ZENODO DOES NOT ALLOW USERS TO "UNPUBLISH" DATA SETS). PLEASE ENSURE YOU ARE USING THE LATEST VERSION: SEE "See all X version" IN THE MENU BAR ON THE RIGHT. 

This packet provides the Taishō canon of Chinese Buddhist texts, as digitised by CBETA. The corpus has been modified by Michael Radich to reflect certain text-historical realities as understood by current scholarly consensus and/or the best current scholarship. The files have then been modified so that they can be used with the text-analysis tool TACL, or the TACL GUI. The corpus is described here. This corpus corresponds to the "full TACL Taisho database", also for use with TACL (and the GUI), which is available for download here.

NOTE: This corpus was modified and re-released in March 2023, to compensate for two problems in the CBETA XML discovered after the first release; those problems and the corresponding fixes are described in this document.

Files

Radich Taisho corpus.zip

Files (785.0 MB)

Name Size Download all
md5:e59c4cf6c3f94d8dc83699aed929f68b
785.0 MB Preview Download