Buddhist Classics AI Translation Series Volume 16 , Kangyur and Tengyur Exoteric Scriptures in Pali, Japanese, and Korean Translations , Version 1.2(佛典AI译丛第十六卷 巴利日韩译甘珠尔丹珠尔显教经论)
Authors/Creators
Description
Buddhist Classics AI Translation Series Vol. 16: Pali, Japanese, and Korean Translations of the Kangyur and Tengyur Exoteric Scriptures (Version 1.0)
Overview
This dataset constitutes Volume 16 of the Buddhist Classics AI Translation Series. It represents a pioneering effort to translate the exoteric original texts of the Tibetan Buddhist Kangyur (Translation of the Word) and Tengyur (Translation of Treatises) into three major languages: Pali (presented in Roman script and 8 other Asian scripts), Japanese, and Korean. Following Volume 15, this project aims to utilize the Tibetan canon—which preserves the most complete aspect of late South Asian Buddhism—as a source to introduce these distinct philosophical systems to the linguistic worlds of Southern Buddhism and the East Asian Sinosphere.
Source Text and Methodology
The primary source text is the "Old Derge Edition" (Sde dge) of the Tibetan Canon. Chosen for its superior collation and clear calligraphy, this edition faithfully preserves the original textual integrity of the 18th century, avoiding errors common in later reprints.
The translation process utilized a multi-model AI collaboration approach (incorporating Gemini 2.5, and GPT-5.0).
Tibetan to Pali: An experimental reconstruction of "Mahayana Scholastic Pali." By utilizing standard Pali grammar to carry the rigorous logical deductions of late Mahayana thought, this project bridges the linguistic gap between the Northern and Southern traditions. The Pali text is provided in Roman script as well as Thai, Myanmar, Sinhala, Lao, Khmer, Tai Le, Shan, and Tai Tham scripts.
Tibetan to Japanese & Korean: Leveraging the syntactic similarities (SOV structure) between Tibetan, Japanese, and Korean, these translations achieve a high degree of logical precision, particularly for complex epistemological treatises (Hetuvidyā), restoring nuances often lost in Chinese translations.
Content Scope
This volume focuses exclusively on Exoteric Scriptures, excluding Tantra.
Kangyur: Includes major Mahayana sutras such as the Prajñāpāramitā, Ratnakūṭa, and Avataṃsaka, along with the Vinaya.
Tengyur: The core contribution of this volume. It features Madhyamaka treatises by Nagarjuna and Chandrakirti, Yogacara works by Asanga and Vasubandhu, and, crucially, the Hetuvidyā (Logic) masterpieces of Dignaga and Dharmakirti, including the Abhisamayalamkara commentaries which are often absent or incomplete in the Chinese canon.
Philosophical Reflection: The Limits of Logic
A unique feature of this volume is its critical engagement with Buddhist Logic (Hetuvidyā). Through the translation process and AI analysis, the series explores the structural tension within ancient Indian logic: the paradox between establishing "Universal Norms" (which require an infinite observer) and serving "Finite Cognition" (the ordinary practitioner). This volume offers not just a translation, but a deep reflection on how these ancient logical tools function as a "belief reinforcement cycle" rather than absolute ontological proofs, providing a mirror for modern challenges in AI and cognitive science.
Since January 29, 2026, we have been attempting to address the issue of Tibetan Wylie transliteration appearing in the Pali translations. Most of these transliterations appear in the form of Tibetan-Pali parallel texts, but there are also untranslated portions. Because both use Latin script, the system cannot automatically identify them without AI reading. The final proofreading software we developed uses a probability-based approach: if Pali vocabulary frequency in a text falls below 45%, it will be retranslated regardless of whether it is parallel text or not, and we have designed an electronic collation program for this purpose. Since the Pali referred to here is constructed Mahayana Pali, the system may treat Wylie transliteration as a form of Sanskrit and incorporate it as loan words into the text. After all, Tibetan-to-Pali translation faces a severe vocabulary deficiency problem. Using Tibetan transliteration, especially phonetic transliteration, as loan words has far weaker acceptability in Pali-speaking regions than Sanskrit loan words. This means that this Tibetan-Pali translation can only provide a general understanding, and many parts require guesswork or assistance from AI or other translations in this volume. The translation series is about to undergo a break, and we can only make appropriate macro-level probabilistic corrections without fully adjusting details. These can only be reconsidered for retranslation after a long interval in the future. For readers with urgent needs in this regard, the only option in the near term is to try to understand the content on your own. Our apologies.
Buddhist Scriptures AI Translation Series
January 31, 2026
中文介绍
标题:佛典AI译丛第十六卷:巴利日韩译甘珠尔丹珠尔显教经论 (1.0版)
概览
本数据集为《佛典AI译丛》第十六卷。本卷利用前沿AI技术,将藏传佛教《甘珠尔》(佛语部)和《丹珠尔》(论疏部)中的显宗原典,翻译为巴利文(提供罗马字及8种亚洲字体)、日文及韩文。继第十五卷之后,本工作旨在以保存晚期南亚佛教面貌最为完整的藏文载体为源头,向南传佛教语言体系及东亚汉字文化圈进行深度译介。
底本与方法论
本卷底本选用在版本学上享有崇高地位的藏文“旧德格版” (Old Derge Edition)。
翻译工作采用了多模型AI协作模式(整合了 Gemini 2.5, GPT-5.0 等模型能力):
藏译巴利文:这是一项具有实验性质的“大乘经院巴利语”重构工程。我们尝试用巴利语的语法框架来承载陈那、法称严密的逻辑推演,打破了“大乘思想无法用巴利语表达”的藩篱。
藏译日韩文:利用藏语与日韩语在句法结构(SOV)和助词系统上的天然同源优势,本卷实现了极高精度的逻辑还原,特别是填补了东亚佛教在晚期因明量学文献上的空白。
收录范围
本卷严格遵循“显密分流”原则,仅收录显教文献:
甘珠尔:收录《般若经》、《宝积经》、《华严经》及《毗奈耶》(律藏)等。
丹珠尔:本卷的核心亮点。包含龙树、月称的中观论著,无著、世亲的唯识论典,以及陈那、法称的因明(逻辑学)著作。特别是藏文独有的《现观庄严论》及其庞大的注疏系统,是本次翻译的重点。
哲学反思:因明的局限性
本卷不仅是文本的转换,更包含对佛教逻辑(Hetuvidyā)有效性边界的深度审视。通过AI分析,我们在序言中探讨了古印度因明在“全称规范”(要求无限视角)与“有限认知”(凡夫视角)之间存在的结构性张力。这种反思揭示了因明作为一套思维训练系统的价值,同时也指出了其作为认识论工具的局限,为现代认知科学提供了独特的参照。
1.2版说明
自2026.01.29起,尝试解决巴利文译本中所出现的藏文威利转写的问题,这些转写大部分是以藏巴对照的形式出现,但也有没翻译的部分,因为都是拉丁字体,所以系统无法在不人工智能读取的情况下自动尽行挑选。最后制作的校勘软件是采取概率的形式,也就是假设在一篇中巴利文词汇出现的频率在45%以下,则不论是否对照,都进行重译,并为此设计了电子校勘程序。因为这里所指的巴利文是构建性的大乘巴利,所以系统可能把威利转写也作为梵文的一种作为借词兑入文本,毕竟藏译巴利有非常大的词汇缺乏问题,将藏文转写特别是音转作为借词,比梵文借词在巴利语地区的通用性要弱很多,这意味着本次藏文巴利文翻译只能看个大概,很多地方要靠猜或者借助AI或本卷其他译本。译丛休整在即,只能在概率上进行适当宏观修补,细节上无法完全进行调整,这些只有未来隔一个长时段看情况重译。如果有这方面紧急需求的读者,在近期只有自己想办法搞清楚内容这一途径,抱歉。
佛典AI译丛
2026.1.31
Files
Files
(570.4 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:6719e703db3bf8d8df2762b4a123484e
|
570.4 MB | Download |
Additional details
Dates
- Collected
-
2026-01-20