DEDU — an Online Editor of Parallel Corpus

Feature: A parallel text editor that everyone can learn to use. No installation is required: get started right away, create a readable text in the most intuitive way, and share it with the world immediately.

About:

This is an online text editing tool that makes it convenient for users to compile multiple data sources, and quickly create a table of correspondences. This editor has the following characteristics:

  1. Provides a WYSIWYG online editing environment, so everyone can easily get started.
  2. Provides a tree structure guidance mechanism, which can easily handle large file correspondences.
  3. Makes it easy to share a project with multiple team members for collaboration.
  4. The results can be previewed at any time and shared with the world.
  5. Result data can be extracted at any time, and is not restricted by platform.

An index to the footnotes of the Japanese translation of the Dīrgha-āgama (Taishō 1, 長阿含經), Tokyo, 1996–2006

This large translation project was the joint effort of sinologically trained linguists and philologists trained in Indology and Buddhist studies. Their combined expertise resulted in rich research footnotes that deal with problematic terminology and readings from a perspective that takes into account both linguistic and content aspects of the Āgama collection. This makes the index to the words and passages discussed in the footnotes particularly useful for scholars of the early Buddhist Āgamas and for scholars working on the Chinese translations of Buddhist texts in general.

Feature:

Digitized index to the footnotes of the six-volume Japanese translation of the Dīrgha-āgama.

About:

The Āgama Research Group at DILA has digitized, edited and supplemented an unpublished index to the footnotes of the six-volume Japanese translation of the Dīrgha-āgama (《長阿含經》Taishō 1), and offers it in XML, HTML and PDF formats.

Content:

Reference:
Karashima Seishi (辛嶋静志), Okayama Hajime (丘山新), Kamitsuka Yoshiko (神塚淑子), Kanno Hiroshi (菅野博史), Sueki Fumihiko (末木文美士), Hikita Hiromichi (引田弘道) and Matsumura Takumi (松村巧), Gendaigo yaku agon kyōten: Jōagongyō 現代語訳「阿含経典」, Tokyo: Hirakawa Shuppansha (平河出版社), 1996-2005, 6 vols.

The six-volume translation is based on an earlier serialized translation that was originally published in monthly instalments in Gekkan Āgama (月刊アーガマ). The incomplete, unpublished index was first prepared on the basis of this serialized translation. The person responsible for assembling the original index, which was compiled by a group of Japanese students, is Eiichi Enomoto (榎本榮一), formerly an editor at Hirakawa Shuppansha (平河出版社). The publisher has granted permission to use, modify and make the index freely available.

The draft version was first digitized, then matched against the six-volume edition, and supplemented and corrected wherever necessary. The present index covers all words and phrases substantively discussed in the footnotes of the six-volume edition.

Bibliotheca Sacra Birmanica

This is a collaboration between the Chung-Hwa Institute of Buddhist Studies and the École française d'Extrême-Orient. A total of 245 palm-leaf manuscripts were catalogued. They are inscribed in one script but in two languages, Pāli and Burmese. Some are in Pāli only: texts and commentaries of the Theravādin Tripiṭaka, as well as independent compositions. Some are in Burmese only, and others are in both Pāli and Burmese, in the form of phrase-by-phrase translations of and commentaries on Pāli texts. The oldest manuscript is dated 1724 and the most recent 1930.

Feature:

Catalogue of and information on 245 palm-leaf manuscripts in two languages, Pāli and Burmese, covering commentaries, history and grammar.

Content:

Most texts (127 titles) belong to the Vinaya. Texts related to the Suttantapiṭaka are represented by 66 titles, while 67 texts belong to the Abhidhamma. Alongside narrative literature, poetry and historiographical works, the collection includes 49 grammars or lexical works. Texts include the Sīhalavatthu(ppakaraṇa) (a collection of narratives), the devotional poetry of the Jinālaṅkāra and the compendium Sāratthasaṅgaha. There is a nearly complete set of the Sri Lankan chronicles, including the rare Nalātadhātuvaṃsa (Chronicle of the frontal bone relic of the Buddha), which has never been critically edited. There are many Pāli-Burmese nissaya, most of them unknown to other catalogues. Finally, the grammars and lexical works include the Abhidhānappadīpikā, Kaccāyanapakaraṇa, Kaccāyanavutti and Saddanīti. The collection thus covers essential fields such as history and grammar.

Announcement:

Distributed on CD-ROM since 2002. For more information, please contact the Digital Archive Group: da@dila.edu.tw

Catalog Database of Republican Era Buddhist Journals

For the convenience of scholars and others who study Chinese Buddhism in the Republican era, this catalogue database of the Minguo fojiao qikan wenxian jicheng 《民國佛教期刊文獻集成》(Complete Collection of Republican-Era Buddhist Periodical Literature) has been built. It records 152 Chinese Buddhist journals (over 140,000 articles), with content including Buddhist Studies writings, image archives, contemporary news and publication announcements. Anyone seeking a more comprehensive view of the eminent figures, history and interactions among Buddhists during the Republican era should find this database useful.

Feature:

The Minguo fojiao qikan wenxian jicheng 《民國佛教期刊文獻集成》comprises 152 Chinese Buddhist journals from the Republican era (1911-1949).

Digital Database of Buddhist Tripitaka Catalogs

To properly present the enormous amount of information derived from Buddhist texts, this database uses TEI markup to systematize knowledge organization and management. It draws on the current digital archives of CBETA as well as on traditional sutra catalogues contained in various Tripiṭaka editions and versions, the Shōwa Hōbō sōmokuroku 《昭和法寶總目錄》, the Hobogirin: Repertoire du canon Bouddhique Sino-Japonais edition de Taishō Tripiṭaka 《法寶義林–大正大藏經總索引》, etc.

Feature:

Database of ancient and modern Buddhist Tripiṭaka catalogues, with information on individual texts and search functions.

Content:

  • Complete catalogues of ancient and modern editions
  • Search facilities to check all occurrences of a text across catalogues
  • Links to the corresponding full text
  • Organization of textual information

The Study and Creation of Buddhist Lexicographical Resources

This is one of the sub-projects of the NSC Digital Archive Technology program. It provides a robust research platform designed for Buddhist Studies scholars. The cornerstones of the project are term extraction and multilingual corpus building. Given the complexity of the Tripiṭaka in its different languages and editions, these databases can assist in textual studies and reference compilation. The statistical data may also inspire further research.

Feature:

Texts were digitized into databases with TEI markup. These databases function not only as conventional research tools: users can also run meta-searches across databases, obtain statistics, make comparisons, cite and paste, etc.

Content:

  • Indices and Algorithms
  • Cross-Language Corpora
  • CBETA Statistics
  • Concordance Search

Glossaries for Buddhist Studies

A collection of Buddhist Studies glossaries built on open standards and offered as downloadable open-source files in TEI XML, HTML, PDF, Babylon, Stardict, ePub, Dr. eye and other formats. Currently, there are twelve dictionaries available, covering Chinese, English, Sanskrit, Pāli, Tibetan, Manchu and Mongolian, with over a hundred thousand entries.

Feature:

A growing collection of Buddhist Studies glossaries offered in various formats (XML-TEI, HTML, PDF, ePub etc.).

Content:

This project currently provides the following dictionary files in various formats:

  • Dama Bhikkhu: A Chinese Translation of A.P. Buddhadatta’s Concise Pali-English Dictionary: Digital Edition
  • Dan Martin: Tibskrit Philology
  • Digital Index of Noun and Verb Endings of Pāli Grammar
  • Ding Fubao: Dictionary of Buddhist Studies
  • Jeffrey Hopkins: Tibetan-Sanskrit-English Dictionary
  • Mahāvyutpatti
  • Pentaglot Dictionary of Buddhist Terms
  • Rod Bucknell: Digital Index of Noun & Verb Ending of Sanskrit Manual
  • Seishi Karashima: A Glossary of Dharmarakṣa’s Translation of the Lotus Sūtra
  • Seishi Karashima: A Glossary of Kumārajīva’s Translation of the Lotus Sūtra
  • Seishi Karashima: A Glossary of Lokakṣema’s Translation of the Aṣṭasāhasrikā Prajñāpāramitā
  • Seishi Karashima: Glossary to the Abhisamācārikā Dharmāḥ of the Mahāsāṃghika-Lokottaravādins
  • Seishi Karashima: 「長阿含経」の原語の研究 (A study of the language of the Dīrgha-āgama)
  • Shi, Zhi-yu: Nanshan Vinaya Dictionary
  • Soothill-Hodous: A Dictionary of Chinese Buddhist Terms

DILA Authority Databases

These databases integrate information from various projects of the Digital Archive Group. By providing information on the Chinese calendar and Tripiṭaka catalogues, as well as an onomasticon of places and persons from Buddhist sources, they help with the disambiguation and geospatial referencing of names and dates.

Feature:

Authority databases for Chinese dates, names of Buddhist persons and places, and Tripiṭaka catalogues.

Development and Application of Chinese Cross-Lingual Term Extraction for Buddhist Digital Archives

This is a project of the National Digital Archives Program of the National Science Council, Taiwan. It aims to support the construction of a research platform and environment in which Buddhist scholars can conveniently work on subject-specific knowledge structures.

Term extraction is an important foundation for building such a platform. Term extraction and a cross-lingual thesaurus for the Tripiṭaka in its different versions and languages can also be applied to reference compilation, manuscript studies and document verification. Statistical questions can also be derived from the results.

Feature:

Primary research that combines information technology with Buddhist Studies data, drawing on information retrieval and extraction techniques, statistical analysis, document classification and clustering, data mining, etc.

Content:

  • Indices and Algorithms
  • Cross-Language Corpora
  • CBETA Statistics
  • Concordance Search

Integrating Digital Archives and Information Technologies for Building a Digital Research Environment

This multifaceted project is divided into three sub-projects, with the main objective of using new digital techniques to build a “Digital Research Environment for Humanistic Studies”. With humanities information at the core of the project, each sub-project has its own theme and goal but also supports the development of the others. Together they form a high-quality virtual archive and research platform.

About:

Following the introduction of Buddhism to China, a vast body of Buddhist texts accumulated over time. Over an extended period of translation, the language itself evolved. These texts not only became valuable data for research on the complexity of Buddhist thought and the historical development of Buddhism, but also raised a great number of textual questions. Traditional philological work often requires extensive textual comparison, which is labor-intensive and time-consuming. To help scholars overcome these difficulties and engage in larger-scale studies, we designed an integrated digital research platform that will consist of a compilation of existing scriptural materials and various digital tools. We plan to equip the platform with the following features within two years:

  • A complete collection of digitized scriptures for accessible and comparative research.
  • A complete collection of various historical Chinese catalogues of Buddhist texts to provide crucial background information.
  • A dictionary of characters, names and locations.
  • Statistical information that facilitates analysis and comparison.
  • Tools to analyze various translation errors that have accumulated over centuries, including incorrect characters, missing words, etc.
