Development and Application of Chinese Cross-Lingual Term Extraction for Buddhist Digital Archives

This is a project of the National Science Council's National Digital Archives Program, Taiwan. It aims to support the construction of a research platform and environment in which Buddhist scholars can conveniently work on subject-specific knowledge structures.

Term extraction is an important foundation for building such a platform. Term extraction and a cross-lingual thesaurus for the Tripiṭaka in its different versions and languages can also be applied to reference construction, manuscript studies and document verification. They also make it possible to pose statistical questions about the texts.

Feature:

A preliminary study that combines information technology with Buddhist studies data, applying information retrieval and extraction techniques, statistical analysis, document classification and clustering, data mining, and related methods.

Content:

  • Indices and Algorithms
  • Cross-Language Corpora
  • CBETA Statistics
  • Concordance Search