Development and Application of Chinese Cross-Lingual Term Extraction for Buddhist Digital Archives

This is a project of the National Science Council's National Digital Archives Program, Taiwan. It aims to support the construction of a research platform and environment in which Buddhist scholars can conveniently work on subject-specific knowledge structures.

Term extraction is an important foundation for building such a platform. Term extraction and a cross-lingual thesaurus for the Tripiṭaka in its different versions and languages can also be applied to reference construction, manuscript studies, and document verification. They also make it possible to pose statistical questions about the texts.

URL: http://dev.dila.edu.tw/BuddhistTermExtract

Features:

A preliminary study that combines information technology with Buddhist studies data, drawing on information retrieval and extraction techniques, statistical analysis, document classification and clustering, data mining, etc.

Contents:

Indices and Algorithms
Cross-Language Corpora
CBETA Statistics
Concordance Search
