Development and Application of Chinese Cross-Lingual Term Extraction for Buddhist Digital Archives
Term extraction is an important foundation towards building such a platform. Term extraction and cross-lingual thesaurus for the Tripiṭaka in different versions and languages can also be applied to reference construction, manuscripts studies and document proving. Statistical questions can also be derived.
Feature:
A primary research to combine Information Technology with Buddhist Studies data on Information Retrieval and Extraction techniques, Statistical Analysis, Document Classification and Clustering, Data Mining, etc.
About:
This is a project of the National Science Council`s National Digital Archives Program, Taiwan. It aims to support the construction of a research platform and environment that is convenient for Buddhist scholars to work on subject-specific knowledge structures.
Term extraction is an important foundation towards building such a platform. Term extraction and cross-lingual thesaurus for the Tripiṭaka in different versions and languages can also be applied to reference construction, manuscripts studies and document proving. Statistical questions can also be derived.
Content:
- Indices and Algorithms
- Cross-Language Corpora
- CBETA Statistics
- Concordance Search