Sciweavers

» Self-Supervised Chinese Word Segmentation
IJCNLP
2004
Springer
15 years 5 months ago
The Use of SVM for Chinese New Word Identification
We present a study of new word identification (NWI) to improve the performance of a Chinese word segmenter. In this paper the distribution and types of new words are discussed emp...
Hongqiao Li, Changning Huang, Jianfeng Gao, Xiaozh...
COLING
2002
14 years 11 months ago
Applying an NVEF Word-Pair Identifier to the Chinese Syllable-to-Word Conversion Problem
Syllable-to-word (STW) conversion is important in Chinese phonetic input methods and speech recognition. There are two major problems in the STW conversion: (1) resolving the ambi...
Jia-Lin Tsai, Wen-Lian Hsu
IJCNLP
2005
Springer
15 years 5 months ago
A Lexicon-Constrained Character Model for Chinese Morphological Analysis
This paper proposes a lexicon-constrained character model that combines both word and character features to solve complicated issues in Chinese morphological analysis. A ...
Yao Meng, Hao Yu, Fumihito Nishino
TREC
2000
15 years 1 months ago
English-Chinese Cross-Language IR Using Bilingual Dictionaries
This report describes the English-Chinese cross-language retrieval experiments at Berkeley for the TREC-9 Cross-Language Information Retrieval track. We present a simple and effective ...
Aitao Chen, Hailing Jiang, Fredric C. Gey
IRAL
2000
ACM
15 years 4 months ago
On the use of words and n-grams for Chinese information retrieval
In the processing of Chinese documents and queries in information retrieval (IR), one has to identify the units that are used as indexes. Words and n-grams have been used as inde...
Jian-Yun Nie, Jianfeng Gao, Jian Zhang, Ming Zhou