Sciweavers

» Self-Supervised Chinese Word Segmentation
IJCNLP
2004
Springer
15 years 7 months ago
The Use of SVM for Chinese New Word Identification
We present a study of new word identification (NWI) to improve the performance of a Chinese word segmenter. In this paper the distribution and types of new words are discussed emp...
Hongqiao Li, Changning Huang, Jianfeng Gao, Xiaozh...
COLING
2002
15 years 1 month ago
Applying an NVEF Word-Pair Identifier to the Chinese Syllable-to-Word Conversion Problem
Syllable-to-word (STW) conversion is important in Chinese phonetic input methods and speech recognition. There are two major problems in STW conversion: (1) resolving the ambi...
Jia-Lin Tsai, Wen-Lian Hsu
IJCNLP
2005
Springer
15 years 7 months ago
A Lexicon-Constrained Character Model for Chinese Morphological Analysis
This paper proposes a lexicon-constrained character model that combines both word and character features to solve complicated issues in Chinese morphological analysis. A ...
Yao Meng, Hao Yu, Fumihito Nishino
TREC
2000
15 years 3 months ago
English-Chinese Cross-Language IR Using Bilingual Dictionaries
This report describes the English-Chinese cross-language retrieval experiments at Berkeley for the TREC-9 Cross-Language Information Retrieval track. We present a simple and effective ...
Aitao Chen, Hailing Jiang, Fredric C. Gey
IRAL
2000
ACM
15 years 6 months ago
On the use of words and n-grams for Chinese information retrieval
In the processing of Chinese documents and queries in information retrieval (IR), one has to identify the units that are used as indexes. Words and n-grams have been used as inde...
Jian-Yun Nie, Jianfeng Gao, Jian Zhang, Ming Zhou