Sciweavers

70 search results - page 1 / 14
IDA
2001
Springer
15 years 5 months ago
Self-Supervised Chinese Word Segmentation
We propose a new unsupervised training method for acquiring probability models that accurately segment Chinese character sequences into words. By constructing a core lexi...
Fuchun Peng, Dale Schuurmans
SIGIR
2002
ACM
15 years 9 days ago
Using self-supervised word segmentation in Chinese information retrieval
We propose a self-supervised word-segmentation technique for Chinese information retrieval. This method combines the advantages of traditional dictionary-based approaches with cha...
Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick ...
TREC
2000
15 years 2 months ago
English-Chinese Cross-Language IR Using Bilingual Dictionaries
This report describes the English-Chinese cross-language retrieval experiments at Berkeley for the TREC-9 Cross-Language Information Retrieval track. We present a simple and effective ...
Aitao Chen, Hailing Jiang, Fredric C. Gey
IRAL
2000
ACM
15 years 5 months ago
On the use of words and n-grams for Chinese information retrieval
In the processing of Chinese documents and queries in information retrieval (IR), one has to identify the units that are used as indexes. Words and n-grams have been used as inde...
Jian-Yun Nie, Jianfeng Gao, Jian Zhang, Ming Zhou
COLING
2002
15 years 15 days ago
Investigating the Relationship between Word Segmentation Performance and Retrieval Performance in Chinese IR
It is commonly believed that word segmentation accuracy is monotonically related to retrieval performance in Chinese information retrieval. In this paper we show that, for Chinese...
Fuchun Peng, Xiangji Huang, Dale Schuurmans, Nick ...