Sciweavers

13 search results - page 1 / 3
» Unknown Word Extraction for Chinese Documents
Sort
View
97
Voted
COLING
2002
14 years 11 months ago
Unknown Word Extraction for Chinese Documents
There is no blank to mark word boundaries in Chinese text. As a result, identifying words is difficult, because of segmentation ambiguities and occurrences of unknown words. Conve...
Keh-Jiann Chen, Wei-Yun Ma
WWW
2007
ACM
16 years 10 days ago
A search-based Chinese word segmentation method
In this paper, we propose a novel Chinese word segmentation method which leverages the huge deposit of Web documents and search technology. It simultaneously solves ambiguous phra...
Xin-Jing Wang, Yong Qin, Wen Liu
104
Voted
ICMCS
2007
IEEE
130views Multimedia» more  ICMCS 2007»
15 years 6 months ago
Word Topical Mixture Models for Extractive Spoken Document Summarization
This paper considers extractive summarization of Chinese spoken documents. In contrast to conventional approaches, we attempt to deal with the extractive summarization problem und...
Berlin Chen, Yi-Ting Chen
96
Voted
APWEB
2008
Springer
15 years 24 days ago
A Study on Multi-word Extraction from Chinese Documents
As a sequence of two or more consecutive individual words inherent with contextual semantics of individual words, multi-word attracts much attention from statistical linguistics an...
Wen Zhang, Taketoshi Yoshida, Xijin Tang
110
Voted
ACL
2011
14 years 3 months ago
Rare Word Translation Extraction from Aligned Comparable Documents
We present a first known result of high precision rare word bilingual extraction from comparable corpora, using aligned comparable documents and supervised classification. We in...
Emmanuel Prochasson, Pascale Fung