Sciweavers

13 search results - page 1 / 3
» Unknown Word Extraction for Chinese Documents
Sort
View
COLING
2002
13 years 4 months ago
Unknown Word Extraction for Chinese Documents
There is no blank to mark word boundaries in Chinese text. As a result, identifying words is difficult, because of segmentation ambiguities and occurrences of unknown words. Conve...
Keh-Jiann Chen, Wei-Yun Ma
WWW
2007
ACM
14 years 5 months ago
A search-based Chinese word segmentation method
In this paper, we propose a novel Chinese word segmentation method which leverages the huge deposit of Web documents and search technology. It simultaneously solves ambiguous phra...
Xin-Jing Wang, Yong Qin, Wen Liu
ICMCS
2007
IEEE
130views Multimedia» more  ICMCS 2007»
13 years 11 months ago
Word Topical Mixture Models for Extractive Spoken Document Summarization
This paper considers extractive summarization of Chinese spoken documents. In contrast to conventional approaches, we attempt to deal with the extractive summarization problem und...
Berlin Chen, Yi-Ting Chen
APWEB
2008
Springer
13 years 6 months ago
A Study on Multi-word Extraction from Chinese Documents
As a sequence of two or more consecutive individual words inherent with contextual semantics of individual words, multi-word attracts much attention from statistical linguistics an...
Wen Zhang, Taketoshi Yoshida, Xijin Tang
ACL
2011
12 years 8 months ago
Rare Word Translation Extraction from Aligned Comparable Documents
We present a first known result of high precision rare word bilingual extraction from comparable corpora, using aligned comparable documents and supervised classification. We in...
Emmanuel Prochasson, Pascale Fung