Sciweavers

168 search results - page 19 / 34
» Chinese Segmentation Disambiguation
Sort
View
LREC
2010
188views Education» more  LREC 2010»
14 years 11 months ago
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method
We investigate the impact of input data scale in corpus-based learning using a study style of Zipf's law. In our research, Chinese word segmentation is chosen as the study ca...
Hai Zhao, Yan Song, Chunyu Kit
TALIP
2002
108views more  TALIP 2002»
14 years 9 months ago
Toward a unified approach to statistical language modeling for Chinese
This paper presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) t...
Jianfeng Gao, Joshua Goodman, Mingjing Li, Kai-Fu ...
IALP
2009
14 years 7 months ago
Two-Pass Deterministic Dependency Parsing for Long Chinese Sentences
This paper proposes a two-pass parsing approach to improve the performance of deterministic dependency parser for long Chinese sentences. In the first pass, the sentence is divided...
Ping Jian, Chengqing Zong
IRAL
2000
ACM
15 years 2 months ago
On the use of words and n-grams for Chinese information retrieval
: In the processing of Chinese documents and queries in information retrieval (IR), one has to identify the units that are used as indexes. Words and n-grams have been used as inde...
Jian-Yun Nie, Jianfeng Gao, Jian Zhang, Ming Zhou
ACL
2007
14 years 11 months ago
Automatic Discovery of Named Entity Variants: Grammar-driven Approaches to Non-Alphabetical Transliterations
Identification of transliterated names is a particularly difficult task of Named Entity Recognition (NER), especially in the Chinese context. Of all possible variations of trans...
Chu-Ren Huang, Petr Simon, Shu-Kai Hsieh