Sciweavers

49 search results - page 7 / 10
» Synthetic Word Parsing Improves Chinese Word Segmentation
COLING
2008
14 years 12 months ago
Chinese Dependency Parsing with Large Scale Automatically Constructed Case Structures
This paper proposes an approach using large scale case structures, which are automatically constructed from both a small tagged corpus and a large raw corpus, to improve Chinese d...
Kun Yu, Daisuke Kawahara, Sadao Kurohashi
ICDAR
2009
IEEE
15 years 5 months ago
Integrating Language Model in Handwritten Chinese Text Recognition
This paper describes a system for handwritten Chinese text recognition that integrates a language model. On a text line image, the system generates character segmentation and word segme...
Qiu-Feng Wang, Fei Yin, Cheng-Lin Liu
LREC
2010
14 years 12 months ago
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method
We investigate the impact of input data scale in corpus-based learning using a study style of Zipf's law. In our research, Chinese word segmentation is chosen as the study ca...
Hai Zhao, Yan Song, Chunyu Kit
TALIP
2002
14 years 10 months ago
Toward a unified approach to statistical language modeling for Chinese
This paper presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) t...
Jianfeng Gao, Joshua Goodman, Mingjing Li, Kai-Fu ...
ACL
2010
14 years 8 months ago
Bitext Dependency Parsing with Bilingual Subtree Constraints
This paper proposes a dependency parsing method that uses bilingual constraints to improve the accuracy of parsing bilingual texts (bitexts). In our method, a target-side tree frag...
Wenliang Chen, Jun'ichi Kazama, Kentaro Torisawa