Sciweavers

ACL
2012
11 years 6 months ago
Incremental Joint Approach to Word Segmentation, POS Tagging, and Dependency Parsing in Chinese
We propose the first joint model for word segmentation, POS tagging, and dependency parsing for Chinese. Based on an extension of the incremental joint model for POS tagging and ...
Jun Hatori, Takuya Matsuzaki, Yusuke Miyao, Jun-ic...
ACL
2012
11 years 6 months ago
Enhancing Statistical Machine Translation with Character Alignment
The dominant practice of statistical machine translation (SMT) uses the same Chinese word segmentation specification in both alignment and translation rule induction steps in buil...
Ning Xi, Guangchao Tang, Xinyu Dai, Shujian Huang,...
ACL
2012
11 years 6 months ago
Fast Online Training with Frequency-Adaptive Learning Rates for Chinese Word Segmentation and New Word Detection
We present a joint model for Chinese word segmentation and new word detection. We present high dimensional new features, including word-based features and enriched edge (label-tra...
Xu Sun, Houfeng Wang, Wenjie Li
ACL
2012
11 years 6 months ago
Using Rejuvenation to Improve Particle Filtering for Bayesian Word Segmentation
We present a novel extension to a recently proposed incremental learning algorithm for the word segmentation problem originally introduced in Goldwater (2006). By adding rejuvenat...
Benjamin Börschinger, Mark Johnson
ICDAR
2011
IEEE
12 years 3 months ago
A New Fourier-Moments Based Video Word and Character Extraction Method for Recognition
— This paper presents a new method based on Fourier and moments features to extract words and characters from a video text line in any direction for recognition. Unlike existing ...
Deepak Rajendran, Palaiahnakote Shivakumara, Bolan...
IJDAR
2011
223views more  IJDAR 2011»
12 years 7 months ago
ICDAR2009 handwriting segmentation contest
The Handwriting Segmentation Contest was organized in the context of ICDAR2009 conference in order to record recent advances in off-line handwriting segmentation. This paper descr...
Basilios Gatos, Nikolaos Stamatopoulos, Georgios L...
COLING
2010
12 years 11 months ago
Unsupervised phonemic Chinese word segmentation using Adaptor Grammars
Adaptor grammars are a framework for expressing and performing inference over a variety of non-parametric linguistic models. These models currently provide state-of-the-art perfor...
Mark Johnson, Katherine Demuth
ACL
2009
13 years 2 months ago
Bayesian Unsupervised Word Segmentation with Nested Pitman-Yor Language Modeling
In this paper, we propose a new Bayesian model for fully unsupervised word segmentation and an efficient blocked Gibbs sampler combined with dynamic programming for inference. Our...
Daichi Mochihashi, Takeshi Yamada, Naonori Ueda
NAACL
2010
13 years 2 months ago
Urdu Word Segmentation
Word Segmentation is the foremost obligatory task in almost all the NLP applications where the initial phase requires tokenization of input into words. Urdu is amongst the Asian l...
Nadir Durrani, Sarmad Hussain
NAACL
2010
13 years 2 months ago
Is Arabic Part of Speech Tagging Feasible Without Word Segmentation?
In this paper, we compare two novel methods for part of speech tagging of Arabic without the use of gold standard word segmentation but with the full POS tagset of the Penn Arabic...
Emad Mohamed, Sandra Kübler