Sciweavers

9 search results - page 1 / 2
» Fully Unsupervised Word Segmentation with BVE and MDL
Sort
View
EMNLP
2010
13 years 2 months ago
An Efficient Algorithm for Unsupervised Word Segmentation with Branching Entropy and MDL
This paper proposes a fast and simple unsupervised word segmentation algorithm that utilizes the local predictability of adjacent character sequences, while searching for a leaste...
Valentin Zhikov, Hiroya Takamura, Manabu Okumura
ICPR
2000
IEEE
13 years 9 months ago
Unsupervised Segmentation of Poisson Data
This paper describes a new approach to the analysis of Poisson point processes, in time (1D) or space (2D), which is based on the minimum description length (MDL) framework. Speci...
Robert D. Nowak, Mário A. T. Figueiredo
ACL
2009
13 years 2 months ago
Bayesian Unsupervised Word Segmentation with Nested Pitman-Yor Language Modeling
In this paper, we propose a new Bayesian model for fully unsupervised word segmentation and an efficient blocked Gibbs sampler combined with dynamic programming for inference. Our...
Daichi Mochihashi, Takeshi Yamada, Naonori Ueda
NLPRS
2001
Springer
13 years 9 months ago
A Hierarchical EM Approach to Word Segmentation
We propose a simple two-level hierarchical probability model for unsupervised word segmentation. By treating words as strings composed of morphemes/phonemes which are themselves c...
Fuchun Peng, Dale Schuurmans