Sciweavers

554 search results - page 29 / 111
» Stylistic text segmentation
ACL
2006
15 years 1 month ago
Unsupervised Segmentation of Chinese Text by Use of Branching Entropy
We propose an unsupervised segmentation method based on an assumption about language data: that the increasing point of entropy of successive characters is the location of a word ...
Zhihui Jin, Kumiko Tanaka-Ishii
LREC
2010
14 years 6 months ago
Language Identification of Short Text Segments with N-gram Models
There are many accurate methods for language identification of long text samples, but identification of very short strings still presents a challenge. This paper studies a languag...
Tommi Vatanen, Jaakko J. Väyrynen, Sami Virpi...
COLING
1996
15 years 1 month ago
The Automatic Extraction of Open Compounds from Text Corpora
This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages, such as Thai, Japanese and Korean, that exhibit un...
Virach Sornlertlamvanich, Hozumi Tanaka
ICPR
2002
IEEE
16 years 26 days ago
Text Segmentation and Recognition in Complex Background Based on Markov Random Field
Datong Chen, Jean-Marc Odobez, Hervé Bourla...