Information-Theoretic Segmentation of Natural Language

8 years 14 days ago

Download ceur-ws.org

Abstract. We present computational experiments on language segmentation using a general information-theoretic cognitive model. We present a method which uses the statistical regularities of language to segment a continuous stream of symbols into “meaningful units” at a range of levels. Given a string of symbols—in the present approach, textual representations of phonemes—we attempt to ﬁnd the syllables such as grea and sy (in the word greasy); words such as in, greasy, wash, and water; and phrases such as in greasy wash water. The approach is entirely information-theoretic, and requires no knowledge of the units themselves; it is thus assumed to require only general cognitive abilities, and has previously been applied to music. We tested our approach on two spoken language corpora, and we discuss our results in the context of learning as a statistical processes.

Sascha S. Griffiths, Mariano Mora McGinity, Jamie

Real-time Traffic

AIC 2015 |

claim paper

» Nonparametric Bayesian Segmentation of Japanese Noun Phrases

» Enhancing Chinese Word Segmentation Using Unlabeled Data

» Segmentation Standard for Chinese Natural Language Processing

» Bilingually Motivated DomainAdapted Word Segmentation for Statistical Machine Translation

» TBLImproved NonDeterministic Segmentation and POS Tagging for a Chinese Parser

» Exploiting Event Semantics to Parse the Rhetorical Structure of Natural Language Text

» Linear Text Segmentation using a Dynamic Programming Algorithm

» On the Use of Web Resources and Natural Language Processing Techniques to Improve Automati...

» A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Proce...

Post Info
More Details (n/a)

Added	14 Apr 2016
Updated	14 Apr 2016
Type	Journal
Year	2015
Where	AIC
Authors	Sascha S. Griffiths, Mariano Mora McGinity, Jamie Forth, Matthew Purver, Geraint A. Wiggins

Comments (0)

Sciweavers

Information-Theoretic Segmentation of Natural Language

AIC 2015 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers