Sciweavers

735 search results - page 76 / 147
» Corpora and data preparation
ICGI
1994
Springer
15 years 2 months ago
Inducing Probabilistic Grammars by Bayesian Model Merging
We describe a framework for inducing probabilistic grammars from corpora of positive samples. First, samples are incorporated by adding ad-hoc rules to a working grammar; subseque...
Andreas Stolcke, Stephen M. Omohundro
AIMSA
2004
Springer
15 years 1 month ago
Advances in Profile Assisted Voicemail Management
Abstract. Spoken audio is an important source of information available to knowledge extraction and management systems. Organization of spoken messages by priority and content can f...
Konstantinos Koumpis
INFORSID
2008
14 years 11 months ago
Processus global d'acquisition et de gestion des sigles
This paper deals with an acronym/definition extraction approach from textual data (corpora) and the disambiguation of these definitions (or expansions). Both steps of our global pr...
Mathieu Roche, Violaine Prince
ACL
2003
14 years 11 months ago
A Word-Order Database for Testing Computational Models of Language Acquisition
An investment of effort over the last two years has begun to produce a wealth of data concerning computational psycholinguistic models of syntax acquisition. The data is generated...
William Gregory Sakas
PVLDB
2010
14 years 8 months ago
Towards The Web of Concepts: Extracting Concepts from Large Datasets
Concepts are sequences of words that represent real or imaginary entities or ideas that users are interested in. As a first step towards building a web of concepts that will form...
Aditya G. Parameswaran, Hector Garcia-Molina, Anan...